Text to Speech Software

View 198 business solutions

Browse free open source Text to Speech software and projects below. Use the toggles on the left to filter open source Text to Speech software by OS, license, language, programming language, and project status.

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 1
    Capture2Text

    Capture2Text

    Quickly OCR part of the screen and save resulting text to clipboard

    Capture2Text enables users to quickly OCR a portion of the screen using a keyboard shortcut. The resulting text will be saved to the clipboard by default. Supports 90+ languages including Chinese, English, French, German, Japanese, Korean, Russian, and Spanish. Portable and does not require installation. See http://capture2text.sourceforge.net for details.
    Leader badge
    Downloads: 2,992 This Week
    Last Update:
    See Project
  • 2
    eSpeak: speech synthesis
    Text to Speech engine for English and many other languages. Compact size with clear but artificial pronunciation. Available as a command-line program with many options, a shared library for Linux, and a Windows SAPI5 version.
    Leader badge
    Downloads: 2,455 This Week
    Last Update:
    See Project
  • 3
    Piper TTS

    Piper TTS

    A fast, local neural text to speech system

    Piper is a fast, local neural text-to-speech (TTS) system developed by the Rhasspy team. Optimized for devices like the Raspberry Pi 4, Piper enables high-quality speech synthesis without relying on cloud services, making it ideal for privacy-conscious applications. It utilizes ONNX models trained with VITS to deliver natural-sounding voices across various languages and accents. Piper is particularly suited for offline voice assistants and embedded systems.
    Downloads: 148 This Week
    Last Update:
    See Project
  • 4
    PNotes
    PNotes is light-weight, flexible, skinnable manager of virtual notes on your desktop. It supports multiple languages, individual note's settings, transparency and scheduling. Absolutely portable as well - no traces in registry. PNotes.NET edition requires .NET framework 4 Client Profile
    Leader badge
    Downloads: 380 This Week
    Last Update:
    See Project
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 5
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Leader badge
    Downloads: 216 This Week
    Last Update:
    See Project
  • 6
    TTS Voice Wizard

    TTS Voice Wizard

    Speech to Text to Speech, sends text as OSC messages

    Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) Use TTS Voice Wizard's accessibility features to improve your VRChat experience (it works outside of VRChat too!) You can convert your Speech-to-Text and back to Speech through various Speech Recognition and Text-to-Speech methods. You can send what you say as OSC messages to VRChat to be displayed on your avatar using KillFrenzyAvatarText or VRChats Chatbox. The app can translate your speech from one language to over 20 other support languages. There are 100+ different voices with various customization options so you can pick a voice that best suits you. Display the current song you are listening to on Spotify or via your browser. Display tracker and controller battery life in conjunction with XSOverlay. Use in conjunction with HRtoVRChat_OSC to enable you to display your heartrate in VRChat's Chatbox.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 7
    Simple TTS Reader

    Simple TTS Reader

    A small clipboard reader

    Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.
    Leader badge
    Downloads: 88 This Week
    Last Update:
    See Project
  • 8
    Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.
    Leader badge
    Downloads: 311 This Week
    Last Update:
    See Project
  • 9
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 9 This Week
    Last Update:
    See Project
  • The All-in-One Commerce Platform for Businesses - Shopify Icon
    The All-in-One Commerce Platform for Businesses - Shopify

    Shopify offers plans for anyone that wants to sell products online and build an ecommerce store, small to mid-sized businesses as well as enterprise

    Shopify is a leading all-in-one commerce platform that enables businesses to start, build, and grow their online and physical stores. It offers tools to create customized websites, manage inventory, process payments, and sell across multiple channels including online, in-person, wholesale, and global markets. The platform includes integrated marketing tools, analytics, and customer engagement features to help merchants reach and retain customers. Shopify supports thousands of third-party apps and offers developer-friendly APIs for custom solutions. With world-class checkout technology, Shopify powers over 150 million high-intent shoppers worldwide. Its reliable, scalable infrastructure ensures fast performance and seamless operations at any business size.
    Learn More
  • 10
    MeloTTS

    MeloTTS

    High-quality multi-lingual text-to-speech library by MyShell.ai

    MeloTTS is an open-source text-to-speech (TTS) system that generates natural-sounding speech from text input. It utilizes advanced machine-learning models to produce high-quality audio outputs.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Coqui STT

    Coqui STT

    The deep learning toolkit for speech-to-text

    Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems in post. With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. With text-to-speech, experience the immediacy of script-to-performance. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs. With Coqui text-to-speech, production times go from months to minutes.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    NVIDIA NeMo

    NVIDIA NeMo

    Toolkit for conversational AI

    NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. NeMo uses PyTorch Lightning for easy and performant multi-GPU/multi-node mixed-precision training. Supported models: Jasper, QuartzNet, CitriNet, Conformer-CTC, Conformer-Transducer, Squeezeformer-CTC, Squeezeformer-Transducer, ContextNet, LSTM-Transducer (RNNT), LSTM-CTC. NGC collection of pre-trained speech processing models.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Dragonfire

    Dragonfire

    The open-source virtual assistant for Ubuntu based Linux distributions

    Dragonfire is the open-source virtual assistant project for Ubuntu-based Linux distributions. Her main objective is to serve as a command and control interface to the helmet user. So that you will be able to give orders just by using your voice commands and your eye movements. That makes the helmet handsfree. We are planning to ship Dragonfire as a preinstalled software package on DragonOS Linux Distribution. DragonOS will be a Linux distribution specially designed for the helmet. It will contain various software packages for controlling the helmet. It will be the first of its kind. Dragonfire uses Mozilla DeepSpeech to understand your voice commands and Festival Speech Synthesis System to handle text-to-speech tasks.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    Kitten TTS

    Kitten TTS

    State-of-the-art TTS model under 25MB

    KittenTTS is an open-source, ultra-lightweight, and high-quality text-to-speech model featuring just 15 million parameters and a binary size under 25 MB. It is designed for real-time CPU-based deployment across diverse platforms. Ultra-lightweight, model size less than 25MB. CPU-optimized, runs without GPU on any device. High-quality voices, several premium voice options available. Fast inference, optimized for real-time speech synthesis.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    ChatTTS

    ChatTTS

    A generative speech model for daily dialogue

    ChatTTS is an open-source conversational text-to-speech model optimized for dialogue, developed by 2Noise. Trained on 100,000+ hours of English and Chinese conversation data, it excels at generating expressive prosody—pauses, interjections, laughter—for more natural-sounding speech synthesis in assistant and chatbot applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Chatterbox

    Chatterbox

    SoTA open-source TTS

    Chatterbox is Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs and is consistently preferred in side-by-side evaluations. Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out. Try it now on our Hugging Face Gradio app. If you like the model but need to scale or tune it for higher accuracy, check out our competitively priced TTS service (link). It delivers reliable performance with ultra-low latency of sub-200ms—ideal for production use in agents, applications, or interactive media.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    TTS

    TTS

    Deep learning for text to speech

    TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets underdataset_analysis. Demo server for model testing. Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). If you are only interested in synthesizing speech with the released TTS models, installing from PyPI is the easiest option.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    vinuxproject

    vinuxproject

    Vinux is an Ubuntu derived distribution for blind & visually impaired.

    Vinux supports software text to speech and Braille support from boot-up to shutdown. Users can use installation medium to install independently with no sighted assistance required. Vinux supports command line environment speech, Desktop environment speech and magnification features. Vinux comes with an accessible suite of software and has an excellent mailing list support group.
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 19
    QChartist

    QChartist

    Free and Open Source Technical Analysis Charting Software

    QChartist is a free and open source technical analysis charting software. Its purpose is to provide a complete set of tools to perform technical analysis on charts and data. It helps to make forecasts mainly for markets but can also be used for weather or any quantifiable data. The program is flexible and its functionalities can be easily extended. You can draw geometrical shapes on your charts or plot programmable indicators from your data. It is also possible to filter or merge data. I got a little inspired from MT4 allowing a fairly easy portability of programmed indicators from MT4 to QChartist. It is now faster and much more professional thanks to the use of a C++ layer (used mostly for calculations) over the standard Basic layer (used mostly for the GUI interface). You can use astro indicators and functions from a library for astronomical calculations. You can get real time quotations thanks to Yahoo Finance, Alpha Vantage, Tiingo, Stooq, Finnhub and Twelvedata data sources.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 20
    Text2Speech is a small and easy to use Text To Speech (TTS) application written in C#. It uses the Microsoft .NET Framework 2.0 to run.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    This is a development package for IBM Text To Speech (TTS). It is intended to be used to build applications when a licensed ibmtts is not available. Only the ECI ABIs are provided. There is no TTS runtime code provided.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22
    A modular, extensible Hebrew text-to-speech engine tuned for Standard Israeli Hebrew, and associated tools.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    JAVT - Just Another Voice Transformer

    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 24

    Russian Text-to-speech programs

    читание, чтение, говорение

    For Windows (on Linux trought Wine can work) 3 russian text-to-speech programs (Chitanie, Chtenie and Govorenie). If you want donate. paypal.me/alkbab Читание, Чтение, Говорение есть программы пробующие преобразовать русский текст в русскую речь . Для Windows. На Linux через Wine... Кто хочет может пожертвовать paypal.me/alkbab
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    EasyTTS

    EasyTTS

    Text to Speech Utility

    EasyTTS is a text to speech app for 64 bit Windows that offers online and offline text-to-speech, with settings for how fast the voice is. It supports languages other than English but only if you are connected to the Internet. These are Spanish, Portuguese, Russian, French, and Mandarin (?) Chinese.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next

Guide to Open Source Text to Speech Software

Open source text to speech software is a type ofprogram that can read written text aloud in different languages and accents. It utilizes Artificial Intelligence (AI) technology, Natural Language Processing (NLP), and voice synthesis algorithms to generate synthesized audio output from raw typewritten or digital text. This type of software is particularly useful for those with visual impairments, as it makes reading onscreen easier by providing an audible version of the content rather than relying solely on visuals. Additionally, open source text to speech software can be used as a tool for people who need assistance mastering a language as they are able to listen to the pronunciation of words and phrases within the context of their conversations or studies.

Open source TTS differs from commercial solutions in that its code is made freely available to anyone who wishes to use or modify it, making alterations easier and quicker compared to closed-off coding programs. As such, developers have more control over how their project will look and run, which helps them create specialized applications tailored specifically for their needs without having to pay exorbitant licensing fees associated with some proprietary technologies. Furthermore, because open source projects are publicly available under liberal licenses like GPL (GNU General Public License), many talented developers contribute time and resources into perfecting existing pieces of code, thus allowing everyone access top-notch tools backed by strong support communities without worrying about cost constraints.

All in all, open source text to speech technology has empowered developers around the world by giving them greater control over how they create their applications than ever before through its freely available resources across platforms such as Windows or MacOS as well as Linux distributions such as Ubuntu/Debian/Fedora etc.. Thanks its accessibility and flexibility, users can manipulate software according to specific needs while taking advantage amazing contributions from its user base.

Features of Open Source Text to Speech Software

  • Text-to-Speech Synthesis: This feature allows users to convert a written text into an audio version, which is produced by a computerized voice. The text can include articles, emails, news stories, and other documents.
  • Language Options: Open source text-to-speech software often provides multiple language options, making it suitable for international applications. This allows users to generate audio files in any language they choose.
  • Customizable Voices: Some open source text-to-speech programs offer customizable voices, allowing the user to adjust the tone and tempo of the synthetic voice output to create more natural sounding speech patterns.
  • Volume Control: Open source text-to-speech software usually offers volume control options so that users can adjust how loud or quiet their audio output will be.
  • File Formats: Most open source programs allow for the creation of both MP3 and WAV files for easy playback on any type of device or platform you may use.
  • Editing Tools: Many open source text-to-speech programs also come with editing tools inclusive of creating sound effects and modifying frequency ranges to customize your audio even further.

What Types of Open Source Text to Speech Software Are There?

  • Artificial intelligence-based Text to Speech (AI TTS): AI TTS is a category of open source text to speech software that uses artificial intelligence algorithms to analyze input data and generate synthetic voice output. Artificial intelligence technology can be used to create synthetic voices that have natural sounding intonations, accents, and expressions.
  • Standard-based Text to Speech (SSTS): SSTS is an open source text to speech system developed according to an industry standard such as the SSML specification maintained by the World Wide Web Consortium (W3C). This type of text to speech software adheres strictly to the standards and may provide consistent results across different devices or platforms.
  • Reusable Component Text To Speech (RC TTS): RC TTS is an open source text-to-speech application that uses standardized components or modules which can be reused in various applications or projects. RCTTS provides flexibility and customization options when it comes to integrating a text-to-speech solution into different projects.
  • Machine Learning Based Text To Speech (ML TTS): ML TTS is an open source application based on machine learning technology which analyses input data and generates appropriate outputs for a given task. This type of text to speech software often combines different elements like natural language understanding (NLU), deep learning, predictive analytics etc., and relies heavily on statistical models generated from real world data sets.

Open Source Text to Speech Software Benefits

  1. Cost-Effective: Open source text to speech software eliminates the need to purchase expensive proprietary solutions and helps organizations reduce costs. Many open source solutions are free, while others have reasonably priced commercial versions available. This makes them ideal for startups, small businesses, and individuals with limited budgets.
  2. Flexible Customization Options: Open source text to speech tools often provide a wide range of customization options that enable users to adapt the software so it better meets their specific needs. This flexibility can be useful in adapting content for different markets or target audiences.
  3. Improved Accessibility: By converting language into audio output, open source text to speech technology can help improve access for those who are visually impaired or otherwise challenged when it comes to reading printed materials. It is also useful for those learning new languages who require audio feedback as they progress through lessons.
  4. Greater Efficiency: Open source text to speech solutions streamline processes by automating certain tasks (such as generating transcripts), freeing up staff time for more important work or creative pursuits. Additionally, multiple formats (audio files, videos) can be generated from one source document without manual effort or additional cost involved in production/editing process.
  5. Easy Deployment: Most open source text to speech tools have simple installation procedures and setup wizards that make them easy even for novice users to get started with quickly, making deployment fast and efficient across a variety of devices and platforms regardless of technical proficiency level or budget constraints.

Types of Users That Use Open Source Text to Speech Software

  • Students: Students may use open source text to speech software for class assignments such as transcribing audio recordings or reading aloud from documents. Additionally, people with disabilities or difficulty speaking can benefit from the tool to read aloud digital content and participate in classroom discussions.
  • Call Center Agents: Open source text-to-speech software can help improve customer service by providing customers with automated messages that sound natural and make them feel more comfortable when dealing with a company’s customer service department.
  • Writers and Editors: Open source text-to-speech software can be used during the writing/editing process to ensure clarity of the written word and make sure that the language is precise enough for professional work.
  • Business Professionals: Open source text-to-speech software is beneficial for business professionals who need to present presentations quickly without having to memorize long passages of spoken material. It also helps reduce mistakes by allowing business professionals to review their words before presentations are given.
  • Bloggers/Content Creators: Open source text-to-speech software can be used by bloggers and content creators looking for ways to add audio components into their blogs or other online content, thus making their posts more engaging for readers.
  • Developers: Developers may use open source text to speech software as an affordable optionfor creating apps that make use of synthesized speech, such as virtual assistants, interactive books, education apps, etc.

How Much Does Open Source Text to Speech Software Cost?

Open source text to speech software is usually free of cost. However, depending on the platform you choose to use, there may be associated costs for additional features or services related to the text-to-speech technology. For instance, some open source platforms may charge for developers’ tools and/or for cloud hosting and storage of your audio files. Additionally, some open source projects may require donations in order to continue development or provide support services. In most cases though, the cost of using an open source text to speech software should be minimal or non-existent — allowing you a great way to produce natural sounding voices at no cost.

What Does Open Source Text to Speech Software Integrate With?

Open source text to speech software can integrate with a variety of types of software in order to create an automated voice experience for users or machines. These types of software include customer service platforms, customer relationship management (CRM) systems, web browsers, word processors, and natural language processing tools. Additionally, open source text to speech software can be integrated into voice-enabled applications such as virtual assistant services and interactive response systems. By integrating open source text to speech with these other types ofprograms, developers are able to leverage the power of automated voices in order to make the user experience more natural and efficient.

Open Source Text to Speech Software Trends

  1. Increased Availability: Open source text to speech software is becoming increasingly available and accessible for users, with more options for customization and personalization.
  2. Enhanced Quality: The quality of open source text to speech software has improved over time, with better sounding voices and more natural sounding pronunciations.
  3. Increased Efficiency: Open source text to speech software is becoming more efficient, with shorter response times and higher accuracy rates.
  4. Expanded Platforms: More platforms are offering open source text to speech software, making it easier for users to access and use the technology.
  5. Improved Applications: Open source text to speech software is being applied in a wider range of contexts, such as education, customer service, and other commercial endeavors.
  6. Greater Customization: Users have access to more features that allow for greater customization of the generated speech, such as adjusting the speed, pausing between words, adding pauses, and changing the pitch of the voice.
  7. Extended Language Support: More language support is being offered for open source text to speech software, allowing users to generate speech in multiple languages.
  8. Widening Accessibility: Open source text to speech software is becoming more accessible for people with disabilities, with options such as voice-driven menus and touchscreen interfaces.

Getting Started With Open Source Text to Speech Software

Getting started with open source text to speech software is easy and can be done in just a few steps. First, make sure you have the necessary hardware, such as a computer or mobile device with a microphone and headset for audio output. Next, select an open source text to speech software of your choice, such as eSpeak, Festival Speech Synthesis System, or MaryTTS. Once you choose the desired open source software, it’s time to install it on your device. This step will vary depending on which software you chose - some may require you to install from the command line while others offer downloadable files that can be installed directly from your browser or via specific app stores. After installation is complete, launch the program and begin using it. You’ll want to familiarize yourself with how each program works in order to get the best results out of it. Consult user guides and tutorials if needed in order to understand its capabilities. Finally, test out different commands or write some sample scripts that you wish for the program to synthesize into audible output. With enough practice, soon you’ll become accustomed to using this type of technology and take advantage of all its potential applications.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.