C++ Speech Software

Browse free open source C++ Speech Software and projects below. Use the toggles on the left to filter open source C++ Speech Software by OS, license, language, programming language, and project status.

  • 1
    eSpeak: speech synthesis
    Text to Speech engine for English and many other languages. Compact size with clear but artificial pronunciation. Available as a command-line program with many options, a shared library for Linux, and a Windows SAPI5 version.
    Downloads: 2,332 This Week
  • 2
    eGuideDog free software for the blind
    eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
    Downloads: 281 This Week
  • 3
    RHVoice

    Free open source speech synthesizer for Russian and other languages

    RHVoice is a free and open-source multilingual speech synthesizer. Its developers hope to give more visually impaired people the ability to use a good free synthesis voice reading in their native language with their screen reader. We are especially interested in supporting those languages for which there are currently no good voices that could be used with a screen reader. The creator of RHVoice, Olga Yakovleva, is blind herself. Many of the contributors to the RHVoice project, both programmers and non-programmers, are blind or partially sighted.
    Downloads: 26 This Week
  • 4
    Mumble

    Low-latency, high quality voice chat for gamers

    Mumble is an open source, low-latency, high quality voice chat software primarily intended for use while gaming. It includes game linking, so voice from other players comes from the direction of their characters, and has echo cancellation so the sound from your loudspeakers won't be audible to other players.
    Downloads: 109 This Week
  • 5
    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and Google's TensorFlow to make the implementation easier. A pre-trained English model, along with other material needed for inference, can be downloaded from the DeepSpeech releases page by following the instructions in the usage docs.
    Downloads: 22 This Week
  • 6
    Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.
    Downloads: 458 This Week
  • 7
    MMDAgent is a toolkit for building voice interaction systems. Users can design their own dialog scenarios, 3D agents, and voices. This software is released under the Modified BSD license.
    Downloads: 92 This Week
  • 8
    Coqui STT

    The deep learning toolkit for speech-to-text

    Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. It can return multiple possible transcripts, each with an associated confidence score. Coqui also offers text-to-speech: experience the immediacy of script-to-performance, with production times going from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems in post. With Coqui, dubbing is a delight. Effortlessly clone the voice of your talent into another language and let the clone do the dub. Cast from a wide selection of high-quality, directable, emotive voices or clone a voice to suit your needs.
    Downloads: 1 This Week
  • 9
    The Open VXI VoiceXML interpreter is a portable open source library that interprets the VoiceXML dialog markup language. It is designed to serve as a reference for parties interested in understanding how VoiceXML markup might be executed.
    Downloads: 26 This Week
  • 10
    Audiobook Cutter is an easy-to-use tool which splits large speech MP3 files into smaller ones without re-encoding. The split points are determined by silent parts. The main purpose is to make audiobooks or podcasts more manageable in a user-friendly way.
    Downloads: 7 This Week
  • 11
    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages.

    Basic versions of SkryBot:
    1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download
    2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download
    More help: https://sourceforge.net/p/skrybotdomowy/wiki/

    Domain advanced versions (Polish Language):
    1. SkryBot Prawo - for judicial professionals.
    2. SkryBot Administracyjny - for civil and government administration.
    3. SkryBot Medycyna Rodzinna - for physicians.

    Professional version of SkryBot (commercial) offers you:
    1. Audio conversion and cutting sound files into smaller ones.
    2. Searching for words or phrases in sound files (recognized by SkryBot).
    3. Editing sound files and automatic cutting off long silence parts in audio file.
    Downloads: 6 This Week
  • 12
    Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc
    Downloads: 16 This Week
  • 13
    MP3 library manager, advanced ID3v1 and ID3v2 tagger, and player. Can organize a large MP3 library (over 40,000 songs). Includes speech synthesis and tag backup utilities, plus scripts to maintain and organize song files.
    Downloads: 15 This Week
  • 14

    Omilo - a text to speech application

    Omilo is a simple text to speech application

    Omilo is a simple text to speech application for Windows and Linux using Festival, Flite, Marytts and Piper voices.
    Downloads: 7 This Week
  • 15
    JuliusModels

    Open source speech models for Julius in English and other languages.

    Open source speech models for the Julius speech decoder. Its aim is to give a wider community of speech recognition enthusiasts access to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc.). All of the models are based on HTK modelling software and data sets freely available on the Internet.
    Downloads: 10 This Week
  • 16

    Sinsy

    HMM-based singing voice synthesis system

    Sinsy is an HMM-based singing voice synthesis system. This software is released under the Modified BSD license.
    Downloads: 4 This Week
  • 17
    TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly GUI for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.
    Downloads: 4 This Week
  • 18

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for both languages. Its acoustic engine is based on hts_engine and it uses a high-quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 2 This Week
  • 19
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. It is easy to use and efficient at extracting a large number of audio features simultaneously. It supports WAV and MP3 files and can be embedded in C++, Python or Matlab applications.
    Downloads: 2 This Week
  • 20
    BookReader
    BookReader is a file converter from txt to mp3, so your computer can read a text file aloud to produce an audiobook. No speech engine or voices are included.
    Downloads: 3 This Week
  • 21
    GRANULE is a flashcards program based on Leitner cardfile methodology for learning new words. It features long-term memory training capabilities with scheduling, integrated pictures, sound, and full-screen mode.
    Downloads: 3 This Week
  • 22
    A GUI for the Festival speech synthesis program
    Downloads: 2 This Week
  • 23
    Cotovía

    Text-to-Speech System for Galician and Spanish

    Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the available speaker voices has its own license. The speakers available at SourceForge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is available through external links (see the Blog section). Cotovía has been developed by the University of Vigo and the center 'Ramón Piñeiro' for Research in Humanities, both in Galicia, Spain. Its development has involved a research group of linguists and engineers. Cotovía has been developed as a research project, so most of the work has focused on the aspects that are most interesting from a scientific point of view. Although the performance of the whole TTS system is quite good, some parts could clearly be improved. Cotovía files and installation instructions are available in the Files and Git sections.
    Downloads: 2 This Week
  • 24
    The F Modular Synthesizer is a tool for creating all kinds of sounds, white and low-band noises and midi-like tunes with modulated frequencies, volumes, balances and many more features.
    Downloads: 2 This Week
  • 25
    Project dedicated to Festival voices development
    Downloads: 2 This Week