Audience
Component Library solution for developers
About gTTS
gTTS (Google Text-to-Speech), a Python library and CLI tool to interface with Google Translate's text-to-speech API. Write spoken mp3 data to a file, a file-like object (bytestring) for further audio manipulation, or stdout. Or simply pre-generate Google Translate TTS request URLs to feed to an external program. Customizable speech-specific sentence tokenizer that allows for unlimited lengths of text to be read, all while keeping proper intonation, abbreviations, decimals and more. Customizable text pre-processors which can, for example, provide pronunciation corrections.
Other Popular Alternatives & Related Software
Fluency Tutor
Features including text-to-speech, dictionary and translation tools, help struggling readers. By giving them access to support tools that help build their understanding and confidence. Teachers can share reading passages with their class and receive recordings of the assigned passages back. Helping to keep students reading, whether they’re in class or at home and avoid any learning loss. Share passages with individual students, or with the entire class using Google Drive or Google Classroom 'share' function. Provides extra help for students with text-to-speech, dictionary, picture dictionary and translation tools. Students can record their own assigned reading passages whenever and wherever it suits them and share back with their teacher. Friendly dashboard interface for teachers & students.
Learn more
TextSpeech Pro
TextSpeech Pro is a professional text-to-speech software product, proudly awarded "the best text to speech software in the world". Synthesize text-to-speech from any document format (text, Microsoft Word, PDF, Microsoft Excel, RTF, etc) using a variety of voices and languages. Export the synthesized speech from documents to a variety of audio file formats in three modes (quick, normal and batch). Create and modify conversations, bookmarks and pauses (silence breaks) in a document using an advanced text-to-speech editor. Modify speech properties (voice, speed, volume, pitch, word highlighting) and speech entities (bookmarks, conversations, pauses) on the fly. Extract text from scanned documents and convert it to speech or audio files. Use a fully featured document editor with many text processing features (text manipulation, spell checker, print and print preview, find and replace, go to line, customizable fonts, zoom capabilities, and document properties view).
Learn more
Amazon Polly
Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk, and build entirely new categories of speech-enabled products. Polly's Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries.
In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a Newscaster reading style that is tailored to news narration use cases, and a Conversational speaking style that is ideal for two-way communication like telephony applications.
Learn more
BookFab
BookFab Audiobook Creator offers high-quality and personalized text-to-speech conversion. Featuring a wide range of voice and full control over parameters, this AI reader lets you create lifelike audio with ease.
Key Features of BookFab Audiobook Creator:
1. Experience high-quality AI text-to-speech with lifelike audio
2. Choose from a wide array of 20 unique voices in both English and Japanese, with options for both male and female.
3. Customize speed, loudness, prosody, expressivity and silence settings for bespoke audio
4. Correct pronunciation with alias settings and tailor reading rules to specific needs
5. Track syntax via synchronous highlighting and automatic scrolling while the audio plays, with the ability to replay specific sentences
6. Enjoy flexibility in text input and audio output. Be it direct text input or TXT file imports, output your audio in a variety of formats including MP3 and OPUS.
Learn more
Pricing
Starting Price:
Free
Free Version:
Free Version available.
Company Information
gTTS
pypi.org/project/gTTS/
Other Useful Business Software
MongoDB Atlas runs apps anywhere
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Product Details
Platforms Supported
Windows
Mac
Linux
Training
Documentation
Support
Online