Alternatives to iSpeech Translator
Compare iSpeech Translator alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to iSpeech Translator in 2026. Compare features, ratings, user reviews, pricing, and more from iSpeech Translator competitors and alternatives in order to make an informed decision for your business.
-
1
Google Cloud Speech-to-Text
Google
Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech into text in 73 languages and 137 different local variants. Leverage Google’s most advanced deep learning neural network algorithms for automatic speech recognition (ASR) and deploy ASR wherever you need it, whether in the cloud with the API, on-premises with Speech-to-Text On-Prem, or locally on any device with Speech On-Device. -
2
Google Cloud Translation API
Google
Make your content and apps multilingual with fast, dynamic machine translation available in thousands of language pairs. The basic edition of the Translation API translates the texts of your website and your applications into more than 100 languages instantly. The Advanced edition offers dynamic results just as quickly as the Basic edition, but also includes other customization features, which is very important when you use phrases or terms that are specific to specific areas and contexts. The pre-trained model of the Translation API supports over a hundred languages, from Afrikaans to Zulu. With AutoML Translation you can create custom models in more than fifty language pairs. Thanks to the Translation API glossary, the content you translate will remain true to your brand. You just have to indicate which vocabulary you want to give priority to and save the glossary file in your translation project.Starting Price: Free (500k characters/month) -
3
Speechmatics
Speechmatics
Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision—across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence. 🔹 Unmatched Accuracy – Superior transcription across languages & accents 🔹 Flexible Deployment – Cloud, on-prem, and hybrid 🔹 Enterprise-Grade Security – Full data control 🔹 Real-Time & Batch Processing – Scalable transcriptionStarting Price: $0 per month -
4
iSpeech Dictation
iSpeech
Speak any message and iSpeech Dictation™ will put it into text format. Dictate using BlackBerry Messenger (BBM), text (SMS), email, or voice notes into text and send. The app's human-quality speech recognition is brought to you by iSpeech®, the creator of DriveSafe.ly®, award-winning leader in texting while driving applications. Speak any phrase or message and iSpeech Dictation™ will translate it into text. Talk and type. -
5
Azure AI Speech
Microsoft
Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages. -
6
PowerSpeak
Saince
PowerSpeak from Saince is a versatile and powerful front end medical speech recognition software. We have included over 30 medical language dictionaries in the solution allowing you to take advantage of this technology irrespective of your specialization or care setting. It is an ideal clinical documentation and reporting solution not just for radiologists, but also for physicians of all specialties and in all care settings – acute care hospitals, imaging centers, labs, physician offices, behavioral health hospitals, long term care hospitals, nursing homes etc. Unlike other speech recognition solutions in the market that tie you down to a single device to use them, PowerSpeak Medical speech recognition software gives you the flexibility to install on five devices on a single license. PowerSpeak’s powerful and advanced speech recognition algorithms ensure that you enjoy 99% accuracy of the transcribed text every time. Less time spent correcting errors translates into more productivity. -
7
Rubidium
Rubidium
Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others. -
8
Knovvu Speech Recognition
Sestek
Automate customer processes, evaluate agent performances objectively and ensure your operations are 100% efficient. In our connected world, many consumers are interacting with everyday connected appliances in new ways. With a trend in connected devices that often lack a screen, speech is emerging as a natural, intuitive interface for human-machine interaction. Speech recognition is the driving technology behind this development, revolutionizing the way people interact with their devices. With Knovvu Speech Recognition from Sestek, machines and applications can understand user commands in spoken language. With the ability to listen to and interpret spoken demands, users may interact with these devices by speaking aloud rather than inputting buttons and keystrokes. Our automatic speech recognition software has full application. Many organizations use technology to power intuitive and straightforward self-service solutions. -
9
NeoSound
NeoSound Intelligence
NeoSound Intelligence is an AI tech company that turns emotions into actionable insights in order to create a world with better conversations between organizations and consumers. We intend to make all conversations better between consumers and organizations. By providing AI-powered speech analytics tools, we help call center companies to optimize their customer communication. Turn calls into revenue. Optimise customer communication by listening to customer calls automatically. NeoSound tools turn phone conversations into meaningful actionable insights to make customer communication better. NeoSound tools do not only speech-to-text translation. Smart algorithms do acoustics and intonation analysis. The machine listens to how people speak not only what they say. That is why our trained machines can easily address your company-specific needs. NeoSound offers a unique combination of speech-to-text semantic analytics and acoustic analysis of intonation. -
10
SpeechText.AI
SpeechText.AI
Transcribe audio and video into text. Get accurate transcriptions of podcasts with domain-specific speech recognition. SpeechText.AI is a powerful artificial intelligence software for speech to text conversion and audio transcription. Upload audio or video files. AI transcription software supports various file formats and transcribes from speech to text in any language. Select domain. Select industry domain and audio type from predefined categories to improve the recognition accuracy of domain-specific words. Transcribe. Our speech transcription engine uses state-of-the-art deep neural network models to convert from audio to text with close to human accuracy. Edit & Export. Search, modify and verify audio transcriptions using interactive editing tools. Export your content in different formats. Why SpeechText.AI? Set of amazing features to help you transcribe audio and video in seconds. Speech recognition. Powerful speech-to-text tech.Starting Price: $19 one-time payment -
11
Soniox
Soniox
Soniox develops highly accurate foundational speech models that transcribe, translate, and understand speech as it happens, and also provides the developer platform that makes it easy to integrate real-time voice intelligence into any application. Soniox Speech-to-Text API allows you to transcribe speech in 60+ languages in real-time with high accuracy - built for large scale. Soniox also provides regional data residency and is SOC 2 Type 2, GDPR and HIPAA compliant.Starting Price: $0.10/hour of audio -
12
Microsoft Translator
Microsoft
Microsoft Translator enables you to translate text and speech, have translated conversations, and even download AI-powered language packs to use offline. Speak, type, or write by hand with Windows Ink, to translate into over 60 languages. Have real-time translated conversations with up to 100 people, each on their own device (Windows, iOS, Android, Kindle). Start or join a conversation directly through Cortana. Translate images such as menus and signs. Download languages to translate offline using state-of-the-art neural machine translation. Hear your translated phrase to help you with pronunciation. Share your translation with other apps. Pin your most frequent translations to save for later. Learn a new word or phrase everyday by pinning Translator to Start. Breaking the language barrier at home, at work, anywhere you need it. Join the conversation no matter what language you speak. Chat, share experiences, create a connection. Interact with ease when traveling abroad. -
13
Dictation Speech to Text
IBN Software
You can now add custom words to improve speech recognition! Find the list in setup->manage custom words. Dictation Speech to text allows to dictate, record, translate and transcribe text instead of typing. It uses latest speech to text voice recognition technology and its main purpose is speech to text and translation for text messaging. Never type any text, just dictate and translate using your speech! Nearly every app that can send text messages can be configured to operate with 'Dictation Speech to text'. Dictate uses the builtin speech to text recognition engine. Dictation Speech to text supports more than 40 languages. Dictate offers 3 text zones, indicated by language flags, for which you can configure a different language in the settings. Thus you can switch between different language projects with a singe click. Translation is as easy as pushing the translation button. You can specify the translation target language in the app settings.Starting Price: $4.49 one-time payment -
14
AccuSpeechMobile
AccuSpeechMobile
AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, cutting edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out-of-the-box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). Cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands. -
15
Vocola 3
Vocola 3
Dictation with Windows Speech Recognition (WSR) works well for "WSR-friendly" applications like MS Word, Outlook, and PowerPoint. Dictated text is inserted directly into document text, and commands like "Delete hedgehog" can refer to specific document text. But WSR dictation works less well for "WSR-unfriendly" applications like MS Excel, Gmail, and most programming environments. Dictation is not inserted directly into document text, and commands cannot refer to document text. Vocola improves this situation by supporting direct dictation for WSR-unfriendly applications, and by allowing correction and modification of the just-dictated phrase. Vocola and WSR use the same underlying speech profile, so any improvements you make via training, correction, or the speech dictionary benefit WSR dictation and Vocola dictation equally. Dictation to WSR-unfriendly applications is essentially unusable in Vista, as every utterance raises the correction panel. -
16
AppTek
AppTek
AppTek is a global leader in artificial intelligence (AI) and machine learning (ML) technologies for automatic speech recognition (ASR), neural machine translation (NMT), and natural language understanding (NLU). The AppTek platform delivers industry-leading, real-time streaming and batch technology solutions in the cloud or on-premise for organizations across a breadth of worldwide markets such as media and entertainment, call centers, government, enterprise business, and more. Built by scientists and research engineers who are recognized among the best in the world, AppTek’s solutions cover a wide array of languages, dialects, and channels. AppTek utilizes deep neural networks to transcribe and understand speech and text data, delivering more accurate and efficient tools. -
17
Bohemicus
Jan Kapoun
Using this program, you can boost your translation productivity up to 300%, or even more with certain types of texts. Bohemicus is a powerful translator’s tool. It integrates with your CAT tool (or any other application) to enhance its capabilities. It works as an interface. With Bohemicus, you can take advantages of the following features in ANY application, e.g. MS Office, CAT tools, web-based CATs, etc.: machine translation, voice dictation (speech-to-text), your own translation memories, convenient search in online/offline dictionaries, note taking, clipboard manager, translation jobs management, invoicing, and much more…Starting Price: €99 -
18
TapMedia Translator
TapMedia Ltd
Translator allows you to translate any sentence or phrase into 100+ languages with the tap of a button. Translate by typing, using your voice, or scanning text with your camera. Translate into 100+ languages. Real-time voice recognition. Scan text with your camera. Built-in phrasebook, text-to-speech, and history feature. Favorite translations, attractive UI, share your translations. You will receive access to the apps in the TapMedia PRO bundle for the duration of the subscription.Starting Price: Free -
19
Transcribe
Wreally
Transcribe saves thousands of hours every month in transcription time for journalists, lawyers, podcasters, students and professional transcriptionists all over the world. Increase your productivity & save mountains of time when converting your interviews, audio notes, lectures, speeches, podcasts and any recorded speech to text. Put on your headphones, load your audio, slow it down and speak out what you hear. It's that simple. Our dictation engine will convert your speech to text on the fly. This is way faster than typing. We support English, Spanish, French, Hindi and almost all other European & Asian languages. -
20
Whisper
OpenAI
We’ve trained and are open-sourcing a neural net called Whisper that approaches human-level robustness and accuracy in English speech recognition. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise, and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing. The Whisper architecture is a simple end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second chunks, converted into a log-Mel spectrogram, and then passed into an encoder. -
21
Alibaba Cloud Intelligent Speech Interaction
Alibaba Cloud
Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French and Indonesian, and please stay tuned for other languages. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, eCommerce and smart home.Starting Price: $1.40 per hour -
22
Speech Recogniser
Anfasoft
With this revolutionary app, you won't need to type anything any more. You just speak and your speech is instantly converted into text. This brilliant speech-to-text app will allow you to do more with your iPhone. Translate your speech into more than 40 languages. Hear your translation being read aloud to you, copy your text to other apps, and Tweet. Speech Recogniser uses the latest technologies in speech recognition and machine translation. As a result, the app requires an Internet connection. Speech Recogniser will definitely make your life easier, so download it and get your copy now! The supported languages include English (Australia), English (UK), English (US), Español (España), Español (México), Bahasa indonesia, Bahasa melayu, čeština, Dansk, Deutsch, français (Canada), français (France), italiano, Magyar, Nederlands, Norsk, Polski, Português, Português brasileiro, Pyccĸий, and more.Starting Price: $10.66 one-time payment -
23
Phrase Localization Platform
Phrase Localization Platform
Phrase is a leader in Language Intelligence. Its enterprise platform automates, manages, and delivers multilingual content and experiences, helping organizations build deeper customer connections and accelerate business growth. Thousands of global brands use Phrase across hundreds of languages to reduce time to market and deliver consistent brand experiences worldwide. The Phrase Platform brings together translation management, software localization, multimedia localization, machine translation, workflow automation, and language AI in a single environment. From marketing campaigns and product interfaces to apps, audio, video, and customer support, teams manage all multilingual content in one place. Built for complex, fast-moving organizations, Phrase connects directly to the systems where content is created and published. Enterprise-ready and ISO 27001 certified, Phrase is trusted by global brands including Uber, AWS, Volkswagen, and Zendesk. Learn more at phrase.com.Starting Price: $27 per month -
24
Mymanu Translate
Mymanu
A uniquely designed, live voice-to-voice translation APP to help individuals and businesses communicate. The group translation is unique and secured by a password specifically chosen by you so you can invite who you like to join in. The speech-to-text system will generate a transcript of the conversation on each participant’s phone screen so you can refer to it later on. Its own proprietary speech recognition will enable you to understand more than 4 billion people around the world without having to type a single word. Mymanu® Translate will help you create new experiences and embrace new cultures. Live speech-to-speech translation in 29 languages, more than 4 billion people to speak with. Mymanu® Translate has been designed for people who travel abroad for fun and those who do business internationally to help them overcome language barriers. -
25
SpeechPulse
AV BEAM
SpeechPulse uses your computer’s microphone for real-time speech recognition. It can type into your favorite apps, including text editors, web browsers, and office applications. SpeechPulse works fully offline and doesn’t require any internet connectivity. It supports speech recognition in multiple languages, including English, French, Spanish, Italian, German, Japanese, Chinese, and Russian (a total of 100 languages). SpeechPulse supports both auto punctuation and manual punctuation for the English language. It supports auto punctuation for all other languages. SpeechPulse can also generate subtitles for your audio and video files with accurate timestamps. It supports SRT and VTT subtitle formats. You can also customize the width of a subtitle line to include only a limited number of characters. SpeechPulse has a one-time payment. You can pay for the product once and use it forever.Starting Price: $59.95/one-time payment -
26
aiOla
aiOla
aiOla is a deep tech Conversational, Voice, and Speech AI lab with an enterprise-level automatic speech recognition (ASR) foundation model, Text-to-speech (TTS) technology and Natural Language Understanding (NLU). It’s designed to help enterprises and developers adapt speech technologies to any process, whether through seamless API integration or an intuitive in-house app. aiOla is revolutionizing enterprise operations with enterprise level Conversational AI. We specialize in speech-to-text and text-to-speech AI that deliver unmatched accuracy (95%), specialized in specific jargon, in any language, accent, vertical, or acoustic environment. From empowering frontline workers with hands-free workflows to enabling voice AI agents with enterprise-grade ASR and TTS, aiOla seamlessly integrates into workflows, internal apps and products. -
27
Fusion Speech
Dolbey
Back-end speech recognition is the most significant technology development in the dictation and transcription industries. Without physician training, or changes in practice patterns, Fusion Speech® powered by Nuance’s SpeechMagic™ harnesses this powerful technology for facility-wide deployment in nearly every medical specialty. Capture dictation with Fusion Voice®, process the dictation through Fusion Speech, and boost transcription productivity in Fusion Text®. The Fusion modules drive cost savings in reoccurring labor and outsourcing fees. This is the speech recognition solution you have envisioned. Other speech recognition has provided cute gimmicks but fell short in offering a sustainable business application. Fusion Speech provides the tools you require to truly deploy speech recognition that returns measurable and tangible results for your investments. -
28
Maestra
Maestra.ai
Automatic Transcripts, Subtitles and Voiceovers. In just minutes. Highly accurate speech to text software with a built in advanced text editor. Translate in English, French, Spanish, German and 80+ languages. Save time and money with Maestra’s automatic audio to text transcription software. Transcribe audio files to text automatically within seconds. No credit card required for the first 15 minutes. Creating subtitles for video with online automatic subtitling software can save you a considerable amount of time. You'll be able to auto generate subtitles for videos in just a few minutes. You can also translate your subtitles automatically to 80+ languages. With Maestra video dubber you can automatically voiceover your videos aloud to foreign languages using artificial intelligence and computer generated voices.Starting Price: $6/hour -
29
Speech Recognition Cloud
Speech Recognition Cloud
Speech Recognition Cloud is a cloud-based speech recognition and dictation application for Windows. It converts speech to text in real time and types directly at the cursor in most applications (Word, Outlook, browsers and web forms). It supports automatic punctuation, spoken formatting commands (new lines, paragraphs, bullet and numbered lists), configurable hotkeys/hold-to-talk, and custom vocabulary and text expansion. Processing occurs in the cloud, so users can dictate on standard PCs without high-end hardware. An optional Medical edition supports clinical terminology for healthcare documentation. An internet connection is required.Starting Price: $6/month -
30
Talkatoo
Talkatoo
Talkatoo is a voice-enabled AI tool designed to integrate effortlessly with your workflow, transforming speech to text using specialized vocabularies. You focus on patient care; we handle the technology. Built to be affordable and tailored for clinics, Talkatoo helps you reclaim valuable time throughout your day. With processing speeds over 200 words per minute—five times faster than typing—and a built-in medical dictionary. Our key features—Auto-SOAP records, Desktop Dictation, and the AI Assistant empower you to streamline tasks with ease. Record entire appointments to generate formatted SOAP notes instantly, dictate into any application from notes to email, and use the AI Assistant to create discharge instructions, translate documents, and more. Simply download, click, and start speaking, no tech expertise needed.Starting Price: $117 per month -
31
Azure Speech Translation
Microsoft
Translate audio from more than 30 languages and customize your translations for your organization’s specific terms, all in your preferred programming language. Benefit from fast, reliable speech translation powered by neural machine translation technology. Generate speech-to-speech and speech-to-text translations with a single API call. Speech Translation captures the context of full sentences to provide accurate, fluent translations and improve communication between speakers of different languages. Customize speech recognition and translation for terminology specific to your business or industry. Train and deploy a custom translation system, without requiring machine learning expertise. Speech Translation can remove verbal fillers ("um," "uh," and coughs) and repeated words, add proper punctuation and capitalization, and exclude profanities for more readable translations. Deliver readable translations with an engine trained to normalize speech output.Starting Price: $0.36 per hour -
32
Work by Speech
Mikołaj Magowski
Work by Speech is the first program in the world that allows efficient work on a computer by speech without needing a keyboard and mouse. Work by Speech Features: - Efficient work on a computer by speech alone - Quiet speaking support - Application switching and opening by speech - Built-in voice commands for the most common actions - Custom voice commands management - Macro recording and editing - Separate dictation mode - Fast and repeatable mouse control by speech with support for all mouse actions - Customizable mousegrid that can be moved by speech - Automatic mousegrid optimization for every used application - Very low processor and memory usage - Works with any microphone under Windows 10 and 11 - Available for the English language only - Free updatesStarting Price: Free -
33
Amazon Nova Sonic
Amazon
Amazon Nova Sonic is a state-of-the-art speech-to-speech model that delivers real-time, human-like voice conversations with industry-leading price performance. It unifies speech understanding and generation into a single model, enabling developers to create natural, expressive conversational AI experiences with low latency. Nova Sonic adapts its responses based on the prosody of input speech, such as pace and timbre, resulting in more natural dialogue. It supports function calling and agentic workflows to interact with external services and APIs, including knowledge grounding with enterprise data using Retrieval-Augmented Generation (RAG). It provides robust speech understanding for American and British English across various speaking styles and acoustic conditions, with additional languages coming soon. Nova Sonic handles user interruptions gracefully without dropping conversational context and is robust to background noise. -
34
GoVivace
GoVivace
Our automatic speech recognition engine supports several English accents and can be localized to any language. Also, the ASR engine supports standard telephony as well as web and mobile applications. Being capable of actioning voice commands given to electronic devices such as computers, tablets, smartphones or telephones with the aid of a microphone, the GoVivace’s Automatic Speech Recognition Engine finds use in diverse applications. This automatic speech recognition engine compares the spoken input with a number of pre-specified possibilities and convert speech to text. The entire set of pre-specified possibilities constitute the application’s grammar, which powers the interface between the dialogue-speaker and the back-end processing. GoVivace’s patented Automatic Speech Recognition solution needs only very simple grammar for its processing. It can also support very large grammars for complex tasks. -
35
WebsiteVoice
WebsiteVoice
Turn all your website articles into high-quality audio in less than 5 minutes and for free. Let your visitors listen to the content of your website in the background while they do other things with our text-to-speech technology and increase the time spent on your website. Accessibility is sometimes forgotten. Empower visitors with visual impairment and reading disabilities to still completely consume your content without the complications of reading. Listening to podcasts and audiobooks has become a growing trend and behavior for people to consume content. Capture a wider audience that would prefer tuning in instead of reading. Thanks to our Automatic Content Recognition technology, you can just drop our snippet on your site and forget about it. We will automatically enable text-to-speech voice for the relevant content. We use Artificial Intelligence and Machine Learning to constantly improve our voice algorithms to make your website text-to-speech as realistic as possible.Starting Price: $9 per month -
36
Talk For Me
Talk For Me
Not being able to speak on your own is difficult. Talk For Me - Text to Speech, designed and engineered by a person who lost the ability to speak, seeks to make your life easier. Type in the main text area or tap one of the six main custom buttons and your iOS device will talk for you. Want to set up more custom phrases? Swipe up for more pages with custom editable buttons. Need even more? Save phrases in an archive database. This is great for saving partial sentences. A quick swipe left, select a sentence from your archive, and it will appear in the main window ready for you to complete. Can you type fast or need to spell a word? Turn on the Auto Speech Function to have every word or letter spoken as you enter it. Together with keyboard shortcuts, predictive text and your custom phrases, this app will allow you to communicate with ease. -
37
Dragon Speech Recognition
Nuance Communications
Putting words to work with AI‑powered speech recognition. Empower your employees to create high‑quality documentation. Save your organization time and money with Dragon Professional Anywhere, AI‑powered speech recognition that integrates into enterprise workflows. Empower attorneys to create high‑quality documentation and save time and money with Dragon Legal Anywhere, cloud‑hosted speech recognition that integrates directly into legal workflows. Enable officers to safely and efficiently meet reporting and documentation demands with this customized solution. Drive productivity at work and create and transcribe documents, short-cut repetitive steps—by voice. Seamlessly create, edit and transcribe legal documents by voice for improved efficiency, costs. Complete documents wherever work takes you with the cloud‑based, professional‑grade mobile dictation solution.Starting Price: $199.99 one-time fee per user -
38
Translate Me
Simya Solutions
The best free translation tool and dictionary for more than 100 languages. Easily translate from text, voice or conversations. Translate me provides the most accurate translation to help you travel with hassle-free or learn new languages every day. Take pictures of text to translate instantly or choose from your gallery. Translate voice/speech using voice recognition technology to get the most accurate result. Voice-to-Voice conversations helps you communicate with everyone without barriers in all parts of the world. Premium version offers unlimited translations with camera, +1000 essential words and phrases in the conversation guide and No ads. Established in 2016, Simya Solutions Ltd. is a multi-national IT company providing a range of mobile applications that enable people around the world to learn every day and everywhere.Starting Price: $39.99 one-time payment -
39
Wynyard Voice Frequency Analytics
Wynyard Group
There is a lot of unstructured data in various formats such as call records, recorded conversations, unclear voices, etc. To identify the relevant data and recognize the voices, a powerful tool is required. Wynyard Voice Frequency Analytics (VFA) is an analyzing tool that helps in identifying the person behind an unclaimed voice or decoding the speech in a readable format from an unclear voice. It is a web application that recognizes the identity of the speaker. The application is beneficial for the law enforcement and Government bodies to prevent crimes. Wynyard VFA works on the simple concept of matching the suspected voice with the ones available in the database and recognizing the owner of that voice. The advanced and superior technology used in the application ensures accurate results. The application can also be used to identify keywords or phrases from a conversation and convert the speech into readable text. -
40
Text to Speech!
Text to Speech!
Bring your text to life with Text to Speech! Text to speech produces natural sounding synthesised text from the words that you have entered in. With 82 different voices to choose from and the ability to adjust the rate and pitch, there are countless ways in which the synthesised voice can be adjusted. Voices are available in 38 different languages/accents. The ability to adjust the pitch and rate. Star your favourite phrases. Group starred phrases into folders. Mix speech into your phone calls. -
41
RocketWhisper
Mojosoft Co., Ltd.
RocketWhisper is a powerful desktop speech recognition and transcription application that runs 100% offline on your computer. Your voice data never leaves your machine - complete privacy guaranteed. Powered by OpenAI's Whisper engine with NVIDIA GPU (CUDA) acceleration, RocketWhisper delivers fast and accurate speech-to-text conversion for professionals, content creators, and anyone who works with voice and text. Key Features: - 100% offline processing - voice data never leaves your PC - OpenAI Whisper engine for high-accuracy speech recognition - NVIDIA CUDA GPU acceleration - up to 10x faster than CPU - Real-time voice-to-text input with global hotkey (Push-to-Talk with Right Alt) - Batch transcription of multiple audio/video files (MP3, WAV, M4A, MP4, MKV, AVI, etc.) - SRT/VTT subtitle export for video content - AI text formatting with LLM integration (OpenAI, Anthropic, Google Gemini, Grok, local LLM)Starting Price: $32 one-time -
42
Voicepoint Cloud
Voicepoint
The high-availability Voicepoint Cloud with a data centre in Switzerland offers a flexible, cost-effective speech recognition and dictation management solution for everyone who has to prepare a lot of documentation. With this sophisticated, high-performance cloud solution, you use the integrated speech recognition of Dragon Medical Direct, Dragon Legal Anywhere or Dragon Professional Anywhere and dictate directly in the target application where you get the result immediately as text. You also have access to the Winscribe dictation management solution in the Voicepoint Cloud, optimally covering your speech-based documentation processes. Whether you are in your practice, in the clinic, at your office or out, the cloud-based Voicepoint speech recognition and dictation solution supports documentation anywhere and anytime. -
43
Translate.com
Translate.com
Professional translation scaled by technology and enhanced by human experts. Translate.com is a translation software for businesses and individuals, allowing them to translate files (PDF, Word, Excel, PowerPoint, text), localize customer support, and amplify multilingual apps and websites. Trusted by leading companies worldwide, Translate.com localization services help clients overcome global language and cultural barriers and move up in business growth worldwide. Services and tools 1) Human translation services for localization, professional document translation and more. 2) Translate.com Machine Translation software. 3) Zendesk Translation App for multilingual customer support. ✓ Machine translation & Human translation ✓ Automated translation sending. ✓ Translation Glossary & Translation Memory. 4) Translation API. 5) App and website localization in JSON file format. -
44
Lingo Champion
Lingo Champion
Lingo Champion is a language-acquisition platform that enables learners to progress through reading, listening, and vocabulary practice using authentic content rather than traditional drills. It incorporates news articles, stories, videos, subtitles, and uploaded texts into an immersive learning environment, automatically tracking every word you read or listen to and converting it into personalized flashcards. The platform supports over 15 languages and offers a Chrome/desktop extension as well as a mobile app that allows you to learn while browsing the web or watching YouTube. Within any text or webpage you open, Lingo Champion detects words and sentences in your target language and provides instant translations or hover-tooltips; you can customise the translation percentage, choose to focus on words or full sentences, and import external subtitle or caption files. The system includes AI-powered tools that let you look up phrases, get explanations of natural speech, and more.Starting Price: $5.99 per month -
45
TrulyNatural
Sensory
Sensory is a pioneer in the use of embedded neural network-based speech recognition and has become the industry leader in optimizing and engineering speech recognition software with small footprints and minimal MIPS. This extensive experience and continuous innovation have led to the first embedded large vocabulary continuous-speech recognizer (LVCSR) with state of-the-art cloud performance. Unlike voice recognition software often used with smartphones and mobile devices, such as with a voice assistant mobile app, as well as with IoT (internet of things) enabled technologies (Alexa, Google Assistant, Siri, Cortana), Sensory’s solution is embedded and doesn’t require a wifi connection. Many applications don’t need or want to rely on cloud-based connection to do high-performance speech recognition. Others seek a client/cloud distributed system with optimal performance. The market concerns regarding privacy, performance and bandwidth are driving more processing to the edge. -
46
Dragon Professional
Nuance Communications
Dragon Professional is a speech recognition software that enables professionals to create high-quality documentation more efficiently by converting speech into text with up to 99% accuracy. Optimized for Windows 11 and compatible with Windows 10, it serves individuals and groups across various industries, including financial services, education, and healthcare. The software allows users to dictate documents three times faster than typing, supports the transcription of pre-recorded audio files, and offers customization options such as creating custom words and commands to streamline repetitive tasks. Additionally, Dragon Professional v16 includes access to Dragon Anywhere Mobile, a cloud-based dictation solution for iOS and Android devices, ensuring productivity on the go.Starting Price: $699 one-time payment -
47
Deepgram
Deepgram
Deploy accurate speech recognition at scale while continuously improving model performance by labeling data and training from a single console. We deliver state-of-the-art speech recognition and understanding at scale. We do it by providing cutting-edge model training and data-labeling alongside flexible deployment options. Our platform recognizes multiple languages, accents, and words, dynamically tuning to the needs of your business with every training session. The fastest, most accurate, most reliable, most scalable speech transcription, with understanding — rebuilt just for enterprise. We’ve reinvented ASR with 100% deep learning that allows companies to continuously improve accuracy. Stop waiting for the big tech players to improve their software and forcing your developers to manually boost accuracy with keywords in every API call. Start training your speech model and reaping the benefits in weeks, not months or years.Starting Price: $0 -
48
Rev.ai
Rev.ai
Rev.ai was built by leading speech recognition experts from millions of hours of accurate human-transcribed content. We began in 2011 with Rev.com, providing human transcription services. We are now the world's largest transcription vendor, with over 35,000 contractors who transcribe millions of minutes of audio each month. In 2017 we launched Temi, an automated speech-to-text transcription and editing service. Temi has already transcribed 20 million minutes of content and was named the best transcription service by Wirecutter. Today our best-in-class speech engine is available to everyone as Rev.ai. We're helping companies get the most out of their audio and video content by making it searchable and accessible. -
49
Virtual Speech Center
Virtual Speech Center
Virtual Speech Center offers innovative speech therapy apps and software for schools, private practices, independent speech pathologists and parents. We offer a wide range of mobile applications for speech therapy developed for IPad and IPhone devices. Some of our apps are offered at no charge to speech pathologists. Virtual Speech Center is a pioneer in taking speech and language therapy apps to the next level by incorporating games as reward components. The games featured in our apps include puzzles, board games, and games with sports and carnival themes. Our apps can be purchased individually or in bundles. Virtual Speech Center's TheraPlatform speech therapy software includes telepractice, documentation, billing, intake forms and e-claim submission modules designed for speech and language pathologists. Virtual Speech Center offers innovative speech therapy apps for schools, private practices, independent speech pathologists and parents. -
50
Azure AI Translator
Microsoft
An AI service for real-time document and text translation. Translate text instantly or in batches across more than 100 languages, powered by the latest innovations in machine translation. Support a wide range of use cases, such as translation for call centers, multilingual conversational agents, or in-app communication. Accurately translate text in more than 100 languages. Build custom models to handle domain-specific terminology. Access the same technology that powers billions of translations every day across Microsoft products. Your data remains yours—your text input isn’t logged during translation. Take advantage of our AI Translator service to remove the complexity of building instant translation into your apps and solutions with a single REST API call. Accurately detect the language of your source text, look up alternative translations with the bilingual dictionary, or convert text from one script to another.