Alternatives to Braina

Compare Braina alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Braina in 2026. Compare features, ratings, user reviews, pricing, and more from Braina competitors and alternatives in order to make an informed decision for your business.

  • 1
    Google Cloud Speech-to-Text
    Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech into text in 73 languages and 137 different local variants. Leverage Google’s most advanced deep learning neural network algorithms for automatic speech recognition (ASR) and deploy ASR wherever you need it, whether in the cloud with the API, on-premises with Speech-to-Text On-Prem, or locally on any device with Speech On-Device.
    Leader badge
    Compare vs. Braina View Software
    Visit Website
  • 2
    Speechmatics

    Speechmatics

    Speechmatics

    Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision—across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence. 🔹 Unmatched Accuracy – Superior transcription across languages & accents 🔹 Flexible Deployment – Cloud, on-prem, and hybrid 🔹 Enterprise-Grade Security – Full data control 🔹 Real-Time & Batch Processing – Scalable transcription
    Starting Price: $0 per month
  • 3
    Twilio Voice
    Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources, like our Voice SDK. Then, add on features like Interactive Voice Response (IVR), recording transcriptions, and speech recognition to create an experience that your customers will appreciate. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice. Find docs, code samples, helper libraries, and developer tools such as Twilio Runtime and our visual workflow builder, Studio.
    Starting Price: $0.0085 per min
  • 4
    LumenVox

    LumenVox

    LumenVox

    Transforming customer engagement with AI-driven speech recognition and voice authentication technology. We’ve spent the last 20 years empowering our partners’ success through collaboration. Our curiosity keeps us innovating for the next 20. Our flexible speech-enabling technology enables you to build a solution that fulfills all your customers’ demands, affordably and reliably. We do one thing, and we do it well. And that's speech-enabling your applications. Finally, deliver great voice automation and interactions. Whether short and simple commands, or conversational questions, LumenVox ASR and TTS is accurate and affordable, helping you improve efficiencies on both sides of the phone line. You’ll never repeat yourself again. We provide you with the utmost flexibility from a capabilities, deployment and monetization perspective. If you can think it, you can build it with LumenVox. Shorten your development to deployment time with our easy, intuitive technology and toolsets.
  • 5
    Otter.ai

    Otter.ai

    Otter.ai

    Otter is where conversations live. Generate rich notes for meetings, interviews, lectures, and other important voice conversations with Otter, your AI-powered assistant. Organizations who have the Otter advantage. Teams big and small trust Otter to transcribe their important conversations. Our shiny new release, Otter 2.0, adds more functionality to improve collaboration and productivity. The Teams plan includes capabilities designed especially for small and medium businesses and teams in larger enterprises. Record and review in real time. Search, play, edit, organize, and share your conversations from any device. Record conversations using Otter on your phone or web browser. Import or sync recordings from other services. Integrate with Zoom. Get real-time streaming transcripts and, within minutes, rich, searchable notes with text, audio, images, speaker ID, and key phrases. Share or export voice notes to inform others and get on the same page.
    Starting Price: $8.33 per month
  • 6
    Work by Speech

    Work by Speech

    Mikołaj Magowski

    Work by Speech is the first program in the world that allows efficient work on a computer by speech without needing a keyboard and mouse. Work by Speech Features: - Efficient work on a computer by speech alone - Quiet speaking support - Application switching and opening by speech - Built-in voice commands for the most common actions - Custom voice commands management - Macro recording and editing - Separate dictation mode - Fast and repeatable mouse control by speech with support for all mouse actions - Customizable mousegrid that can be moved by speech - Automatic mousegrid optimization for every used application - Very low processor and memory usage - Works with any microphone under Windows 10 and 11 - Available for the English language only - Free updates
    Starting Price: Free
  • 7
    Dragon Legal

    Dragon Legal

    Nuance Communications

    Dragon Legal is a specialized speech recognition software tailored for legal professionals, offering a legal-specific language model trained on over 400 million words from legal documents. This enables attorneys and legal practitioners to dictate contracts, briefs, and legal citations with up to 99% accuracy, three times faster than typing. The software supports the creation of custom voice commands to automate repetitive tasks and allows for the transcription of pre-recorded audio files, enhancing workflow efficiency. Optimized for Windows 11 and compatible with Windows 10, Dragon Legal v16 also provides accessibility features such as "play that back" audio of dictated text and sophisticated macro commands, accommodating legal professionals with physical or cognitive disabilities. Additionally, it offers integration with Dragon Anywhere Mobile, a cloud-based dictation solution for iOS and Android devices, ensuring productivity on the go.
    Starting Price: $799 one-time payment
  • 8
    Dragon Legal Anywhere

    Dragon Legal Anywhere

    Nuance Communications

    Nuance’s Dragon Legal Anywhere helps attorneys, judges, clerks, paralegals, and other legal professionals create high-quality documentation, in less time, by using the power of their voice. Legal documentation should be dictated by legal practitioners, not technology limitations. Conversational AI empowers legal teams to document more naturally. Dragon Legal Anywhere’s specialized vocabulary means professionals can dictate contracts, briefs, or format legal citations and other legal documentation, 3X faster than typing, with up to 99% accuracy right from the first use. Speak freely and as much as you like with no per-user limits—legal professionals can stay productive anywhere and focus on their clients and business rather than the technology. Create custom voice commands to insert standard clauses into documents. Or create step‑by‑step commands to automate multi‑part workflows by voice.
  • 9
    SpeechText.AI

    SpeechText.AI

    SpeechText.AI

    Transcribe audio and video into text. Get accurate transcriptions of podcasts with domain-specific speech recognition. SpeechText.AI is a powerful artificial intelligence software for speech to text conversion and audio transcription. Upload audio or video files. AI transcription software supports various file formats and transcribes from speech to text in any language. Select domain. Select industry domain and audio type from predefined categories to improve the recognition accuracy of domain-specific words. Transcribe. Our speech transcription engine uses state-of-the-art deep neural network models to convert from audio to text with close to human accuracy. Edit & Export. Search, modify and verify audio transcriptions using interactive editing tools. Export your content in different formats. Why SpeechText.AI? Set of amazing features to help you transcribe audio and video in seconds. Speech recognition. Powerful speech-to-text tech.
    Starting Price: $19 one-time payment
  • 10
    Dragon Professional

    Dragon Professional

    Nuance Communications

    Dragon Professional is a speech recognition software that enables professionals to create high-quality documentation more efficiently by converting speech into text with up to 99% accuracy. Optimized for Windows 11 and compatible with Windows 10, it serves individuals and groups across various industries, including financial services, education, and healthcare. The software allows users to dictate documents three times faster than typing, supports the transcription of pre-recorded audio files, and offers customization options such as creating custom words and commands to streamline repetitive tasks. Additionally, Dragon Professional v16 includes access to Dragon Anywhere Mobile, a cloud-based dictation solution for iOS and Android devices, ensuring productivity on the go.
    Starting Price: $699 one-time payment
  • 11
    Vocola 3

    Vocola 3

    Vocola 3

    Dictation with Windows Speech Recognition (WSR) works well for "WSR-friendly" applications like MS Word, Outlook, and PowerPoint. Dictated text is inserted directly into document text, and commands like "Delete hedgehog" can refer to specific document text. But WSR dictation works less well for "WSR-unfriendly" applications like MS Excel, Gmail, and most programming environments. Dictation is not inserted directly into document text, and commands cannot refer to document text. Vocola improves this situation by supporting direct dictation for WSR-unfriendly applications, and by allowing correction and modification of the just-dictated phrase. Vocola and WSR use the same underlying speech profile, so any improvements you make via training, correction, or the speech dictionary benefit WSR dictation and Vocola dictation equally. Dictation to WSR-unfriendly applications is essentially unusable in Vista, as every utterance raises the correction panel.
  • 12
    LilySpeech

    LilySpeech

    LilySpeech

    LilySpeech is a free speech to text application that lets you type anywhere in windows using your voice instead of typing with your hands. Use it with any application to send emails, do Google searches, Facebook chats, Skype chats. Use it anywhere you would normally type.
  • 13
    SpeechTexter

    SpeechTexter

    SpeechTexter

    SpeechTexter is a free multilingual speech-to-text application aimed at assisting you with transcription of any type of documents, books, reports or blog posts by using your voice. SpeechTexter allows adding custom voice commands for punctuation marks and some actions (undo, redo, make a new paragraph). Accuracy levels higher than 90% should be expected. It varies depending on the language and the speaker. SpeechTexter is used daily by students, teachers, writers, bloggers around the world. Voice-to-text software is exceptionally valuable for people who have difficulty using their hands due to trauma, people with dyslexia or disabilities that limit the use of conventional input devices. It will assist you in minimizing your writing efforts significantly. It can also be used as a tool for learning a proper pronunciation of words in the foreign language, in addition to helping a person develop fluency with their speaking skills. No download, installation or registration is required.
  • 14
    Dictation.io

    Dictation.io

    Dictation.io

    Use the magic of speech recognition to write emails and documents in Google Chrome. Dictation accurately transcribes your speech to text in real time. You can add paragraphs, punctuation marks, and even smileys using voice commands. Dictation can recognize and transcribe popular languages including English, Español, Français, Italiano, Português, and many more. You can add new paragraphs, punctuation marks, smileys and other special characters using simple voice commands. For instance, say "New line" to move the cursor to the next list or say "Smiling Face" to insert :-) smiley. Dictation uses Google Speech Recognition to transcribe your spoken words into text. It stores the converted text in your browser locally and no data is uploaded anywhere. Learn more. Dictation lets you write text in any language by voice alone, without needing a keyboard or mouse.
  • 15
    Transcribe

    Transcribe

    Wreally

    Transcribe saves thousands of hours every month in transcription time for journalists, lawyers, podcasters, students and professional transcriptionists all over the world. Increase your productivity & save mountains of time when converting your interviews, audio notes, lectures, speeches, podcasts and any recorded speech to text. Put on your headphones, load your audio, slow it down and speak out what you hear. It's that simple. Our dictation engine will convert your speech to text on the fly. This is way faster than typing. We support English, Spanish, French, Hindi and almost all other European & Asian languages.
  • 16
    Dragon Speech Recognition

    Dragon Speech Recognition

    Nuance Communications

    Putting words to work with AI‑powered speech recognition. Empower your employees to create high‑quality documentation. Save your organization time and money with Dragon Professional Anywhere, AI‑powered speech recognition that integrates into enterprise workflows. Empower attorneys to create high‑quality documentation and save time and money with Dragon Legal Anywhere, cloud‑hosted speech recognition that integrates directly into legal workflows. Enable officers to safely and efficiently meet reporting and documentation demands with this customized solution. Drive productivity at work and create and transcribe documents, short-cut repetitive steps—by voice. Seamlessly create, edit and transcribe legal documents by voice for improved efficiency, costs. Complete documents wherever work takes you with the cloud‑based, professional‑grade mobile dictation solution.
    Starting Price: $199.99 one-time fee per user
  • 17
    Fusion Speech
    Back-end speech recognition is the most significant technology development in the dictation and transcription industries. Without physician training, or changes in practice patterns, Fusion Speech® powered by Nuance’s SpeechMagic™ harnesses this powerful technology for facility-wide deployment in nearly every medical specialty. Capture dictation with Fusion Voice®, process the dictation through Fusion Speech, and boost transcription productivity in Fusion Text®. The Fusion modules drive cost savings in reoccurring labor and outsourcing fees. This is the speech recognition solution you have envisioned. Other speech recognition has provided cute gimmicks but fell short in offering a sustainable business application. Fusion Speech provides the tools you require to truly deploy speech recognition that returns measurable and tangible results for your investments.
  • 18
    GoVivace

    GoVivace

    GoVivace

    Our automatic speech recognition engine supports several English accents and can be localized to any language. Also, the ASR engine supports standard telephony as well as web and mobile applications. Being capable of actioning voice commands given to electronic devices such as computers, tablets, smartphones or telephones with the aid of a microphone, the GoVivace’s Automatic Speech Recognition Engine finds use in diverse applications. This automatic speech recognition engine compares the spoken input with a number of pre-specified possibilities and convert speech to text. The entire set of pre-specified possibilities constitute the application’s grammar, which powers the interface between the dialogue-speaker and the back-end processing. GoVivace’s patented Automatic Speech Recognition solution needs only very simple grammar for its processing. It can also support very large grammars for complex tasks.
  • 19
    Dragon Law Enforcement

    Dragon Law Enforcement

    Nuance Communications

    Eliminate the need to decipher handwritten notes or try to recall details from hours before. Officers simply speak to create detailed and accurate incident reports, 3 times faster than typing and with up to 99% recognition accuracy—Zall by voice. With a next-generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse work groups and settings. Use fast and accurate dictation to enter data into RMS and CAD systems or other applications. Officers or support staff simply dictate anywhere they would normally type, and fill and navigate within form fields by voice.
  • 20
    SpeechWrite

    SpeechWrite

    SpeechWrite

    SpeechWrite specializes in a range of cloud dictation and voice recognition agile workflow solutions designed to meet the flexible working needs of the modern-day professional. Scalable and future-proofed solutions to suit all types of organizations. Our industry-leading range of digital dictation and transcription solutions link authors and transcribers facilitating efficient communication. Individual and organizational workflow settings enhance flexibility to ensure you receive your written dictations quickly and efficiently when in the office or on the move. Use your most powerful tool, your voice, and put it to work. Our practical technology, sophisticated yet simple, allows you to enhance your working environment and simply work smarter. We listen, learn and collaborate to support you through every stage of the process while also offering professional guidance and support along the way.
  • 21
    Talkatoo

    Talkatoo

    Talkatoo

    Talkatoo is a voice-enabled AI tool designed to integrate effortlessly with your workflow, transforming speech to text using specialized vocabularies. You focus on patient care; we handle the technology. Built to be affordable and tailored for clinics, Talkatoo helps you reclaim valuable time throughout your day. With processing speeds over 200 words per minute—five times faster than typing—and a built-in medical dictionary. Our key features—Auto-SOAP records, Desktop Dictation, and the AI Assistant empower you to streamline tasks with ease. Record entire appointments to generate formatted SOAP notes instantly, dictate into any application from notes to email, and use the AI Assistant to create discharge instructions, translate documents, and more. Simply download, click, and start speaking, no tech expertise needed.
    Starting Price: $117 per month
  • 22
    Utterly Voice

    Utterly Voice

    Utterly Voice

    ​Utterly Voice is a highly customizable voice dictation and computer control application designed for a completely hands-free computing experience. It allows users to type text, edit content, press keyboard shortcuts, manage windows, scroll content, control the mouse, and create macros using only their voice. Compatible with Windows 10 and 11, Utterly Voice supports English language input, with plans for additional language support in the future. The application offers multiple speech recognizers and models to choose from, including Vosk, Microsoft Azure, Deepgram, Google Cloud Speech-to-Text V1, and Whisper. Users can easily type individual letters, alphanumerics, or code, and benefit from powerful customization abilities using text configuration files. Advanced mouse control methods, configurable voice commands, and control over speech recognition bias enhance the user experience.
    Starting Price: Free
  • 23
    Dragon Professional Anywhere

    Dragon Professional Anywhere

    Nuance Communications

    Nuance Dragon Professional Anywhere empowers busy professionals, including remote workers, to use their voice naturally to create more detailed and accurate documentation quickly and easily. Mission critical documentation should be dictated by knowledge workers and field professionals, not technology limitations. Conversational AI empowers private and public sector professionals to document more naturally. Enables professionals to quickly and easily document the details of client meetings using speech recognition that is 3x faster than typing and up to 99% accurate. Most people speak at over 120 wpm but type at less than 40 wpm. Speak freely and as much as you like with no per-user limits. Business professionals can stay productive anywhere and focus on their clients and business rather than the technology.
  • 24
    Willow Voice

    Willow Voice

    Willow Voice

    ​Willow Voice is an AI-powered dictation tool that is fast, accurate and works on any app. Just speak naturally, and Willow formats your text the way you want it without commands. Speak your thoughts and watch them turn into text. Willow fixes mistakes and formats your words automatically. It adapts to your natural style on any platform. Willow remembers the names and words you use. Willow works on every computer-based website or app, with no copy and pasting, and no context switching. Writing emails shouldn’t be exhausting. Willow saves hours each week by making it as easy as talking. Increase accuracy by adding custom dictionaries for your unique words. Built with end-to-end encryption to keep your data secure at all times. Your voice and text remain private and in your control. Dictate in ten other languages with the same accuracy.
  • 25
    Speechy

    Speechy

    Speechy

    Speechy is an easy-to-use real-time dictation application based on the latest artificial intelligence and powerful speech recognition engine. In Speechy you can dictate the speech into text without the need for a keyboard to enter text. It also helps pronunciation practice of foreign language learning and minutes of meeting memo. Speechy not only transcribes your words, but also records your VOICE so you can refer to the original recording later! Plus, you can easily share your text and audio files later! (Works with Evernote, Dropbox, Google Drive, OneDrive, Facebook, Twitter, Snapchat, WhatsApp and other iOS supported sharing apps.) Whether you’re a professional writer, doctor, lawyer, disabled or somehow prevented from traditional typing, Speechy will swiftly solve your transcription problems and help you achieve your writing goals today! And Speechy doesn’t stop there! Speechy is global-focused, and will recognize your native language.
    Starting Price: $5.99 one-time payment
  • 26
    Voice Finger

    Voice Finger

    Voice Finger

    Enables zero computer contact, no need for keyboards and mouses. Rest your hands and use your voice to command the computer. A definitive solution for people with disabilities and/or computer injuries. Some speech recognition software assumes you can type and click for some tasks. Voice Finger was made to do everything by voice. Also for hardcore gamers. For competitive gamers, Voice Finger can hit keys and buttons while the gamer moves and shoots, acting like a third hand. Voice Finger allows complete control of the keyboard, with short commands to navigate the cursor, type, hold and hit keys and buttons. Windows default speech recognition has a lot of lengthy commands like "Press 1", "Press A" and "Press down 30 times". Voice Finger cuts down all commands to a minimum length, like "1", "A" and "Down 30", and you are still able to use the mouse buttons with commands like "click left", "click right" and others, and at the same time hold keys like Control, Shift and Alt.
    Starting Price: $9.99 one-time payment
  • 27
    Dictation Pro

    Dictation Pro

    DeskShare

    Having difficulty in typing your documents? Speak and let Dictation Pro type for you. Prepare your letters, reports, e-mails, or homework assignments just by speaking into a microphone. A good-quality headset is required. Dictation Pro is fast, easy and fun. You'll wonder how you managed without it! Type the documents with minimum keystrokes and mouse clicks. Dictation Pro turns your voice into text and enable hands-free typing of document. Speak into your microphone and words will appear on the computer screen, instantly, 10 times faster than typing. People have different voice modulations. Voice Training process helps Dictation Pro to identify your voice pitch and tone. The more you use Dictation Pro, the more accurate speech recognition will become. You can add special phrases, names or technical terms into the Vocabulary, for even more accurate dictation. Instead of using mouse or keyboard, just speak the command and Dictation Pro executes it for you.
  • 28
    Dragon Medical One
    Dragon Medical One is a speech-driven clinical documentation platform that helps healthcare professionals streamline their workflow and reduce the time spent on administrative tasks. Designed for ease of use, it integrates with Electronic Health Records (EHRs) and uses advanced speech recognition to capture clinical notes with high accuracy—no voice profile training required. Dragon Medical One offers real-time dictation, auto-punctuation, and customizable voice commands, making it easy for clinicians to document patient interactions and navigate systems hands-free. The platform also supports mobile access, enabling clinicians to work efficiently across various care settings, ultimately improving patient care and clinician satisfaction.
  • 29
    Voicepoint Cloud
    The high-availability Voicepoint Cloud with a data centre in Switzerland offers a flexible, cost-effective speech recognition and dictation management solution for everyone who has to prepare a lot of documentation. With this sophisticated, high-performance cloud solution, you use the integrated speech recognition of Dragon Medical Direct, Dragon Legal Anywhere or Dragon Professional Anywhere and dictate directly in the target application where you get the result immediately as text. You also have access to the Winscribe dictation management solution in the Voicepoint Cloud, optimally covering your speech-based documentation processes. Whether you are in your practice, in the clinic, at your office or out, the cloud-based Voicepoint speech recognition and dictation solution supports documentation anywhere and anytime.
  • 30
    Echo Speech-to-Text

    Echo Speech-to-Text

    Echo Speech-to-Text

    Voice typing. Dictate into any website. Real-time voice transcription. Echo - Speech-to-Text is a state-of-the-art voice typing tool that works on most websites. Experience the most accurate speech recognition accuracy available. Key Features: - ✨ Automatic Punctuation: Enjoy automatic punctuation for polished, professional text. - 🗣️ Voice Type Directly into Textbox: No weird overlay or copy-pasting. - 🌍 Multi-language Support: Supports 50+ languages, including English, Spanish, German, French, etc. - 🛠️ Custom Vocabularies: Add specialized vocabulary or uncommon nouns to boost transcription accuracy. - ⌨️ Keyboard Shortcut: Start and pause voice recognition quickly with a simple keyboard shortcut. 🔒 Trusted and Secure Your privacy is our priority – we do not collect or share your data. We do NOT store any dictation text in our database. 🛡️ HIPAA Compliance We are HIPAA compliant in practice. Audio recordings are never stored. Transcription texts are
  • 31
    e-Speaking

    e-Speaking

    e-Speaking

    An easy software solution to enable you to control your computer, dictate emails and letters, and have the computer read documents back to you. Command and control your Window's computer through your voice. Operate your computer using a minimum of keystrokes or mouse clicks. If you want to move the cursor down one line, simply say: Down One. Want to check your emails? Simply say: Open Email. Add commands to open and control any Window's document or program. People have been speaking to each other for tens of thousands of years. Our brains have evolved to perform a fantastic and complex set of analyses of auditory input. Our brains convert the sounds we hear into conceptual ideas and thoughts which in turn form the basis of instructions, commands, information, and entertainment.
    Starting Price: $14 one-time payment
  • 32
    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, cutting edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out-of-the-box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). Cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands.
  • 33
    INVOX Medical
    The most intuitive voice dictation program on the market. Convenient and instant audio-to-text transcription. The program has a clear and simple design, which guarantees a comfortable, fast and precise operation. INVOX Medical has specific dictionaries and is adapted to many medical specialties. INVOX Medical accurately recognizes a wide variety of medical terminology. INVOX Medical is the voice recognition software already trusted by thousands of medical professionals around the world. It's accurate, easy, and incredibly intuitive. In a few minutes you will be dictating your medical reports with complete accuracy. And in addition, it has an unbeatable price. INVOX Medical uses the latest technology in the use of artificial intelligence to help you dictate your medical reports with maximum precision, allowing you to work up to three times faster. The system allows you to add terms to the dictionary, replace words and modify their pronunciation at any time.
    Starting Price: $35 per month
  • 34
    Azure AI Speech
    Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages.
  • 35
    Soniox

    Soniox

    Soniox

    Soniox develops highly accurate foundational speech models that transcribe, translate, and understand speech as it happens, and also provides the developer platform that makes it easy to integrate real-time voice intelligence into any application. Soniox Speech-to-Text API allows you to transcribe speech in 60+ languages in real-time with high accuracy - built for large scale. Soniox also provides regional data residency and is SOC 2 Type 2, GDPR and HIPAA compliant.
    Starting Price: $0.10/hour of audio
  • 36
    Dictation Speech to Text
    You can now add custom words to improve speech recognition! Find the list in setup->manage custom words. Dictation Speech to text allows to dictate, record, translate and transcribe text instead of typing. It uses latest speech to text voice recognition technology and its main purpose is speech to text and translation for text messaging. Never type any text, just dictate and translate using your speech! Nearly every app that can send text messages can be configured to operate with 'Dictation Speech to text'. Dictate uses the builtin speech to text recognition engine. Dictation Speech to text supports more than 40 languages. Dictate offers 3 text zones, indicated by language flags, for which you can configure a different language in the settings. Thus you can switch between different language projects with a singe click. Translation is as easy as pushing the translation button. You can specify the translation target language in the app settings.
    Starting Price: $4.49 one-time payment
  • 37
    Dragon Anywhere

    Dragon Anywhere

    Nuance Communications

    Dragon Anywhere is a professional-grade mobile dictation app that enables users to create, edit, and format documents of any length using voice commands on iOS and Android devices. With up to 99% accuracy, it allows for continuous dictation without word limits, facilitating efficient document creation and editing on the go. The app supports the use of custom vocabularies and auto-texts, which can be synchronized with Dragon desktop products for a seamless workflow across devices. Additionally, Dragon Anywhere offers robust voice formatting and editing capabilities, allowing users to select text, apply formatting, and make corrections using voice commands. Documents can be easily shared via email, Dropbox, Evernote, and other cloud-based services, enhancing productivity for mobile professionals.
    Starting Price: $15 per user per month
  • 38
    Dictation - Voice to Text

    Dictation - Voice to Text

    Christian Neubauer

    ​Dictation - Voice to Text is an application that enables users to dictate, record, and translate text instead of typing, facilitating text generation in a 'dictation' setup with one speaker in front of the microphone. It supports more than 40 languages for dictation and over 40 languages for translation, allowing users to switch between different language projects with a single click. It offers AI-based transcription capabilities, allowing users to transcribe audio recordings, videos, voice memos, URLs, and YouTube content using OpenAI's speech recognition technology. Both audio recordings and text files can be accessed via the Apple 'Files' app and shared along with the text. With iCloud synchronization enabled, text is automatically synchronized across all devices running Dictation, including iPhone, iPad, macOS, and Apple Watch. It also supports the system font size setting and provides configurable button sizes for visually impaired users.
    Starting Price: Free
  • 39
    Vozy

    Vozy

    Vozy

    Vozy transforms the way companies interact with customers through voice assistants and conversational artificial intelligence to boost customer-centric enterprises with an automation that really works. With personalized solutions designed to meet the growing omnichannel customer care demand, Vozy is delivering significant cost savings and unprecedented customer experiences for companies in Latin America. That’s why powerhouses like SURA, Bancolombia, Protección, and Emtelco trust Vozy.
  • 40
    Gladia

    Gladia

    Gladia

    Gladia is a speech-to-text platform built for production, turning raw audio into structured outputs that power real workflows like meeting summaries, CRM enrichment, contact center QA, and real-time voice assistants. With support for 99+ languages and the ability to handle messy real-world audio—overlapping speakers, accents, code-switching, domain-specific terminology—Gladia is designed for the complexity of actual conversations, not clean studio recordings.
    Starting Price: 10 hours free
  • 41
    DeepScribe

    DeepScribe

    DeepScribe

    DeepScribe’s AI-powered scribe captures the natural conversation between a clinician and patient and automatically writes medical documentation, allowing clinicians to focus on patient care instead of note-taking. Through an easy-to-use mobile app, DeepScribe records the natural clinical encounter and transcribes it in real time. Our proprietary AI then extracts the medical information from the transcript, classifies it into a standard note, and then integrates that note directly into a clinician’s electronic health record system. Unlike traditional scribes, dictation tools, or other solutions, the ambient nature of DeepScribe means it doesn’t intrude on the patient visit or disrupt the clinical workflow. Providers can simply talk to their patient like normal, then review their notes after the visit and sign-off in their EHR. DeepScribe handles documentation, charting, and even populates suggested diagnostic coding based on the information extracted from the visit.
  • 42
    Verbio

    Verbio

    Verbio

    Increase security and user experience in daily interactions with the unique potential of voice. An innovative language agnostic, cost-effective and reliable alternative to seamlessly verify and identify users in real-time. Voice biometrics allows to automatically recognize any person through the characteristics of their voice and it can smartly substitute traditional authentication methods (cards, passwords, signature, fingerprint, etc) in security access control, user verification for digital transactions or for fraud prevention and detection. With an easy and cost-effective solution, authentication through voice biometrics brings an innovative and safe experience to users, with a risk-free and remote access. Biometric Authentication and Identification through voice has never been so secure and fast with different operational uttering models for each type of client and advanced anti-spoofing methodologies.
  • 43
    Speechlogger

    Speechlogger

    Speechlogger

    Generate .srt files, using Speechlogger’s automatica transcription for your own speech, movies, or other audio files. Then you may take the file and automatically translate it into any language to produce international subtitles. For best results, it is best to listen to the movie and dictate it yourself in real-time. Meeting with foreign guests? Bring a laptop (or two) with speechlogger and a microphone. Each party will see the other’s spoken words translated into their own language in real time. It is also useful on a phone call in a foreign language, to make sure you fully understand the other side. Connect your phone’s audio output to your computer’s line-in and start Speechlogger. Both for face to face interactions, and as a caption-phone, Speechlogger can assist the hard of hearing by showing them on the big screen whatever is being said. It is completely automatic, with no human-typist hearing your conversations.
  • 44
    Augnito

    Augnito

    Augnito

    Augnito combines the power of Speech Recognition AI with ease of mobility. You can edit, format, and complete reports at the speed of human speech, with best-in-class accuracy. Now use your personal templates and short forms from any workstation whether you are in the office, or at home or in the journey in between. Best suited for clinical specialties producing detailed reports such as Radiology, Histopathology and Surgical Notes, you can now dictate your reports from anywhere in the world. Augnito understands diverse accents and pronunciations out-of-the-box with no profile training. Built with the latest deep learning technology, it has the entire language of medicine which covers 50+ specialties and sub-specialties combined with all popular generic and drug names.
  • 45
    Maestra

    Maestra

    Maestra.ai

    Automatic Transcripts, Subtitles and Voiceovers. In just minutes. Highly accurate speech to text software with a built in advanced text editor. Translate in English, French, Spanish, German and 80+ languages. Save time and money with Maestra’s automatic audio to text transcription software. Transcribe audio files to text automatically within seconds. No credit card required for the first 15 minutes. Creating subtitles for video with online automatic subtitling software can save you a considerable amount of time. You'll be able to auto generate subtitles for videos in just a few minutes. You can also translate your subtitles automatically to 80+ languages. With Maestra video dubber you can automatically voiceover your videos aloud to foreign languages using artificial intelligence and computer generated voices.
    Starting Price: $6/hour
  • 46
    TrulyNatural
    Sensory is a pioneer in the use of embedded neural network-based speech recognition and has become the industry leader in optimizing and engineering speech recognition software with small footprints and minimal MIPS. This extensive experience and continuous innovation have led to the first embedded large vocabulary continuous-speech recognizer (LVCSR) with state of-the-art cloud performance. Unlike voice recognition software often used with smartphones and mobile devices, such as with a voice assistant mobile app, as well as with IoT (internet of things) enabled technologies (Alexa, Google Assistant, Siri, Cortana), Sensory’s solution is embedded and doesn’t require a wifi connection. Many applications don’t need or want to rely on cloud-based connection to do high-performance speech recognition. Others seek a client/cloud distributed system with optimal performance. The market concerns regarding privacy, performance and bandwidth are driving more processing to the edge.
  • 47
    SpeechMotion
    Document a patient encounter with full or partial dictation, voice recognition, or on-the-go with a customized solution tailored to your unique environment. Solving common documentation issues, like lowering costs and integrating workflows, begins with choosing a solution designed to meet your evolving needs. Improve workflow efficiencies and physician adoption for a rapid return on investment with a partner committed to your long-term success. A leading, national provider of US-based transcription, speech recognition, voice capture and advanced documentation technologies, SpeechMotion partners with healthcare facilities and the organizations supporting them to create a customized documentation solution tailored to support both long and short-term goals. SpeechMotion provides the flexible options healthcare facilities need to quickly and efficiently document a complete patient story, all under one product and service umbrella.
  • 48
    Gboard

    Gboard

    Google

    Gboard has everything you love about Google Keyboard—speed and reliability, Glide Typing, voice typing, Handwriting, and more. Type faster by sliding your finger from letter to letter. Easily dictate text on the go. Write in cursive and printed letters. Search and share GIFs for the perfect reaction. No more switching between languages manually. Gboard will autocorrect and suggest from any of your enabled languages. Translate as you type in the keyboard.
  • 49
    Rubidium

    Rubidium

    Rubidium

    Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others.
  • 50
    Just Press Record

    Just Press Record

    Just Press Record

    Just Press Record is the award-winning mobile audio recorder that brings one-tap recording, transcription and iCloud syncing to all your devices. Turn your voice recordings into text which you can tweak right inside the app and fine-tune your audio by cutting out the parts you don’t need. Life is full of moments we would rather not forget, like your child’s first words, an important meeting or a great idea. Capture and sync these moments effortlessly on Mac, iPad, iPhone and, for ultimate convenience, Apple Watch! A record button everywhere, ready to go when you need it. Unlimited recording time, background recording and pause / resume make it the perfect recorder. Make professional quality recordings up to 96kHz / 24-bit with external microphones connected via the Lightning Port, in M4A, WAV or AIF files. Turn speech into editable, searchable text with support for over 30 languages, independent of your device’s language setting! You can even add punctuation!