Alternatives to LilySpeech

Compare LilySpeech alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to LilySpeech in 2026. Compare features, ratings, user reviews, pricing, and more from LilySpeech competitors and alternatives in order to make an informed decision for your business.

  • 1
    Google Cloud Speech-to-Text
    Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech into text in 73 languages and 137 different local variants. Leverage Google’s most advanced deep learning neural network algorithms for automatic speech recognition (ASR) and deploy ASR wherever you need it, whether in the cloud with the API, on-premises with Speech-to-Text On-Prem, or locally on any device with Speech On-Device.
    Leader badge
    Compare vs. LilySpeech View Software
    Visit Website
  • 2
    Speechmatics

    Speechmatics

    Speechmatics

    Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision—across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence. 🔹 Unmatched Accuracy – Superior transcription across languages & accents 🔹 Flexible Deployment – Cloud, on-prem, and hybrid 🔹 Enterprise-Grade Security – Full data control 🔹 Real-Time & Batch Processing – Scalable transcription
    Starting Price: $0 per month
  • 3
    LumenVox

    LumenVox

    LumenVox

    Transforming customer engagement with AI-driven speech recognition and voice authentication technology. We’ve spent the last 20 years empowering our partners’ success through collaboration. Our curiosity keeps us innovating for the next 20. Our flexible speech-enabling technology enables you to build a solution that fulfills all your customers’ demands, affordably and reliably. We do one thing, and we do it well. And that's speech-enabling your applications. Finally, deliver great voice automation and interactions. Whether short and simple commands, or conversational questions, LumenVox ASR and TTS is accurate and affordable, helping you improve efficiencies on both sides of the phone line. You’ll never repeat yourself again. We provide you with the utmost flexibility from a capabilities, deployment and monetization perspective. If you can think it, you can build it with LumenVox. Shorten your development to deployment time with our easy, intuitive technology and toolsets.
  • 4
    Otter.ai

    Otter.ai

    Otter.ai

    Otter is where conversations live. Generate rich notes for meetings, interviews, lectures, and other important voice conversations with Otter, your AI-powered assistant. Organizations who have the Otter advantage. Teams big and small trust Otter to transcribe their important conversations. Our shiny new release, Otter 2.0, adds more functionality to improve collaboration and productivity. The Teams plan includes capabilities designed especially for small and medium businesses and teams in larger enterprises. Record and review in real time. Search, play, edit, organize, and share your conversations from any device. Record conversations using Otter on your phone or web browser. Import or sync recordings from other services. Integrate with Zoom. Get real-time streaming transcripts and, within minutes, rich, searchable notes with text, audio, images, speaker ID, and key phrases. Share or export voice notes to inform others and get on the same page.
    Starting Price: $8.33 per month
  • 5
    SpeechPulse
    SpeechPulse uses your computer’s microphone for real-time speech recognition. It can type into your favorite apps, including text editors, web browsers, and office applications. SpeechPulse works fully offline and doesn’t require any internet connectivity. It supports speech recognition in multiple languages, including English, French, Spanish, Italian, German, Japanese, Chinese, and Russian (a total of 100 languages). SpeechPulse supports both auto punctuation and manual punctuation for the English language. It supports auto punctuation for all other languages. SpeechPulse can also generate subtitles for your audio and video files with accurate timestamps. It supports SRT and VTT subtitle formats. You can also customize the width of a subtitle line to include only a limited number of characters. SpeechPulse has a one-time payment. You can pay for the product once and use it forever.
    Starting Price: $59.95/one-time payment
  • 6
    Dragon Law Enforcement

    Dragon Law Enforcement

    Nuance Communications

    Eliminate the need to decipher handwritten notes or try to recall details from hours before. Officers simply speak to create detailed and accurate incident reports, 3 times faster than typing and with up to 99% recognition accuracy—Zall by voice. With a next-generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments; making it ideal for diverse work groups and settings. Use fast and accurate dictation to enter data into RMS and CAD systems or other applications. Officers or support staff simply dictate anywhere they would normally type, and fill and navigate within form fields by voice.
  • 7
    Dragon Professional

    Dragon Professional

    Nuance Communications

    Dragon Professional is a speech recognition software that enables professionals to create high-quality documentation more efficiently by converting speech into text with up to 99% accuracy. Optimized for Windows 11 and compatible with Windows 10, it serves individuals and groups across various industries, including financial services, education, and healthcare. The software allows users to dictate documents three times faster than typing, supports the transcription of pre-recorded audio files, and offers customization options such as creating custom words and commands to streamline repetitive tasks. Additionally, Dragon Professional v16 includes access to Dragon Anywhere Mobile, a cloud-based dictation solution for iOS and Android devices, ensuring productivity on the go.
    Starting Price: $699 one-time payment
  • 8
    SpeechText.AI

    SpeechText.AI

    SpeechText.AI

    Transcribe audio and video into text. Get accurate transcriptions of podcasts with domain-specific speech recognition. SpeechText.AI is a powerful artificial intelligence software for speech to text conversion and audio transcription. Upload audio or video files. AI transcription software supports various file formats and transcribes from speech to text in any language. Select domain. Select industry domain and audio type from predefined categories to improve the recognition accuracy of domain-specific words. Transcribe. Our speech transcription engine uses state-of-the-art deep neural network models to convert from audio to text with close to human accuracy. Edit & Export. Search, modify and verify audio transcriptions using interactive editing tools. Export your content in different formats. Why SpeechText.AI? Set of amazing features to help you transcribe audio and video in seconds. Speech recognition. Powerful speech-to-text tech.
    Starting Price: $19 one-time payment
  • 9
    Voicepoint Cloud
    The high-availability Voicepoint Cloud with a data centre in Switzerland offers a flexible, cost-effective speech recognition and dictation management solution for everyone who has to prepare a lot of documentation. With this sophisticated, high-performance cloud solution, you use the integrated speech recognition of Dragon Medical Direct, Dragon Legal Anywhere or Dragon Professional Anywhere and dictate directly in the target application where you get the result immediately as text. You also have access to the Winscribe dictation management solution in the Voicepoint Cloud, optimally covering your speech-based documentation processes. Whether you are in your practice, in the clinic, at your office or out, the cloud-based Voicepoint speech recognition and dictation solution supports documentation anywhere and anytime.
  • 10
    Dragon Legal

    Dragon Legal

    Nuance Communications

    Dragon Legal is a specialized speech recognition software tailored for legal professionals, offering a legal-specific language model trained on over 400 million words from legal documents. This enables attorneys and legal practitioners to dictate contracts, briefs, and legal citations with up to 99% accuracy, three times faster than typing. The software supports the creation of custom voice commands to automate repetitive tasks and allows for the transcription of pre-recorded audio files, enhancing workflow efficiency. Optimized for Windows 11 and compatible with Windows 10, Dragon Legal v16 also provides accessibility features such as "play that back" audio of dictated text and sophisticated macro commands, accommodating legal professionals with physical or cognitive disabilities. Additionally, it offers integration with Dragon Anywhere Mobile, a cloud-based dictation solution for iOS and Android devices, ensuring productivity on the go.
    Starting Price: $799 one-time payment
  • 11
    Dragon Professional Anywhere

    Dragon Professional Anywhere

    Nuance Communications

    Nuance Dragon Professional Anywhere empowers busy professionals, including remote workers, to use their voice naturally to create more detailed and accurate documentation quickly and easily. Mission critical documentation should be dictated by knowledge workers and field professionals, not technology limitations. Conversational AI empowers private and public sector professionals to document more naturally. Enables professionals to quickly and easily document the details of client meetings using speech recognition that is 3x faster than typing and up to 99% accurate. Most people speak at over 120 wpm but type at less than 40 wpm. Speak freely and as much as you like with no per-user limits. Business professionals can stay productive anywhere and focus on their clients and business rather than the technology.
  • 12
    Dragon Speech Recognition

    Dragon Speech Recognition

    Nuance Communications

    Putting words to work with AI‑powered speech recognition. Empower your employees to create high‑quality documentation. Save your organization time and money with Dragon Professional Anywhere, AI‑powered speech recognition that integrates into enterprise workflows. Empower attorneys to create high‑quality documentation and save time and money with Dragon Legal Anywhere, cloud‑hosted speech recognition that integrates directly into legal workflows. Enable officers to safely and efficiently meet reporting and documentation demands with this customized solution. Drive productivity at work and create and transcribe documents, short-cut repetitive steps—by voice. Seamlessly create, edit and transcribe legal documents by voice for improved efficiency, costs. Complete documents wherever work takes you with the cloud‑based, professional‑grade mobile dictation solution.
    Starting Price: $199.99 one-time fee per user
  • 13
    Fusion Speech
    Back-end speech recognition is the most significant technology development in the dictation and transcription industries. Without physician training, or changes in practice patterns, Fusion Speech® powered by Nuance’s SpeechMagic™ harnesses this powerful technology for facility-wide deployment in nearly every medical specialty. Capture dictation with Fusion Voice®, process the dictation through Fusion Speech, and boost transcription productivity in Fusion Text®. The Fusion modules drive cost savings in reoccurring labor and outsourcing fees. This is the speech recognition solution you have envisioned. Other speech recognition has provided cute gimmicks but fell short in offering a sustainable business application. Fusion Speech provides the tools you require to truly deploy speech recognition that returns measurable and tangible results for your investments.
  • 14
    Transcribe

    Transcribe

    Wreally

    Transcribe saves thousands of hours every month in transcription time for journalists, lawyers, podcasters, students and professional transcriptionists all over the world. Increase your productivity & save mountains of time when converting your interviews, audio notes, lectures, speeches, podcasts and any recorded speech to text. Put on your headphones, load your audio, slow it down and speak out what you hear. It's that simple. Our dictation engine will convert your speech to text on the fly. This is way faster than typing. We support English, Spanish, French, Hindi and almost all other European & Asian languages.
  • 15
    Braina

    Braina

    Brainasoft

    Braina (Brain Artificial) is an intelligent personal assistant, human language interface, automation and voice recognition software for Windows PC. Braina is a multi-functional AI software that allows you to interact with your computer using voice commands in most of the languages of the world. Braina also allows you to accurately convert speech to text in over 100 different languages of the world. Braina's artificial intelligence makes it possible for you to control your computer using natural language commands and makes your life easier. Braina is not a Siri or Cortana clone for PC but rather a powerful personal and office productivity software. It isn't just like a chat-bot; its priority is to be super functional and to help you in doing tasks. Braina helps you do things you do everyday. It is a multi-functional artificial intelligence software that provides a single window environment to control your computer and perform wide range of tasks using voice commands.
    Starting Price: $29 per year
  • 16
    Vocola 3

    Vocola 3

    Vocola 3

    Dictation with Windows Speech Recognition (WSR) works well for "WSR-friendly" applications like MS Word, Outlook, and PowerPoint. Dictated text is inserted directly into document text, and commands like "Delete hedgehog" can refer to specific document text. But WSR dictation works less well for "WSR-unfriendly" applications like MS Excel, Gmail, and most programming environments. Dictation is not inserted directly into document text, and commands cannot refer to document text. Vocola improves this situation by supporting direct dictation for WSR-unfriendly applications, and by allowing correction and modification of the just-dictated phrase. Vocola and WSR use the same underlying speech profile, so any improvements you make via training, correction, or the speech dictionary benefit WSR dictation and Vocola dictation equally. Dictation to WSR-unfriendly applications is essentially unusable in Vista, as every utterance raises the correction panel.
  • 17
    Talkatoo

    Talkatoo

    Talkatoo

    Talkatoo is a voice-enabled AI tool designed to integrate effortlessly with your workflow, transforming speech to text using specialized vocabularies. You focus on patient care; we handle the technology. Built to be affordable and tailored for clinics, Talkatoo helps you reclaim valuable time throughout your day. With processing speeds over 200 words per minute—five times faster than typing—and a built-in medical dictionary. Our key features—Auto-SOAP records, Desktop Dictation, and the AI Assistant empower you to streamline tasks with ease. Record entire appointments to generate formatted SOAP notes instantly, dictate into any application from notes to email, and use the AI Assistant to create discharge instructions, translate documents, and more. Simply download, click, and start speaking, no tech expertise needed.
    Starting Price: $117 per month
  • 18
    SpeechWrite

    SpeechWrite

    SpeechWrite

    SpeechWrite specializes in a range of cloud dictation and voice recognition agile workflow solutions designed to meet the flexible working needs of the modern-day professional. Scalable and future-proofed solutions to suit all types of organizations. Our industry-leading range of digital dictation and transcription solutions link authors and transcribers facilitating efficient communication. Individual and organizational workflow settings enhance flexibility to ensure you receive your written dictations quickly and efficiently when in the office or on the move. Use your most powerful tool, your voice, and put it to work. Our practical technology, sophisticated yet simple, allows you to enhance your working environment and simply work smarter. We listen, learn and collaborate to support you through every stage of the process while also offering professional guidance and support along the way.
  • 19
    Dragon Legal Anywhere

    Dragon Legal Anywhere

    Nuance Communications

    Nuance’s Dragon Legal Anywhere helps attorneys, judges, clerks, paralegals, and other legal professionals create high-quality documentation, in less time, by using the power of their voice. Legal documentation should be dictated by legal practitioners, not technology limitations. Conversational AI empowers legal teams to document more naturally. Dragon Legal Anywhere’s specialized vocabulary means professionals can dictate contracts, briefs, or format legal citations and other legal documentation, 3X faster than typing, with up to 99% accuracy right from the first use. Speak freely and as much as you like with no per-user limits—legal professionals can stay productive anywhere and focus on their clients and business rather than the technology. Create custom voice commands to insert standard clauses into documents. Or create step‑by‑step commands to automate multi‑part workflows by voice.
  • 20
    iSpeech Dictation
    Speak any message and iSpeech Dictation™ will put it into text format. Dictate using BlackBerry Messenger (BBM), text (SMS), email, or voice notes into text and send. The app's human-quality speech recognition is brought to you by iSpeech®, the creator of DriveSafe.ly®, award-winning leader in texting while driving applications. Speak any phrase or message and iSpeech Dictation™ will translate it into text. Talk and type.
  • 21
    Dictation Speech to Text
    You can now add custom words to improve speech recognition! Find the list in setup->manage custom words. Dictation Speech to text allows to dictate, record, translate and transcribe text instead of typing. It uses latest speech to text voice recognition technology and its main purpose is speech to text and translation for text messaging. Never type any text, just dictate and translate using your speech! Nearly every app that can send text messages can be configured to operate with 'Dictation Speech to text'. Dictate uses the builtin speech to text recognition engine. Dictation Speech to text supports more than 40 languages. Dictate offers 3 text zones, indicated by language flags, for which you can configure a different language in the settings. Thus you can switch between different language projects with a singe click. Translation is as easy as pushing the translation button. You can specify the translation target language in the app settings.
    Starting Price: $4.49 one-time payment
  • 22
    SpeechTexter

    SpeechTexter

    SpeechTexter

    SpeechTexter is a free multilingual speech-to-text application aimed at assisting you with transcription of any type of documents, books, reports or blog posts by using your voice. SpeechTexter allows adding custom voice commands for punctuation marks and some actions (undo, redo, make a new paragraph). Accuracy levels higher than 90% should be expected. It varies depending on the language and the speaker. SpeechTexter is used daily by students, teachers, writers, bloggers around the world. Voice-to-text software is exceptionally valuable for people who have difficulty using their hands due to trauma, people with dyslexia or disabilities that limit the use of conventional input devices. It will assist you in minimizing your writing efforts significantly. It can also be used as a tool for learning a proper pronunciation of words in the foreign language, in addition to helping a person develop fluency with their speaking skills. No download, installation or registration is required.
  • 23
    DeepScribe

    DeepScribe

    DeepScribe

    DeepScribe’s AI-powered scribe captures the natural conversation between a clinician and patient and automatically writes medical documentation, allowing clinicians to focus on patient care instead of note-taking. Through an easy-to-use mobile app, DeepScribe records the natural clinical encounter and transcribes it in real time. Our proprietary AI then extracts the medical information from the transcript, classifies it into a standard note, and then integrates that note directly into a clinician’s electronic health record system. Unlike traditional scribes, dictation tools, or other solutions, the ambient nature of DeepScribe means it doesn’t intrude on the patient visit or disrupt the clinical workflow. Providers can simply talk to their patient like normal, then review their notes after the visit and sign-off in their EHR. DeepScribe handles documentation, charting, and even populates suggested diagnostic coding based on the information extracted from the visit.
  • 24
    Voice Texting Pro

    Voice Texting Pro

    Sparkling Apps

    Sending messages or dictating has never been easier! Just speak into the microphone and convert your speech into text. Directly send your message to e-mail, sms, Twitter or Facebook. All features are easily available from a single screen. just speak into the microphone and convert your speech into text. Then directly send your message to e-mail, sms, Twitter or Facebook. You can also send it to your clipboard (copy) and use paste to use the dictated text in any other application. Voice Texting Pro uses superior speech recognition. There are no settings required, Just say the words! Voice Texting Pro doesn't need to learn your voice, no training is required. It works straight out of the box. All features are easily available from a single screen. Sparkling Apps is a young enterprise that has jumped on the possibilities in the current market and technologies. The mobile technology and social media domains offer unique opportunities.
  • 25
    Dictation Pro

    Dictation Pro

    DeskShare

    Having difficulty in typing your documents? Speak and let Dictation Pro type for you. Prepare your letters, reports, e-mails, or homework assignments just by speaking into a microphone. A good-quality headset is required. Dictation Pro is fast, easy and fun. You'll wonder how you managed without it! Type the documents with minimum keystrokes and mouse clicks. Dictation Pro turns your voice into text and enable hands-free typing of document. Speak into your microphone and words will appear on the computer screen, instantly, 10 times faster than typing. People have different voice modulations. Voice Training process helps Dictation Pro to identify your voice pitch and tone. The more you use Dictation Pro, the more accurate speech recognition will become. You can add special phrases, names or technical terms into the Vocabulary, for even more accurate dictation. Instead of using mouse or keyboard, just speak the command and Dictation Pro executes it for you.
  • 26
    Dictation.io

    Dictation.io

    Dictation.io

    Use the magic of speech recognition to write emails and documents in Google Chrome. Dictation accurately transcribes your speech to text in real time. You can add paragraphs, punctuation marks, and even smileys using voice commands. Dictation can recognize and transcribe popular languages including English, Español, Français, Italiano, Português, and many more. You can add new paragraphs, punctuation marks, smileys and other special characters using simple voice commands. For instance, say "New line" to move the cursor to the next list or say "Smiling Face" to insert :-) smiley. Dictation uses Google Speech Recognition to transcribe your spoken words into text. It stores the converted text in your browser locally and no data is uploaded anywhere. Learn more. Dictation lets you write text in any language by voice alone, without needing a keyboard or mouse.
  • 27
    Speechy

    Speechy

    Speechy

    Speechy is an easy-to-use real-time dictation application based on the latest artificial intelligence and powerful speech recognition engine. In Speechy you can dictate the speech into text without the need for a keyboard to enter text. It also helps pronunciation practice of foreign language learning and minutes of meeting memo. Speechy not only transcribes your words, but also records your VOICE so you can refer to the original recording later! Plus, you can easily share your text and audio files later! (Works with Evernote, Dropbox, Google Drive, OneDrive, Facebook, Twitter, Snapchat, WhatsApp and other iOS supported sharing apps.) Whether you’re a professional writer, doctor, lawyer, disabled or somehow prevented from traditional typing, Speechy will swiftly solve your transcription problems and help you achieve your writing goals today! And Speechy doesn’t stop there! Speechy is global-focused, and will recognize your native language.
    Starting Price: $5.99 one-time payment
  • 28
    Echo Speech-to-Text

    Echo Speech-to-Text

    Echo Speech-to-Text

    Voice typing. Dictate into any website. Real-time voice transcription. Echo - Speech-to-Text is a state-of-the-art voice typing tool that works on most websites. Experience the most accurate speech recognition accuracy available. Key Features: - ✨ Automatic Punctuation: Enjoy automatic punctuation for polished, professional text. - 🗣️ Voice Type Directly into Textbox: No weird overlay or copy-pasting. - 🌍 Multi-language Support: Supports 50+ languages, including English, Spanish, German, French, etc. - 🛠️ Custom Vocabularies: Add specialized vocabulary or uncommon nouns to boost transcription accuracy. - ⌨️ Keyboard Shortcut: Start and pause voice recognition quickly with a simple keyboard shortcut. 🔒 Trusted and Secure Your privacy is our priority – we do not collect or share your data. We do NOT store any dictation text in our database. 🛡️ HIPAA Compliance We are HIPAA compliant in practice. Audio recordings are never stored. Transcription texts are
  • 29
    Soniox

    Soniox

    Soniox

    Soniox develops highly accurate foundational speech models that transcribe, translate, and understand speech as it happens, and also provides the developer platform that makes it easy to integrate real-time voice intelligence into any application. Soniox Speech-to-Text API allows you to transcribe speech in 60+ languages in real-time with high accuracy - built for large scale. Soniox also provides regional data residency and is SOC 2 Type 2, GDPR and HIPAA compliant.
    Starting Price: $0.10/hour of audio
  • 30
    Azure AI Speech
    Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages.
  • 31
    Dictation - Voice to Text

    Dictation - Voice to Text

    Christian Neubauer

    ​Dictation - Voice to Text is an application that enables users to dictate, record, and translate text instead of typing, facilitating text generation in a 'dictation' setup with one speaker in front of the microphone. It supports more than 40 languages for dictation and over 40 languages for translation, allowing users to switch between different language projects with a single click. It offers AI-based transcription capabilities, allowing users to transcribe audio recordings, videos, voice memos, URLs, and YouTube content using OpenAI's speech recognition technology. Both audio recordings and text files can be accessed via the Apple 'Files' app and shared along with the text. With iCloud synchronization enabled, text is automatically synchronized across all devices running Dictation, including iPhone, iPad, macOS, and Apple Watch. It also supports the system font size setting and provides configurable button sizes for visually impaired users.
  • 32
    SpeechMotion
    Document a patient encounter with full or partial dictation, voice recognition, or on-the-go with a customized solution tailored to your unique environment. Solving common documentation issues, like lowering costs and integrating workflows, begins with choosing a solution designed to meet your evolving needs. Improve workflow efficiencies and physician adoption for a rapid return on investment with a partner committed to your long-term success. A leading, national provider of US-based transcription, speech recognition, voice capture and advanced documentation technologies, SpeechMotion partners with healthcare facilities and the organizations supporting them to create a customized documentation solution tailored to support both long and short-term goals. SpeechMotion provides the flexible options healthcare facilities need to quickly and efficiently document a complete patient story, all under one product and service umbrella.
  • 33
    Dragon Medical One
    Dragon Medical One is a speech-driven clinical documentation platform that helps healthcare professionals streamline their workflow and reduce the time spent on administrative tasks. Designed for ease of use, it integrates with Electronic Health Records (EHRs) and uses advanced speech recognition to capture clinical notes with high accuracy—no voice profile training required. Dragon Medical One offers real-time dictation, auto-punctuation, and customizable voice commands, making it easy for clinicians to document patient interactions and navigate systems hands-free. The platform also supports mobile access, enabling clinicians to work efficiently across various care settings, ultimately improving patient care and clinician satisfaction.
  • 34
    Augnito

    Augnito

    Augnito

    Augnito combines the power of Speech Recognition AI with ease of mobility. You can edit, format, and complete reports at the speed of human speech, with best-in-class accuracy. Now use your personal templates and short forms from any workstation whether you are in the office, or at home or in the journey in between. Best suited for clinical specialties producing detailed reports such as Radiology, Histopathology and Surgical Notes, you can now dictate your reports from anywhere in the world. Augnito understands diverse accents and pronunciations out-of-the-box with no profile training. Built with the latest deep learning technology, it has the entire language of medicine which covers 50+ specialties and sub-specialties combined with all popular generic and drug names.
  • 35
    Picovoice

    Picovoice

    Picovoice

    Picovoice is the first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, Speech-to-Intent (intent detection) and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.
  • 36
    Flow

    Flow

    Flow

    Use your voice to type 3x faster than your keyboard, anytime, anywhere. Designed for effortless dictation. Turn rambling thoughts into clear concise messages. Improve the clarity and structure of your writing. Become productive across all your writing needs. Use voice to get through your email in half the time. Send quick responses effortlessly with your voice. Speak detailed prompts for smarter AI outputs. Break through writer’s block and write with intention. Experience the future of voice-first writing today. Let your voice do the typing everywhere.
  • 37
    GoVivace

    GoVivace

    GoVivace

    Our automatic speech recognition engine supports several English accents and can be localized to any language. Also, the ASR engine supports standard telephony as well as web and mobile applications. Being capable of actioning voice commands given to electronic devices such as computers, tablets, smartphones or telephones with the aid of a microphone, the GoVivace’s Automatic Speech Recognition Engine finds use in diverse applications. This automatic speech recognition engine compares the spoken input with a number of pre-specified possibilities and convert speech to text. The entire set of pre-specified possibilities constitute the application’s grammar, which powers the interface between the dialogue-speaker and the back-end processing. GoVivace’s patented Automatic Speech Recognition solution needs only very simple grammar for its processing. It can also support very large grammars for complex tasks.
  • 38
    Voicy

    Voicy

    Voicy Speech-to-Text

    Voicy - Write with your voice, everywhere. 
 
A free speech-to-text Chrome extension that lets you write with your voice on every text field on the internet. 
Voicy is powered by AI for enhanced accuracy and automatic punctuation and grammar fixes. Once installed, a microphone element will appear next whenever you click on a text field on the internet. That microphone element allows you to dictate your text directly into the text field.
    Starting Price: $6.99/month
  • 39
    Utterly Voice

    Utterly Voice

    Utterly Voice

    ​Utterly Voice is a highly customizable voice dictation and computer control application designed for a completely hands-free computing experience. It allows users to type text, edit content, press keyboard shortcuts, manage windows, scroll content, control the mouse, and create macros using only their voice. Compatible with Windows 10 and 11, Utterly Voice supports English language input, with plans for additional language support in the future. The application offers multiple speech recognizers and models to choose from, including Vosk, Microsoft Azure, Deepgram, Google Cloud Speech-to-Text V1, and Whisper. Users can easily type individual letters, alphanumerics, or code, and benefit from powerful customization abilities using text configuration files. Advanced mouse control methods, configurable voice commands, and control over speech recognition bias enhance the user experience.
  • 40
    Express Scribe

    Express Scribe

    NCH Software

    Express Scribe is a free audio player specifically designed for typists and transcription work. Featuring foot pedal control, variable speed, speech to text engine integration and support for a wide variety of audio formats including dss, dct, wav, mp3, wma and more. Audio recordings can be loaded automatically from email, LAN, FTP, local hard drive and Express Delegate. Traditional hand held dictation recorders can also be docked.
    Starting Price: $39.95/one-time/user
  • 41
    Gboard

    Gboard

    Google

    Gboard has everything you love about Google Keyboard—speed and reliability, Glide Typing, voice typing, Handwriting, and more. Type faster by sliding your finger from letter to letter. Easily dictate text on the go. Write in cursive and printed letters. Search and share GIFs for the perfect reaction. No more switching between languages manually. Gboard will autocorrect and suggest from any of your enabled languages. Translate as you type in the keyboard.
  • 42
    SpokenData

    SpokenData

    ReplayWell

    Let the automatic speech-to-text technology transcribe your data. Or transcribe your data yourself or buy professional transcript. Use our on-line time synchonous editor to surf your data and transcripts. Download transcripts in many formats. Manage your team of transcribers using tags and categories. Help them with transcription by automatic voice-to-text technology. Integrate SpokenData into your application via our REST API. We adapt the voice-to-text on your data domain to maximize the transcript accuracy and lower your labor costs. Enable speech technologies in your applications through integrating SpokenData using our REST API. We are ready to process huge amounts of your data. You get API fitting your needs. Just contact our support team. We customize the voice-to-text on your data and purpose to maximize the transcript accuracy. Suitable for: web/mobile app developers, media monitoring agencies, audio/video archive business.
  • 43
    Just Press Record

    Just Press Record

    Just Press Record

    Just Press Record is the award-winning mobile audio recorder that brings one-tap recording, transcription and iCloud syncing to all your devices. Turn your voice recordings into text which you can tweak right inside the app and fine-tune your audio by cutting out the parts you don’t need. Life is full of moments we would rather not forget, like your child’s first words, an important meeting or a great idea. Capture and sync these moments effortlessly on Mac, iPad, iPhone and, for ultimate convenience, Apple Watch! A record button everywhere, ready to go when you need it. Unlimited recording time, background recording and pause / resume make it the perfect recorder. Make professional quality recordings up to 96kHz / 24-bit with external microphones connected via the Lightning Port, in M4A, WAV or AIF files. Turn speech into editable, searchable text with support for over 30 languages, independent of your device’s language setting! You can even add punctuation!
  • 44
    MacWhisper

    MacWhisper

    Gumroad

    ​MacWhisper enables users to quickly and easily transcribe audio files into text using OpenAI's Whisper technology. Users can record directly from their microphone or any input device on their Mac, or drag and drop audio files for high-quality transcription. It supports recording meetings from platforms like Zoom, Teams, Webex, Skype, Chime, and Discord, with all transcription processing done locally to ensure data privacy. Transcripts can be saved or exported in various formats, including .srt, .vtt, .csv, .docx, .pdf, markdown, and HTML. MacWhisper offers fast transcription speeds, supports over 100 languages, and provides features like search, audio playback synced to transcripts, filler word removal, and speaker addition. The Pro version includes additional functionalities such as batch transcription, YouTube video transcription, AI service integrations (e.g., OpenAI's ChatGPT, Anthropic's Claude), system-wide dictation, and translation of audio files into other languages.
    Starting Price: €59 one-time payment
  • 45
    Aiko

    Aiko

    Aiko

    High-quality on-device transcription. Easily convert speech to text from meetings, lectures, and more. The transcription is powered by OpenAI's Whisper running locally on your device. The audio never leaves your device.
  • 46
    Speechnotes

    Speechnotes

    Speechnotes

    Speechnotes is a powerful speech-enabled online notepad, designed to empower your ideas by implementing a clean & efficient design, so you can focus on your thoughts. We strive to provide the best online dictation tool by engaging cutting-edge speech-recognition technology for the most accurate results technology can achieve today, together with incorporating built-in tools (automatic or manual) to increase users' efficiency, productivity and comfort. Works entirely online in your Chrome browser. No download, no install and even no registration needed, so you can start working right away. Speechnotes is especially designed to provide you a distraction-free environment. Every note, starts with a new clear white paper, so to stimulate your mind with a clean fresh start. All other elements but the text itself are out of sight by fading out, so you can concentrate on the most important part, your own creativity.
  • 47
    TalkText

    TalkText

    TalkText

    TalkText is an AI-powered dictation tool designed to enhance productivity by converting natural speech into polished text across various applications on macOS. By pressing 'option + space', users can dictate in any app, and TalkText refines the input by removing filler words and correcting mistakes, resulting in clear and professional text. The tool also offers a 'restyle' feature, allowing users to select any text and instruct TalkText to rewrite it in a desired tone or style, such as making it more empathetic or confident. Supporting over 30 languages, TalkText ensures accurate transcription and proper formatting, including capitalization and punctuation. Privacy is a priority, with real-time audio processing that is not stored or used for model training. The platform offers a free tier with up to 2,000 words per month, with options to upgrade for unlimited usage.
    Starting Price: $6.50 per month
  • 48
    Harker

    Harker

    Harker

    Harker is a minimal, offline voice-to-text widget that transforms spoken words into written text anywhere you’d normally type, without sending your data to external servers. It sits unobtrusively, ready to activate via a global keyboard shortcut, and pastes your transcribed speech directly into the active text field, maintaining flow across apps. The tool processes everything locally; your voice and transcriptions never leave your device, ensuring privacy and security. Harker’s embedded model delivers near-instant results, eliminating lag or internet-dependent delays. Its design is intentionally lightweight and clean: it stays hidden until called and avoids cluttering your workspace. It works across any application, emails, chats, code prompts, and documents, and is especially useful in AI workflows, letting you speak prompts instead of typing them. Because it operates offline and independently of servers, it’s suited for sensitive environments or users wanting control over their data.
    Starting Price: $9.99 per month
  • 49
    Live Transcribe

    Live Transcribe

    Live Transcribe

    Live Transcribe has a new name, Live Transcribe & Sound Notifications. It's an app that makes everyday conversations and surrounding sounds more accessible among people who are deaf and hard of hearing, using just your Android phone. Using Google’s state-of-the-art automatic speech recognition and sound detection technology, Live Transcribe & Sound Notifications provides you free, real-time transcriptions of your conversations and sends notifications based on your surrounding sounds at home. The notifications make you aware of important situations at home, such as a fire alarm or doorbell ringing, so that you can respond quickly. Get notified of potential risky situations and personal situations based on sounds happening at home (for example, smoke alarm, siren, baby sounds). Get notifications with a flashing light or vibration to your mobile device or wearable. Timeline view lets you go back in history (currently limited to 12 hours) to see what was happening around you.
  • 50
    NoNotes

    NoNotes

    NoNotes

    For over 10 years NoNotes has worked with researchers, colleges and businesses on all types of audio transcription. Audio to text starting at $0.75/minute. Use the NoNotes Call Recorder to automatically record and transcribe any inbound or outgoing calls. Try the App for free in your favourite App Store. NoNotes works with leading Masters, PhD, college faculty and qualitative researchers on any type/size project. Use NoNotes to record, transcribe, share and manage your interviews. Unlimited recording and RoboTranscribe anywhere in the world. Upgrade to ProTranscribe anytime. Record inbound/outbound/conference calls or dictate. NoNotes providers users with unlimited storage. Manage multiple users / projects from one account, enable all staff to easily record and transcribe. Collaborate and share files, one easy dashboard to manage everything, dedicated customer success manager.
    Starting Price: $0.75 per minute