Alternatives to SoapBox

Compare SoapBox alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to SoapBox in 2026. Compare features, ratings, user reviews, pricing, and more from SoapBox competitors and alternatives in order to make an informed decision for your business.

  • 1
    Speechmatics

    Speechmatics

    Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision—across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence. 🔹 Unmatched Accuracy – Superior transcription across languages & accents 🔹 Flexible Deployment – Cloud, on-prem, and hybrid 🔹 Enterprise-Grade Security – Full data control 🔹 Real-Time & Batch Processing – Scalable transcription
    Starting Price: $0 per month
  • 2
    Twilio Voice
    Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Create an engaging voice experience that you can quickly scale and modify with a wide array of customization options and resources, like our Voice SDK. Then, add on features like Interactive Voice Response (IVR), recording transcriptions, and speech recognition to create an experience that your customers will appreciate. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice. Find docs, code samples, helper libraries, and developer tools such as Twilio Runtime and our visual workflow builder, Studio.
    Starting Price: $0.0085 per min
  • 3
    Speech2Structure
    When treating a patient, doctors spend on average two-thirds of their time documenting the treatment and far less time on examinations or patient interviews. To allow doctors to spend more time with their patients, Averbis is working on Speech2Structure – a software solution where the documentation is recorded live by voice and structured on-the-fly. Speech2Structure can correctly recognize and resolve many linguistic variations such as negations, suspected diagnoses, diagnoses that have taken place, etc. when recognizing diagnoses. Pathological laboratory values or microbiology results are also converted into corresponding diagnoses. The recorded medications can also provide clues to diagnoses.
  • 4
    Clarifai

    Clarifai

    Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for developing better, faster and stronger AI. We help our customers create innovative solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. The platform comes with the broadest repository of pre-trained, out-of-the-box AI models built with millions of inputs and context. Our models give you a head start in extending your own custom AI models. Clarifai Community builds upon this and offers 1000s of pre-trained models and workflows from Clarifai and other leading AI builders. Users can build and share models with other community members. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been recognized by leading analysts, IDC, Forrester and Gartner, as a leading computer vision AI platform. Visit clarifai.com
  • 5
    SoundHound

    SoundHound AI

    We believe every brand should have a voice and every person should be able to interact naturally with the products around them, by simply talking. At SoundHound Inc., we’re working together with our strategic partners to build a more accessible and connected world. We build custom voice assistants for companies wanting to keep their brand, users, and data. Built on the foundation of proprietary Speech-to-Meaning® and Deep Meaning Understanding® technologies, the Houndify platform provides conversational intelligence unmatched by others in the industry. Houndify everything! Voice-enable the world with conversational intelligence. Create a voice AI platform that exceeds human capabilities and brings value and delight via an ecosystem of billions of products enhanced by innovation and monetization opportunities. Headquartered in the heart of Silicon Valley, we are a global company with 9 offices in key markets and teams in 16 countries.
  • 6
    OTO

    OTO Systems

    OTO allows call centers 100% visibility of what is said during customer calls within 20 hours. Complement your NPS scoring with in-call intonation analytics. Identify call agent engagement and proactively set your WFM plan. Pick calls for QA faster. OTO is language-agnostic and gives you output parameters on various angles. Our API allows companies to start analyzing 100% of in-call conversations within a couple of hours. Sign up for a free trial and start analyzing your call data! Voice is the most valuable touchpoint between you and your customer. We're here to help you truly understand and leverage your voice data at scale. Whether you're building a mobile app or data analytics dashboards, our lightweight DeepTone™ engine gives you access to our powerful voice models on any device, providing you with a rich layer of acoustic labels for nearly every audio format.
    Starting Price: $100 per month
  • 7
    Soapbox

    Endurance Learning

    Soapbox finds the right activities for your learning objectives. Find your next bolt of inspiration in minutes. Soapbox generates the materials you need to successfully deliver your presentation. While Soapbox cannot predict your actual content, users report that it has gotten them 50%-80% of the way to their final training presentation, saving hours’ or even days’ worth of development time. You can choose to swap out any activity generated by Soapbox. Each activity comes with two to five alternatives that may better fit your presentation and/or the facilitator’s comfort level and will always fit the situation. If you feel a presentation is at risk of activity fatigue from all of the activities generated by Soapbox, you can delete activities you find to be superfluous.
    Starting Price: $29 per month
  • 8
    Imagine Language & Literacy
    Imagine Language & Literacy is a personalized learning program designed to accelerate both literacy skills and English language development for students in grades PreK–6. It offers direct, explicit, and systematic instruction across four language domains, adapting learning pathways to each student's needs. It integrates the "Big 5" components of reading (phonemic awareness, phonics, fluency, vocabulary, and comprehension), aligned with the science of reading. It dynamically adjusts instruction and groups students based on skill gaps, suggesting targeted activities. Imagine Language & Literacy supports multilingual learners with human-voiced audio in 15 languages and print materials in additional languages, gradually transitioning students to English proficiency.
  • 9
    The SOAPbox

    The Social Foundry

    The SOAPbox is software designed to expose database operations via web services. As a result, using the SOAPbox significantly reduces the cost and time of typical integration projects by removing the complexity associated with creating internet-facing Application Program Interfaces (APIs). Whether it's a mobile initiative or just a way to connect your systems, leave the complex API management to us so you can focus on your development. The SOAPbox is a perfect building block for both on-premise and cloud infrastructure as it creates APIs in less time than traditional hand-coding methods or large-scale SOA initiatives. No more expensive middleware platforms or fragmented integration projects. Simple and easy to use, the SOAPbox is truly integration in a little black box.
    Starting Price: $1800.00/one-time
  • 10
    Agara

    Agara

    Agara is the world's leading Real-time Voice AI SaaS platform that processes customer support calls in real-time to eliminate hold time, reduce manual inputs and improve customer experience. Agara significantly improves customer satisfaction (CX) scores while reducing support costs by over 50%.
  • 11
    Talkatoo

    Talkatoo

    Talkatoo is a voice-enabled AI tool designed to integrate effortlessly with your workflow, transforming speech to text using specialized vocabularies. You focus on patient care; we handle the technology. Built to be affordable and tailored for clinics, Talkatoo helps you reclaim valuable time throughout your day, with processing speeds over 200 words per minute (five times faster than typing) and a built-in medical dictionary. Our key features, Auto-SOAP records, Desktop Dictation, and the AI Assistant, empower you to streamline tasks with ease. Record entire appointments to generate formatted SOAP notes instantly, dictate into any application from notes to email, and use the AI Assistant to create discharge instructions, translate documents, and more. Simply download, click, and start speaking; no tech expertise needed.
    Starting Price: $117 per month
  • 12
    Vozy

    Vozy

    Vozy transforms the way companies interact with customers through voice assistants and conversational artificial intelligence to boost customer-centric enterprises with an automation that really works. With personalized solutions designed to meet the growing omnichannel customer care demand, Vozy is delivering significant cost savings and unprecedented customer experiences for companies in Latin America. That’s why powerhouses like SURA, Bancolombia, Protección, and Emtelco trust Vozy.
  • 13
    Soapbox

    Soapbox

    Software for the next generation of social media. Soapbox is customizable open-source software that puts the power of social media in the hands of the people. Feature-rich and hyper-focused on providing a user experience to rival Big Tech, Soapbox is already home to some of the biggest alternative social platforms. Connect with users across the Fediverse — a network of over 5,000 connected sites that hosts 4.4 million users. Soapbox is entirely built by dedicated members of the Free Software community. Please consider making a donation to support our mission to make decentralized social media the new standard and protect users from the abuses of Big Tech.
    Starting Price: Free
  • 14
    Soapbox

    Wistia

    The easiest way to create a great video by yourself. Soapbox is the only tool you need to record, edit, and share videos in minutes. With Soapbox, all you need to create a great video is our Chrome extension, a webcam, and something to say! Hit record, and then edit to share your webcam, your screen, or a split-screen view. Make the video-creation process a breeze. With just one extension (and a Soapbox Station if you're feeling fancy) your entire team will be able to master video production—from shooting to sharing. Combine a talking-head style recording with a screencast, or add presentation elements (like slides or other videos) to easily build a library of compelling, informational content, fast! With just a few clicks, you can export your videos to share with your audience or create a gallery that drives relevant traffic to your website with Wistia Channels. Soapbox makes it easy to create relevant and timely content, fast.
    Starting Price: $300.00/year
  • 15
    Alibaba Cloud Intelligent Speech Interaction
    Intelligent Speech Interaction is developed based on state-of-the-art technologies such as speech recognition, speech synthesis, and natural language understanding. Enterprises can integrate Intelligent Speech Interaction into their products to enable them to listen, understand, and converse with users, providing users with an immersive human-computer interaction experience. Intelligent Speech Interaction is currently available in Mandarin Chinese, Cantonese Chinese, English, Japanese, Korean, French and Indonesian, and please stay tuned for other languages. Intelligent Speech Interaction is suitable for various scenarios, including intelligent Q&A, intelligent quality inspection, real-time subtitling for speeches, and transcription of audio recordings. Intelligent Speech Interaction has been successfully applied in many industries such as finance, insurance, eCommerce and smart home.
    Starting Price: $1.40 per hour
  • 16
    Symbl

    Symbl.ai

    Symbl is an API platform for developers and businesses to rapidly deploy conversational intelligence at scale – on any channel of communication. Our comprehensive suite of APIs unlock proprietary machine learning algorithms that can ingest any form of conversation data to identify actionable insights across domains and channels (voice, email, chat, social) contextually – without the need for any upfront training data, wake words, or custom classifiers. Symbl is democratizing conversational tech to make collaboration effortless at scale. We provide the technology for organizations to deploy at scale our proprietary workplace productivity API so brands can optimize key workflows for knowledge workers or enhance the customer experience. Whether you are a seasoned developer or just starting to explore how to harness employee collaboration to fit your organization’s needs, our API can be customized for your specific applications.
  • 17
    LumenVox Automatic Speech Recognition (ASR)
    Transforming customer engagement with AI-powered voice recognition and voice authentication technology. Our flexible voice-enabled technology allows you to create a solution that meets all of your customers' demands, affordably and reliably. We do one thing, and we do it well. And that's voice enablement for your apps. Finally, deliver great voice automation and interactions. Whether it's short, simple commands or conversational questions, LumenVox ASR and TTS are accurate and affordable, helping you improve efficiency on both sides of the phone line. You will never repeat yourself. Recognize multiple dialects from a single global language model to serve all your customers. We give you maximum flexibility from a capabilities, implementation and monetization perspective. If you can think it, you can build it with LumenVox.
  • 18
    ELSA Speak

    ELSA Speak

    ELSA, English Language Speech Assistant, is a fun and engaging app specially designed to help you improve your English pronunciation. ELSA's artificial intelligence technology was developed using voice data of people speaking English with various accents. This allows ELSA to recognize the speech patterns of non-native speakers, setting it apart from most other voice recognition technologies. Strict but caring, the ELSA AI Coach pays close attention to every bit of progress you make along the way, and reminds you when you go off track. You will be rewarded for your hard work. ELSA gets smarter every day! Traditional language learning is transformed by our personalized English teaching technology. Our self-evolving AI analyzes your performance and behavioral data to personalize your daily curriculum. We are the first and best speech recognition app designed to evaluate and give immediate, detailed feedback on pronunciation and fluency.
    Starting Price: Free
  • 19
    Talkio AI

    Talkio AI

    Talkio AI is built on top of ChatGPT and lets you interact with the AI through voice to train your oral language skills. Talkio AI offers premium voices and supports multiple dialects for the most popular languages. With our advanced language technology, you can immerse yourself in authentic conversations and gain proficiency in the dialects that matter most to you. Ever wondered how it would be to have a personal language tutor available anytime, anywhere? At Talkio AI, we turn this dream into reality. Our AI Tutors are the perfect companions to improve your oral language skills. Powered by advanced AI technology, they mimic human interaction and conversation, offering an immersive language learning experience that is both engaging and effective.
  • 20
    FortressIQ

    Automation Anywhere

    FortressIQ enables enterprises to decode work, transform experiences, and enhance workflows with the industry’s most advanced process intelligence platform. Using innovative computer vision and artificial intelligence, FortressIQ delivers unprecedented process insights, extremely fast, and with detail and accuracy unattainable with traditional methods. The platform autonomously acquires process data at scale even as processes extend across systems, empowering enterprises to understand, monitor, and improve operations, employee and customer experiences, and every business process. FortressIQ was founded in 2017, and is backed by Lightspeed Venture Partners, Boldstart Ventures, Comcast Ventures, Eniac Ventures, M12 and Tiger Global. Pinpoint inefficiencies and process variations continuously and automatically to reveal optimal process paths and reduce time to automation.
  • 21
    Classtime

    Classtime Inc.

    Classroom management solution designed for students and teachers with features such as analytics, real-time grading, and libraries. Classtime is a solution for teachers that complements in-class teaching with immediate feedback on students' level of understanding. Create great questions, engage everyone, improve understanding. No registration required to see how it works!
    Starting Price: $9.00/month/teacher
  • 22
    Voice Pro

    LinguaTec

    Voice Pro Enterprise has been developed especially for use in enterprises. The recognition is done on the company server and can be accessed from any device (PC, Mac, smartphone, tablet). This ensures that all in-house information remains within the company. No more time-consuming speaker training is necessary, thanks to the speaker-independent recognition technology: Just speak into your device and you will see the transcribed text immediately. Companies finally have a sophisticated and secure speech recognition solution at their disposal. Regardless of whether you need to create a document at your work station, write an email on the move or dictate a sales report on site: Voice Pro Enterprise saves time and helps to make employees more productive. Voice Pro Enterprise results in a noticeable increase in employee efficiency. With Voice Pro Enterprise you dictate on average three times faster than you type. The high recognition accuracy minimizes post-processing.
    Starting Price: €149 one-time payment
  • 23
    IDVoice

    ID R&D

    Voice biometrics is the science of using a person’s voice as a uniquely identifying characteristic for the purpose of authentication and/or personalizing the user experience. The technology is referred to in a variety of ways including voice verification, speaker verification, speaker identification and speaker recognition. There are two ways we put voice biometrics into practice. The first is Text Independent Voice Verification. This approach does not depend on the person speaking a particular passphrase. The other is Text Dependent Voice Verification, in which the user enrolls using a specific phrase but, unlike a password, this phrase is not secret. IDVoice enables both options depending on your use case, and in some scenarios they may be used together.
  • 24
    GoVivace

    GoVivace

    Our automatic speech recognition engine supports several English accents and can be localized to any language. Also, the ASR engine supports standard telephony as well as web and mobile applications. Being capable of actioning voice commands given to electronic devices such as computers, tablets, smartphones or telephones with the aid of a microphone, GoVivace’s Automatic Speech Recognition Engine finds use in diverse applications. This automatic speech recognition engine compares the spoken input with a number of pre-specified possibilities and converts speech to text. The entire set of pre-specified possibilities constitutes the application’s grammar, which powers the interface between the dialogue-speaker and the back-end processing. GoVivace’s patented Automatic Speech Recognition solution needs only very simple grammar for its processing. It can also support very large grammars for complex tasks.
  • 25
    Abacus.AI

    Abacus.AI

    Abacus.AI is the world's first end-to-end autonomous AI platform that enables real-time deep learning at scale for common enterprise use-cases. Apply our innovative neural architecture search techniques to train custom deep learning models and deploy them on our end-to-end DLOps platform. Our AI engine will increase your user engagement by at least 30% with personalized recommendations. We generate recommendations that are truly personalized to individual preferences, which means more user interaction and conversion. Don't waste time dealing with data hassles. We will automatically create your data pipelines and retrain your models. We use generative modeling to produce recommendations, which means that even with very little data about a particular user/item you won't have a cold start.
  • 26
    Analance
    Combining Data Science, Business Intelligence, and Data Management Capabilities in One Integrated, Self-Serve Platform. Analance is a robust, scalable end-to-end platform that combines Data Science, Advanced Analytics, Business Intelligence, and Data Management into one integrated self-serve platform. It is built to deliver core analytical processing power to ensure data insights are accessible to everyone, performance remains consistent as the system grows, and business objectives are continuously met within a single platform. Analance is focused on turning quality data into accurate predictions, equipping both data scientists and citizen data scientists with point-and-click pre-built algorithms and an environment for custom coding. Ducen IT helps Business and IT users of Fortune 1000 companies with advanced analytics, business intelligence and data management through its unique end-to-end data science platform called Analance.
  • 27
    RapidMiner
    RapidMiner is reinventing enterprise AI so that anyone has the power to positively shape the future. We’re doing this by enabling ‘data loving’ people of all skill levels, across the enterprise, to rapidly create and operate AI solutions to drive immediate business impact. We offer an end-to-end platform that unifies data prep, machine learning, and model operations with a user experience that provides depth for data scientists and simplifies complex tasks for everyone else. Our Center of Excellence methodology and the RapidMiner Academy ensure customers are successful, no matter their experience or resource levels. Simplify operations, no matter how complex models are, or how they were created. Deploy, evaluate, compare, monitor, manage and swap any model. Solve your business issues faster with sharper insights and predictive models; no one understands the business problem like you do.
    Starting Price: Free
  • 28
    NaturalText

    NaturalText

    NaturalText A.I. helps you get more out of your data. Discover relationships, create collections, and unveil hidden insights in documents and other text-based data. NaturalText A.I. uses novel artificial intelligence technology to uncover hidden relationships in data. The software uses various state-of-the-art methods to understand context, analyze patterns, and reveal insights—all in a human-readable way. Reveal insights hidden in your data. Finding everything hidden in your text data is a difficult, if not impossible, task. With traditional search, you can only locate information related to a document. NaturalText A.I., on the other hand, uncovers new information within millions of documents, including scientific papers and patents. Use NaturalText A.I. to reveal insights in the data you are currently missing.
    Starting Price: $5000.00
  • 29
    Sia

    OneOrigin

    Sia™ revolutionizes higher education by streamlining student lifecycle management from enrollment to retention. This AI-driven tool quickly processes transcripts, aiding in credit transfers and boosting student retention. By analyzing academic histories and interests, Sia™ offers personalized course and career recommendations, enhancing student engagement and academic planning. Its role as a virtual assistant on university websites simplifies information access, reducing staff workload and improving student experience. Sia™'s innovative approach transforms administrative processes, ensuring efficient, personalized support for student success.
  • 30
    DeepScribe

    DeepScribe

    DeepScribe’s AI-powered scribe captures the natural conversation between a clinician and patient and automatically writes medical documentation, allowing clinicians to focus on patient care instead of note-taking. Through an easy-to-use mobile app, DeepScribe records the natural clinical encounter and transcribes it in real time. Our proprietary AI then extracts the medical information from the transcript, classifies it into a standard note, and then integrates that note directly into a clinician’s electronic health record system. Unlike traditional scribes, dictation tools, or other solutions, the ambient nature of DeepScribe means it doesn’t intrude on the patient visit or disrupt the clinical workflow. Providers can simply talk to their patient like normal, then review their notes after the visit and sign-off in their EHR. DeepScribe handles documentation, charting, and even populates suggested diagnostic coding based on the information extracted from the visit.
  • 31
    FirstLanguage

    FirstLanguage

    Our Natural Language Processing (NLP) APIs provide best-in-class accuracy at an affordable rate and cover all aspects of NLP under a single roof. Save weeks of time training and creating language models. Take advantage of our best-in-class APIs to kickstart your app development. We provide the building blocks to create your own apps effectively, such as chatbots, sentiment analysis, etc. Text classification on multiple domains and 100+ languages. Perform effective sentiment analysis. We grow when your business does, so we have put together simple pricing that allows you to easily scale your business when it needs to evolve. Perfect for individual developers who are creating apps or building proofs of concept. Head to the Dashboard and get your API Key. Place this in the header of all your API calls. Use our SDK in your preferred language to start coding. Or you can refer to the auto-generated code blocks provided in 18 programming languages.
    Starting Price: $150 per month
  • 32
    RocketWhisper

    Mojosoft Co., Ltd.

    RocketWhisper is a powerful desktop speech recognition and transcription application that runs 100% offline on your computer. Your voice data never leaves your machine - complete privacy guaranteed. Powered by OpenAI's Whisper engine with NVIDIA GPU (CUDA) acceleration, RocketWhisper delivers fast and accurate speech-to-text conversion for professionals, content creators, and anyone who works with voice and text. Key Features: - 100% offline processing - voice data never leaves your PC - OpenAI Whisper engine for high-accuracy speech recognition - NVIDIA CUDA GPU acceleration - up to 10x faster than CPU - Real-time voice-to-text input with global hotkey (Push-to-Talk with Right Alt) - Batch transcription of multiple audio/video files (MP3, WAV, M4A, MP4, MKV, AVI, etc.) - SRT/VTT subtitle export for video content - AI text formatting with LLM integration (OpenAI, Anthropic, Google Gemini, Grok, local LLM)
    Starting Price: $32 one-time
  • 33
    Gladia

    Gladia

    Gladia is a speech-to-text platform built for production, turning raw audio into structured outputs that power real workflows like meeting summaries, CRM enrichment, contact center QA, and real-time voice assistants. With support for 99+ languages and the ability to handle messy real-world audio—overlapping speakers, accents, code-switching, domain-specific terminology—Gladia is designed for the complexity of actual conversations, not clean studio recordings.
    Starting Price: 10 hours free
  • 34
    Dragon Law Enforcement

    Nuance Communications

    Eliminate the need to decipher handwritten notes or try to recall details from hours before. Officers simply speak to create detailed and accurate incident reports, 3 times faster than typing and with up to 99% recognition accuracy, all by voice. With a next-generation speech engine powered by Nuance Deep Learning technology, Dragon achieves high recognition accuracy while dictating, even for users with accents or those working in open office or mobile environments, making it ideal for diverse work groups and settings. Use fast and accurate dictation to enter data into RMS and CAD systems or other applications. Officers or support staff simply dictate anywhere they would normally type, and fill and navigate within form fields by voice.
  • 35
    Pronounce

    Pronounce

    Pronounce is an innovative language learning platform focused on enhancing English pronunciation and fluency through AI-driven tools. It offers instant feedback on American or British English accents, making it ideal for anyone looking to improve their spoken English. The platform features AI speech checking, meeting transcription, and AI chats with virtual speaking partners to practice conversational skills. Available with both free and premium plans, Pronounce caters to a broad audience, from language learners to professionals seeking to refine their communication skills in specific environments.
    Starting Price: Free
  • 36
    aiOla

    aiOla

    aiOla is a deep tech Conversational, Voice, and Speech AI lab with an enterprise-level automatic speech recognition (ASR) foundation model, Text-to-speech (TTS) technology and Natural Language Understanding (NLU). It’s designed to help enterprises and developers adapt speech technologies to any process, whether through seamless API integration or an intuitive in-house app. aiOla is revolutionizing enterprise operations with enterprise level Conversational AI. We specialize in speech-to-text and text-to-speech AI that deliver unmatched accuracy (95%), specialized in specific jargon, in any language, accent, vertical, or acoustic environment. From empowering frontline workers with hands-free workflows to enabling voice AI agents with enterprise-grade ASR and TTS, aiOla seamlessly integrates into workflows, internal apps and products.
  • 37
    goFLUENT

    goFLUENT

    goFLUENT is the world’s leading blended learning solution provider for acquiring and refining communication skills in strategic business languages such as English, French, German, Italian, Mandarin, Portuguese, and Spanish. Dedicated to diversity & inclusion, talent development, and employee retention, our global mission is to provide all employees with an equal voice to reach their full potential, regardless of their native tongue. We accelerate language training by delivering hyper-personalized solutions that blend technology, content, and human interaction, available globally on any device. Transforming more than 1,000 international corporations’ language training approaches in 150+ countries, goFLUENT speeds up the acquisition of language skills needed to gain confidence, save time, and grow their talent on a global scale.
  • 38
    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile

    AccuSpeechMobile's modern, robust speech recognition is optimized for mobile devices in over 40 languages. Designed for industry workflows, its cutting-edge noise abatement technology delivers outstanding recognition in noisy environments. A speaker-independent voice engine works for all users out of the box, without the need to voice train or maintain voice files for each user. AccuSpeechMobile is a 100% device-based solution. No voice server or middleware is required and no changes are needed to the backend system (WMS, ERP, EAM, CMMS). A cloud or network connection is not required to use the full functionality of device-based data collection. AccuSpeechMobile fully supports multi-modal capabilities so that users can hear spoken information and speak commands in tandem with the use of intelligent scanners. The ability to reference additional information on the device screen is also always available in conjunction with speech-to-text and text-to-speech commands.
  • 39
    tazti

    tazti

    Voice Tech Group

    Welcome to the tazti website! tazti is state-of-the-art Speech Recognition & Voice Recognition software. You can easily link tazti to files, folders, programs, videos and songs on your PC, to open them by voice control. Play PC Games, control applications, programs, and robots by voice command! Over 300,000 people have now tried tazti and its many features. tazti is super fun, especially if you are tired of pounding your keyboard or want an easy-to-use assistive technology. Great as well for people with Arthritis, Carpal Tunnel, Tendonitis, Fibromyalgia or other hand, finger or wrist pain.
    Starting Price: $39.99
  • 40
    AITalk

    AITalk

    AITalk

    Unlock language mastery with AITalk – your AI-powered companion for fluent conversations anytime, anywhere. Learn to speak naturally by chatting with AI. Pick topics, chat freely, and master any language, one conversation at a time. Boost your IELTS speaking skills and beyond with our all-in-one app: AI-powered conversations, writing assistance, creative naming, and grammar correction at your fingertips. Boost your IELTS Speaking score with our AI app, offering personalized practice and instant feedback for confident communication. Immerse yourself in authentic conversations with lifelike AI partners, each with their own unique voice and personality. This immersive experience enhances your learning and helps you understand different accents and speech patterns more effectively.
  • 41
    Verbio

    Verbio

    Verbio

    Increase security and improve user experience in daily interactions with the unique potential of voice. An innovative, language-agnostic, cost-effective and reliable alternative to seamlessly verify and identify users in real time. Voice biometrics makes it possible to automatically recognize any person through the characteristics of their voice, and it can smartly substitute for traditional authentication methods (cards, passwords, signature, fingerprint, etc.) in security access control, user verification for digital transactions, or fraud prevention and detection. With an easy and cost-effective solution, authentication through voice biometrics brings an innovative and safe experience to users, with risk-free remote access. Biometric authentication and identification through voice has never been so secure and fast, with different operational uttering models for each type of client and advanced anti-spoofing methodologies.
  • 42
    Earworms

    Earworms

    earworms Learning

    Have you ever had an earworm? Catchy music and lyrics that you just can't get out of your head? Well, utilizing the power of music, the Earworms Musical Brain Trainer puts the words of a foreign language into your head! In recent years there have been a lot of advances in language learning techniques, supported by the findings of neurological science and linguistic pedagogy. Earworms MBT bundles a lot of these findings into a powerful edutainment language learning system, unique in its teaching power. Listening to the melodious tracks puts users into a relaxed state of alertness, ideal for learning. The sound patterns, combined with rhythmic repetitions from a mesmeric male voice speaking English and a female native speaker of the target language, 'worm' their way into the auditory cortex -- the area of the brain from which words can be easily imagined and recalled.
  • 43
    Memrise

    Memrise

    Memrise

    Specialising in combining cognitive science, powerful tech and entertaining content, Memrise makes language learning genuinely recreational. We offer 200 language combinations across 24 languages on our website, iOS and Android apps. By leveraging lots of brain science and plenty of humour, we’re striving to enrich people’s consciousness and help people achieve confident, real-world language skills in just a few short months. Memrise’s courses have one thing that textbooks don’t: real-life language. Our team of in-house linguists are not only experts but also passionate about teaching you the language they speak themselves in everyday life. To add to the richness, our courses are packed with thousands of video clips of native speakers speaking in their native language, in their hometown. So you can learn to understand authentic voices and accents, as well as taking in the scenery and getting a sense of the culture.
  • 44
    Rubidium

    Rubidium

    Rubidium

    Rubidium enables leading companies to embed voice commands and text to speech in their products. Voice Trigger is an “always on” engine that continuously listens and wakes up when you say the proper “magic word”. Voice Trigger identification uses a sophisticated miniature footprint Automatic Speech Recognition (ASR) engine to run in the background and distinguish between the trigger phrase and the rest of the speech, sounds and noise. Automated Speech Recognition (ASR) easily and safely controls any set of functions through voice commands. For example: call acceptance and rejection, device setup and installation procedure (pairing, calibration, interconnection, etc.), voice dialing, music streaming control and music selection. Rubidium technology is now embedded in over 50 million consumer products with customers and partners including leading global brands such as RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux and many others.
  • 45
    Mastercard Market Trends
    Gather information that plays a fundamental role across all areas of the business. For payment industry players, it is critical to stay on top of the largest trends and access up-to-date market information, or risk getting left behind. Our Market Trends platform provides an in-depth view of payment insights, competitive intelligence and industry trends. Insights range from research on card performance across several markets to thought leadership on trending topics and deep-dive analysis of top fintech players. Market Trends provides a simple, curated view of reliable insights, all in one place. Easy-to-access reports provide insights for different markets globally on socio-economic, payment and digital KPI data, curated at a local level by Mastercard teams. Filtering and benchmarking features for card products issued by different schemes and issuers help you understand the current competitive landscape.
  • 46
    Work by Speech

    Work by Speech

    Mikołaj Magowski

    Work by Speech is the first program in the world that allows efficient work on a computer by speech without needing a keyboard and mouse. Work by Speech Features: - Efficient work on a computer by speech alone - Quiet speaking support - Application switching and opening by speech - Built-in voice commands for the most common actions - Custom voice commands management - Macro recording and editing - Separate dictation mode - Fast and repeatable mouse control by speech with support for all mouse actions - Customizable mousegrid that can be moved by speech - Automatic mousegrid optimization for every used application - Very low processor and memory usage - Works with any microphone under Windows 10 and 11 - Available for the English language only - Free updates
    Starting Price: Free
  • 47
    Open English

    Open English

    Open English

    Connect to our classes, which start every 30 minutes, from your computer or our app. Learn with an immersion method in English and native pronunciation experts. Enjoy a portal included in your course that prepares you for these exams at no additional cost. Receive a certificate based on the levels of the Common European Framework of Reference (CEFR). You will have at your fingertips an advanced voice recognition technology tool that will show you how to improve your diction. Master English at work thanks to our content focused on specific professional areas. In our program, you have 24-hour access to live classes with expert teachers in online teaching. Be part of our student group, where you can interact with other students and exchange ideas that will help you in your learning process. Various publications recognize the success of the #1 leading online English course, thanks to real results in real students.
    Starting Price: Free
  • 48
    TrulyNatural
    Sensory is a pioneer in the use of embedded neural network-based speech recognition and has become the industry leader in optimizing and engineering speech recognition software with small footprints and minimal MIPS. This extensive experience and continuous innovation have led to the first embedded large vocabulary continuous-speech recognizer (LVCSR) with state-of-the-art cloud performance. Unlike voice recognition software often used with smartphones and mobile devices, such as with a voice assistant mobile app, as well as with IoT (internet of things) enabled technologies (Alexa, Google Assistant, Siri, Cortana), Sensory’s solution is embedded and doesn’t require a Wi-Fi connection. Many applications don’t need or want to rely on a cloud-based connection to do high-performance speech recognition. Others seek a client/cloud distributed system with optimal performance. The market concerns regarding privacy, performance and bandwidth are driving more processing to the edge.
  • 49
    VoiceMe

    VoiceMe

    VoiceMe

    In an increasingly contactless world, a new model of digital trust is needed. VoiceMe enables people, companies, and objects to interact with each other through a simple interface and in an ultra-secure way, opening the door to a new generation of services. Access restricted physical areas while guaranteeing users' identity. Sign documents and contracts with legal validity. Our algorithms pre-identify the user based on behaviors, also using biometric parameters obtained from the upper face and voice. All customer-related data remains exclusively at the user's disposal, offering maximum privacy and respect for GDPR regulation. Each data set is encrypted, divided into pieces, and spread across a network of nodes, making it impossible for an unauthorized external source to extract. At each authorized data usage, the inverse process is performed to recompose the data set. An API or SDK for third parties allows easy integration into existing systems.
  • 50
    Whisper

    Whisper

    OpenAI

    We’ve trained and are open-sourcing a neural net called Whisper that approaches human-level robustness and accuracy in English speech recognition. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise, and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing. The Whisper architecture is a simple end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second chunks, converted into a log-Mel spectrogram, and then passed into an encoder.
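    The chunking step mentioned above can be illustrated with a minimal sketch. This is not Whisper's own code: the function name `split_into_chunks`, the 16 kHz sample rate, and the zero-padding of the final chunk are assumptions made for illustration, and the log-Mel spectrogram and encoder stages are left out.

```python
from typing import List

SAMPLE_RATE = 16_000   # assumed sample rate in Hz (not stated in the description)
CHUNK_SECONDS = 30     # chunk length given in the description
CHUNK_SAMPLES = SAMPLE_RATE * CHUNK_SECONDS


def split_into_chunks(samples: List[float],
                      chunk_samples: int = CHUNK_SAMPLES) -> List[List[float]]:
    """Split mono audio samples into fixed-length chunks (hypothetical helper)."""
    chunks = []
    for start in range(0, len(samples), chunk_samples):
        chunk = samples[start:start + chunk_samples]
        # Assumption: pad a short final chunk with silence so all chunks match in length.
        chunk = chunk + [0.0] * (chunk_samples - len(chunk))
        chunks.append(chunk)
    return chunks
```

    Under these assumptions, a 70-second recording would yield three chunks, the last holding 10 seconds of audio followed by 20 seconds of padding. Each chunk would then be converted to a log-Mel spectrogram before reaching the encoder.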