Offline speech recognition API for Android, iOS, Raspberry Pi
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Open-source industrial-grade ASR models
Multilingual speech recognition and audio understanding model
kaldi-asr/kaldi is the official location of the Kaldi project
On-device Speech Recognition for Apple Silicon
Captcha solver extension for humans
Audio foundation model excelling in audio understanding
SOTA Open Source TTS
Speech recognition for your site
Fast and accurate automatic speech recognition (ASR) for edge devices
A free, open source, and extensible speech-to-text application
StreamSpeech is a seamless model for offline speech recognition
A PyTorch-based Speech Toolkit
Port of OpenAI's Whisper model in C/C++
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Multilingual Automatic Speech Recognition with word-level timestamps
Cross-platform AI language practice app
Speech-AI-Forge is a project developed around TTS generation model
Voice Recognition to Text Tool
OpenVINO™ Toolkit repository
Toolkit for conversational AI