Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Open-source industrial-grade ASR models
Audio foundation model excelling in audio understanding
kaldi-asr/kaldi is the official location of the Kaldi project
On-device Speech Recognition for Apple Silicon
Captcha solver extension for humans
A PyTorch-based Speech Toolkit
Port of OpenAI's Whisper model in C/C++
A free, open source, and extensible speech-to-text application
StreamSpeech is a seamless model for offline speech recognition
Fast and accurate automatic speech recognition (ASR) for edge devices
Cross-platform AI language practice app
Multilingual Automatic Speech Recognition with word-level timestamps
Toolkit for conversational AI
OpenVINO™ Toolkit repository
Voice Recognition to Text Tool
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
A cross-platform software for text translation and recognition
Underthesea - Vietnamese NLP Toolkit
Repo of Qwen2-Audio chat & pretrained large audio language model
Speech to Text to Speech, sends text as OSC messages
The behavior guidance framework for customer-facing LLM agents