Offline speech recognition API for Android, iOS, Raspberry Pi
A fast, local neural text to speech system
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech to Text to Speech, sends text as OSC messages
A free, open source, and extensible speech-to-text application
Speech recognition module for Python
A modern ebook manager and reader with sync and backup
High-quality multi-lingual text-to-speech library by MyShell.ai
A deep learning toolkit for Text-to-Speech, battle-tested in research
Browser extension and cross-platform desktop app based on ChatGPT API
Featuring powerful AI capabilities and supporting e-book formats
Comprehensive Gradio WebUI for audio processing
Subtitle Creation Assistant
Toolkit for conversational AI
Transcribe any audio to text, translate and edit subtitles 100% locall
Capable of understanding text, audio, vision, video
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
The behavior guidance framework for customer-facing LLM agents
Anki flashcards on Android
Examples and guides for using the Gemini API
A generative speech model for daily dialogue
Models for the spaCy Natural Language Processing (NLP) library
Open source personal AI Assistant for Linux, Windows and Mac
State-of-the-art TTS model under 25MB