Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
The open-source voice synthesis studio powered by Qwen3-TTS
Software synthesizer based on the SoundFont 2 specifications
Sonic Pi is your free code-based music creation and performance tool
Collaborative programmable music
Open speech-to-speech models and pipelines by Hugging Face
Functional programming language for signal processing
Free open source speech synthesizer for Russian and other languages
Framework for building real-time voice and multimodal AI agents
Open Source Speech Language Model
Controllable & emotion-expressive zero-shot TTS
Capable of understanding text, audio, vision, video
Stable diffusion for real-time music generation (web app)
Transforming Multimodal Content into Captivating Multilingual Audio
Translate a video from one language to another and embed dubbing
Industrial-level controllable zero-shot text-to-speech system
Offline Text To Speech synthesis for Python
Swift audio synthesis, processing, & analysis platform
A Systematic Framework for Interactive World Modeling
Synchronized Translation for Videos
Flash + AIR sound effects generator. Based on Sfxr.