Clone a voice in 5 seconds to generate arbitrary speech in real-time
A sound cloning tool with a web interface, using your voice
Instant voice cloning by MIT and MyShell. Audio foundation model
Comprehensive Gradio WebUI for audio processing
A simple, high-quality voice conversion tool focused on ease of use
A high-quality rapid TTS voice cloning model
1 min voice data can also be used to train a good TTS model
The open-source voice synthesis studio powered by Qwen3-TTS
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, voice cloning & 1107+ languages
Official PyTorch Implementation
A lightweight text-to-speech model with zero-shot voice cloning
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
One-stop AI digital human system with video voice synthesis tools
Multi-lingual large voice generation model, providing inference
Foundational model for human-like, expressive TTS
Real-time voice interactive digital human
The official Python SDK for the ElevenLabs API
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Spark-TTS Inference Code
Video translation and dubbing tool powered by LLMs
Open-source framework for intelligent speech interaction
Controllable & emotion-expressive zero-shot TTS
Easy-to-use Speech Toolkit including Self-Supervised Learning model
MARS5 speech model (TTS) from CAMB.AI