Clone a voice in 5 seconds to generate arbitrary speech in real-time
Industrial-level controllable zero-shot text-to-speech system
A fast TTS architecture with conditional flow matching
An Open Source text-to-speech system built by inverting Whisper
Nyquist is a language for sound synthesis and music composition.
Unofficial Parallel WaveGAN
A deep learning toolkit for Text-to-Speech, battle-tested in research
Best practice TTS based on BERT and VITS
Audio generation using diffusion models, in PyTorch
A Very Low-Bitrate Codec for Speech Compression
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
a vocoder + equalizer + FFT effects version of radio_chung
Text-to-Speech for Basque and Spanish
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
PAddle PARAllel text-to-speech toolKIT
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer based neural network
Deep learning for text to speech
Generative Adversarial Networks for Efficient and High Fidelity Speech
phase vocoder for time scaling and pitch transposition etc.
Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English