Clone a voice in 5 seconds to generate arbitrary speech in real-time
An Open Source text-to-speech system built by inverting Whisper
Unofficial Parallel WaveGAN
Nyquist is a language for sound synthesis and music composition.
A Very Low-Bitrate Codec for Speech Compression
a vocoder + equalizer + FFT effects version of radio_chung
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Generative Adversarial Networks for Efficient and High Fidelity Speech
phase vocoder for time scaling and pitch transposition etc.
DeepMind's Tacotron-2 Tensorflow implementation