TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets underdataset_analysis. Demo server for model testing. Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). If you are only interested in synthesizing speech with the released TTS models, installing from PyPI is the easiest option.

Features

  • High performance Deep Learning models for Text2Speech tasks
  • Fast and efficient model training
  • Detailed training logs on console and Tensorboard
  • Support for multi-speaker TTS
  • Efficient Multi-GPUs training
  • Ability to convert PyTorch models to Tensorflow 2.0 and TFLite for inference

Project Samples

Project Activity

See All Activity >

License

Mozilla Public License 1.0 (MPL)

Follow TTS

TTS Web Site

Other Useful Business Software
Add Two Lines of Code. Get Full APM. Icon
Add Two Lines of Code. Get Full APM.

AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of TTS!

Additional Project Details

Registered

2021-10-18