GPT-SoVITS

GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.

Features

Zero‑shot TTS: generate speech from a 5‑second voice sample
Few‑shot fine-tuning: 1 minute of data for improved voice likeness
Cross-lingual support across multiple languages
Web UI for inference and batch generation
Open-source with pretrained model weights
Active community and publication‑grade performance

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow GPT-SoVITS

GPT-SoVITS Web Site

Other Useful Business Software

AI-powered service management for IT and enterprise teams

Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free

Rate This Project

User Reviews

Be the first to post a review of GPT-SoVITS!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Voice Cloning Software

Registered

2025-07-29

Similar Business Software

Voicv

Voicv is a cutting-edge voice cloning platform that transforms your voice into a digital asset in minutes, supporting multiple languages and zero-shot learning. It allows users to clone any voice with just a 10-30-second audio sample, maintaining high fidelity and natural expression. It...

See Software
Chatterbox

Chatterbox is a free, open source voice cloning AI model developed by Resemble AI, licensed under MIT. It enables zero-shot voice cloning using just 5 seconds of reference audio, eliminating the need for training. The model offers expressive speech synthesis with unique emotion control, allowing...

See Software
AnyVoice

AnyVoice is an ultra-realistic AI voice generator that enables users to convert text into natural-sounding speech using advanced AI technology. It offers hundreds of voices and supports instant voice cloning with just a 3-second recording. It provides multi-language support for English,...

See Software

Report inappropriate content

GPT-SoVITS

1 min voice data can also be used to train a good TTS model

Get an email when there's a new version of GPT-SoVITS

Features

Project Samples

Project Activity

Categories

License

Follow GPT-SoVITS

User Reviews

Additional Project Details

Operating Systems

Programming Language

Related Categories

Registered