A text-to-speech, speech-to-text and speech-to-speech library
Open-source framework for intelligent speech interaction
Oobabooga - The definitive Web UI for local AI, with powerful features
Multi-modal large language model designed for audio understanding
Official Python inference and LoRA trainer package
Large Audio Language Model built for natural interactions
A Family of Open-Source Music Foundation Models
Transforming Multimodal Content into Captivating Multilingual Audio
Streaming Real-time Audio-Driven Avatar Generation
Toolkit for audio, music, and speech generation
The open-source voice synthesis studio powered by Qwen3-TTS
Audiocraft is a library for audio processing and generation
Implementation of AudioLM audio generation model in PyTorch
Create music with JavaScript
The official Go library for the OpenAI API
Generate music based on natural language prompts using LLMs
R Package for Music Score and Audio Generation
Taming Stable Diffusion for Lip Sync
Multimodal Diffusion with Representation Alignment
A Python library for audio data augmentation
The official .NET library for the OpenAI API
48 kHz stereo neural audio codec for general audio
AudioMuse-AI is an open-source Dockerized environment
Generate audiobooks from EPUBs, PDFs and text with captions
HunyuanVideo: A Systematic Framework For Large Video Generation Model