A text-to-speech, speech-to-text and speech-to-speech library
Navigable waveform built on Web Audio and Canvas
Reads an audio file and displays the waveform
Convert colors to synth presets
LLM-based Reinforcement Learning audio edit model
Python Audio Analysis Library: Feature Extraction, Classification
Swift audio synthesis, processing, & analysis platform
Data manipulation and transformation for audio signal processing
SOTA discrete acoustic codec models with 40/75 tokens per second
Component library and custom registry built on top of shadcn/ui
Subtitle Editor derived from 6.0c, but with VLC and Hunspell checker
An Open Source text-to-speech system built by inverting Whisper
Create synth presets from words
Easy, free Windows video editor with LUT support, cuts, and Splits+FX.
Software for loop and cue handling in .wav files.
HTML5 Based Subtitle Creation Tool
Unofficial Parallel WaveGAN
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Audio generation using diffusion models, in PyTorch
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
State-of-the-art deep learning based audio codec
WaveRNN Vocoder + TTS
General Speech Restoration
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)