Robust Speech Recognition via Large-Scale Weak Supervision
Data manipulation and transformation for audio signal processing
A tool for transcoding lossless audio files
A clean, lean CoDec Pack. FFDShow and LAV Combined.
Industrial-level controllable zero-shot text-to-speech system
Audio codecs extracted from Android Open Source Project
TorchMultimodal is a PyTorch library
A Conversational Speech Generation Model
Music Player Daemon SACD/DVD-A ISO decoder plugins
Precise MPEG Audio
Implementation of NÜWA, attention network for text to video synthesis
A Very Low-Bitrate Codec for Speech Compression
State-of-the-art deep learning based audio codec
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
A C, fast audio/video MPEG decoder.
Modern 3D engine and IDE written using C# and C++.