Showing 104 open source projects for "ffdshow audio decoder"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Super Audio CD Decoder
    Super Audio CD Decoder input plugin for foobar2000. Decoder is capable of playing back Super Audio CD ISO images, DSDIFF, DSF and DSD WavPack files. DSD(DoP) and PCM output modes. Separate DSD Processor/DSD Converter plugins for track extraction into DSD/DST encoded files.
    Leader badge
    Downloads: 5,085 This Week
    Last Update:
    See Project
  • 2
    DVD-Audio Decoder and Watermark Detector
    DVD-Audio Decoder input plugin and Watermark Detector/Neutralizer DSP plugins for foobar2000. Decoder is capable of playing back DVD-Audio discs, ISO images, AOB, MLP and Dolby TrueHD files in full resolution. Dedicated plugin for DTS-HD playback. APT-x100 plugin for *.AUD and *.AUE files from DTS Movie/Trailer Discs.
    Leader badge
    Downloads: 280 This Week
    Last Update:
    See Project
  • 3
    Whisper

    Whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    ...These tasks are jointly represented as a sequence of tokens to be predicted by the decoder, allowing a single model to replace many stages of a traditional speech-processing pipeline. The multitask training format uses a set of special tokens that serve as task specifiers or classification targets.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 4
    TorchAudio

    TorchAudio

    Data manipulation and transformation for audio signal processing

    The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    XLD

    XLD

    A tool for transcoding lossless audio files

    X Lossless Decoder(XLD) is a tool for Mac OS X that is able to decode/convert/play various 'lossless' audio files. The supported audio files can be split into some tracks with cue sheet when decoding. It works on Mac OS X 10.4 and later.
    Leader badge
    Downloads: 4,564 This Week
    Last Update:
    See Project
  • 6
    BlackBelt CodecPack

    BlackBelt CodecPack

    A clean, lean CoDec Pack. FFDShow and LAV Combined.

    Contains support for popular formats. Works especially well with MediaPortal. LAV, ffdshow - why choose between when you can have both in one pack ! WMV/WMA, DivX, AVI, ASF, FLV, Ogg FLAC, HEV1, x264, x265 etc. NO SPYWARE, NO ADWARE, NO TOOLBARS, NO PLAYER - JUST PURE CODECS Windows XP / Vista / 7 / 8 / 10 - 32/64 bit.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    IndexTTS2

    IndexTTS2

    Industrial-level controllable zero-shot text-to-speech system

    ...It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice cloning — meaning it can mimic a target speaker’s voice from a short reference sample — making it versatile for multi-voice uses. Compared to many open-source TTS tools, IndexTTS emphasizes efficiency and controllability: it offers faster inference, simpler training pipelines, and controllable speech parameters (like duration, pitch, and prosody), which is critical for production use.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8

    opencore-amr

    Audio codecs extracted from Android Open Source Project

    Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.
    Leader badge
    Downloads: 6,709 This Week
    Last Update:
    See Project
  • 9
    Multimodal

    Multimodal

    TorchMultimodal is a PyTorch library

    ...The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference implementations you can adopt or adapt. The design emphasizes composability: you can mix and match encoder, fusion, and decoder components rather than starting from monolithic models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    CSM (Conversational Speech Model)

    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11

    MPD

    Music Player Daemon SACD/DVD-A ISO decoder plugins

    Downloads: 19 This Week
    Last Update:
    See Project
  • 12
    MMC is a commander-style media player for Windows, with native, hw accelerated video playing and translucent gui. Mpxplay is a console audio player for DOS and Win32 operating systems. x264vfw, x265vfw and xAV1vfw are video for windows encoder and decoder codecs, useful with VirtualDub.
    Leader badge
    Downloads: 242 This Week
    Last Update:
    See Project
  • 13
    fleck
    MP3 decoder and player using madlib(recommended) and mpeglib, sample code showing how to use madlib and mpeglib to decode mp3 data, and also OGG Vorbis, AAC, MP4, Opus, the first version used to replace Libav in the project rsppmp3 standalone is ready now
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    libTiMidity is a MIDI to WAVE converter library that uses Gravis Ultrasound-compatible patch files to generate digital audio data from General MIDI files. This library based on the TiMidity decoder from SDL_sound library.
    Leader badge
    Downloads: 13 This Week
    Last Update:
    See Project
  • 15
    Tabuleiro
    MP3 decoder and player using madlib(recommended) and mpeglib, sample code showing how to use madlib and mpeglib to decode mp3 data, and also OGG Vorbis, AAC, MP4, Opus, the first version used to replace Libav in the project rsppmp3 standalone is ready now
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This project aims to create a DVD player for Linux and the Creative DXR3 (aka Sigma Designs Hollywood+) MPEG2 decoder board
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DragonOS
    *Until you install the operating system, the default user = live / no password. DragonOS Noble (24.04) DragonOS FocalX (22.04) and DragonOS Focal (20.04) are out-of-the-box Lubuntu based x86_64 operating systems for anyone interested in software defined radios. All source installed software is located in the /usr/src directory while the remaining software was installed by package managers. What is DragonOS and why do you want it? The shortest distance between two points is a...
    Leader badge
    Downloads: 2,153 This Week
    Last Update:
    See Project
  • 18

    pmaudio

    Precise MPEG Audio

    Precise MPEG Audio Decoder - Open source (GPL) - Small - Fast - Very Precise and Very Accurate - Floating-point and Fixed-point varieties - Works with Linux and Windows - Examples for using the library - Sample Input DLL for WinAmp - Sample command-line player - Decoding library derived from mpg123
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    NÜWA - Pytorch

    NÜWA - Pytorch

    Implementation of NÜWA, attention network for text to video synthesis

    Implementation of NÜWA, state of the art attention network for text-to-video synthesis, in Pytorch. It also contains an extension into video and audio generation, using a dual decoder approach. It seems as though a diffusion-based method has taken the new throne for SOTA. However, I will continue on with NUWA, extending it to use multi-headed codes + hierarchical causal transformer. I think that direction is untapped for improving on this line of work. In the paper, they also present a way to condition the video generation based on segmentation mask(s). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Lyra

    Lyra

    A Very Low-Bitrate Codec for Speech Compression

    lyra is a neural audio codec designed to deliver intelligible, natural-sounding speech at extremely low bitrates, making real-time communication viable on constrained networks. It replaces hand-engineered codecs with learned models that capture speech characteristics more efficiently and reconstruct waveforms with a neural vocoder. The system targets mobile-class hardware, balancing latency and quality so it can run in real-time on phones.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Denoiser

    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    Denoiser is a real-time speech enhancement model operating directly on raw waveforms, designed to clean noisy audio while running efficiently on CPU. It uses a causal encoder-decoder architecture with skip connections, optimized with losses defined both in the time domain and frequency domain to better suppress noise while preserving speech. Unlike models that operate on spectrograms alone, this design enables lower latency and coherent waveform output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    seek-h262

    A C, fast audio/video MPEG decoder.

    seek-h262 is a free and portable MPEG decoder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    jamailmar
    C code to play and extract OGG Vorbis media files ,the decoding engine is loaded dinamically in the process (as always) ,the processor time required to decode the OGG Vorbis is less than MP3 files ,the decoder engine is unencumbered of patents
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Vivid3D-Sharp

    Vivid3D-Sharp

    Modern 3D engine and IDE written using C# and C++.

    Modern 3D engine and IDE written using C# and C++. Vivid features all the required functionality to write modern apps ang games. Including a fully working custom GL UI called "ResonanceUI". It features lighting and shadows(Unified) and can even embed and playback videos via FFMPEG, in the UI or full screen, with fully synced audio+picture. It offers two scripting solutions: C# and a custom language called VividScript, which is an OOP style language. The Editor is based on other...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB