Showing 123 open source projects for "ac audio encoder"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3, OPUS, OGG - Editing codecs: DNxHD, DNxHR, Apple ProRes, QT Animation, GoPro CineForm, Uncompressed YUV - Output codecs: H.264, H.265, VP8, VP9, AV1, OGV - Broadcast codecs: XDCAM HD422, AVC-Intra 100, XAVC, HAP - Old codecs: DV PAL, MJPEG, Xvid, WMV, MPEG - Archiving codec: FFV1 - Images creation: JPEG, Image - Burn & Rip: DVD, Blu-ray, DVD RIP - Analysis: Loudness & True Peak, Audio normalization, Cut detection, Black detection, Media, VMAF - Download: Web video
    Leader badge
    Downloads: 166 This Week
    Last Update:
    See Project
  • 2
    Step-Audio 2

    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    ...It integrates a latent-space audio encoder, discrete acoustic tokens, and reinforcement-learning–based training (CoT + RL) to enhance its ability to capture and reproduce voice styles, intonations, and subtle vocal cues. Moreover, Step-Audio2 supports tool-calling and retrieval-augmented generation (RAG), allowing it to access external knowledge sources or audio/text databases, thus reducing hallucinations and improving coherence in complex dialogues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Encoder of Death
    Encoder of Death is a video/audio file encoding app. Batch Processing: Encode multiple files simultaneously in queue. Audio/Video Conversion: Convert between video formats or extract audio from video. Format Support: Audio: MP3, WAV, FLAC, AIFF, AAC, M4A, OGG Video: MP4, MKV, AVI, MOV, WebM, FLV, WMV, MPEG, MPG, M4V, 3GP 1-27-25, I addressed the issue of FFmpeg not bundling with the executables.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    rtmp-rtsp-stream-client-java

    rtmp-rtsp-stream-client-java

    Library to stream in rtmp and rtsp for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 13 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Real-Time Voice Cloning

    Real-Time Voice Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that captures voice characteristics; this embedding is then used by a Tacotron-style synthesizer to generate spectrograms from text, which a WaveRNN-based vocoder finally turns into audio. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6

    opencore-amr

    Audio codecs extracted from Android Open Source Project

    Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.
    Leader badge
    Downloads: 7,227 This Week
    Last Update:
    See Project
  • 7
    IndexTTS2

    IndexTTS2

    Industrial-level controllable zero-shot text-to-speech system

    IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice cloning — meaning it can mimic a target speaker’s voice from a short reference sample — making it versatile for multi-voice uses. Compared to many open-source TTS tools, IndexTTS emphasizes efficiency and controllability: it offers faster inference, simpler training pipelines, and controllable speech parameters (like duration, pitch, and prosody), which is critical for production use.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    Ultravox

    Ultravox

    Fast multimodal LLM for real-time voice interaction and AI apps

    Ultravox is an open source multimodal large language model designed specifically for real-time voice-based interactions. It is built to process both text and spoken audio directly, eliminating the need for a separate speech recognition stage and enabling more seamless conversational experiences. Ultravox works by combining text prompts with encoded audio inputs, allowing it to understand spoken language alongside written instructions in a unified pipeline. Internally, it leverages pretrained...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    StaxRip

    StaxRip

    Video encoding GUI for Windows

    StaxRip is a powerful, open-source video and audio encoding GUI for Windows that orchestrates industry-standard console tools (such as x265, FFmpeg, mkvmerge) and frame-server systems (like AviSynth+ or VapourSynth) to allow users to transcode, mux, remux, or process media files with fine-grained control. It is not a “one-click” encoder; instead, it grants the user deep control over encoding settings, filtering, resizing, cropping, subtitles, audio processing, container formats, and more — making it a tool of choice for videophiles, enthusiasts, and anyone needing high-quality and customized media output. ...
    Downloads: 31 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    LatentSync

    LatentSync

    Taming Stable Diffusion for Lip Sync

    ...The system leverages a U-Net diffusion backbone, with cross-attention of audio embeddings (via an audio encoder) and reference video frames to guide generation, and applies a set of loss functions (temporal, perceptual, sync-net based) to enforce lip-sync accuracy, visual fidelity, and temporal consistency. Over versions, LatentSync has improved temporal stability and lowered resource requirements — making inference more practical (e.g. 8 GB VRAM for earlier versions, somewhat higher for latest models).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Internet Friendly Media Encoder

    Internet Friendly Media Encoder

    Simple, Flexible & Powerful H.265/HEVC & H266/VVC video encoder!

    A versatile, future-proof, and expandable multimedia encoder designed for ease of use. Internet Friendly Media Encoder supports adding subtitles and attachments, and can function as a remuxing tool with a Copy Stream option. It allows you to combine multiple video, audio, subtitle, and attachment streams into a single file, remove unwanted streams, or incorporate subtitles from another video without extracting.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 12
    FFmpeg Batch AV Converter

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine), that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to user cpu thread count. ...
    Leader badge
    Downloads: 3,913 This Week
    Last Update:
    See Project
  • 13
    VidCoder

    VidCoder

    A Blu-ray, DVD and video file transcoder for Windows

    VidCoder is a Windows-based open-source video transcoding and ripping tool that provides a graphical interface built around standard command-line multimedia tools. It lets users convert video files (or rip DVDs/Blu-rays, when supported) into modern formats and codecs, making it useful for people who want to compress, re-encode, or transcode video content without dealing directly with low-level encoder settings. Because VidCoder integrates and automates the invocation of complex backend...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    Multimodal

    Multimodal

    TorchMultimodal is a PyTorch library

    ...The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference implementations you can adopt or adapt. The design emphasizes composability: you can mix and match encoder, fusion, and decoder components rather than starting from monolithic models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    StyleTTS 2

    StyleTTS 2

    Towards Human-Level Text-to-Speech through Style Diffusion

    ...The architecture uses a two-stage training process and leverages an auxiliary speech language model to guide generation toward more natural and coherent utterances. StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    microenc

    microenc

    Batch audio encoding script for Linux/BSD

    microenc is a small Bash shell script for Linux/BSD for encoding directories with audio files to other formats using FFmpeg as encoder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MediaCoder

    MediaCoder

    Universal media transcoding software

    MediaCoder is a universal media transcoding software actively developed and maintained since 2005. It puts together most cutting-edge audio/video technologies into an out-of-box transcoding solution with a rich set of adjustable parameters which let you take full control of your transcoding. New features and latest codecs are added or updated constantly. MediaCoder might not be the easiest tool out there, but what matters here is quality and performance. It will be your swiss army knife for...
    Leader badge
    Downloads: 800 This Week
    Last Update:
    See Project
  • 18
    MMC is a commander-style media player for Windows, with native, hw accelerated video playing and translucent gui. Mpxplay is a console audio player for DOS and Win32 operating systems. x264vfw, x265vfw and xAV1vfw are video for windows encoder and decoder codecs, useful with VirtualDub.
    Leader badge
    Downloads: 213 This Week
    Last Update:
    See Project
  • 19
    mulder

    mulder

    MuldeR's OpenSource Projects

    OpenSource development since 2004. Featured projects: * LameXP - audio encoder front-end * MPlayer for Windows - custom MPlayer installer for Win32 * Simple x264 Launcher - simple GUI front-end for x264 * Many more...
    Leader badge
    Downloads: 391 This Week
    Last Update:
    See Project
  • 20
    LameXP

    LameXP

    Multi-Format Audio-Encoder Front-end

    LameXP is a free multi-format audio file converter that supports a variety of output formats, including MP3, AAC/MP4, Ogg Vorbis, Opus, as well as FLAC, and an even higher number of input formats. It also supports batch processing and can utilize multiple processor cores.
    Leader badge
    Downloads: 251 This Week
    Last Update:
    See Project
  • 21
    DVDStyler

    DVDStyler

    A cross-platform DVD authoring application

    DVDStyler is a cross-platform free DVD authoring application that makes possible for video enthusiasts to create professional-looking DVDs. DVDStyler provides over 20 DVD menu templates, allowing you to create your own menu designs and photo slideshows. After you select your DVD label name, video quality, video format, aspect ratio, and audio format, you can select a template to add video materials to. DVDStyler's interface supports drag-and-drop so you can add project buttons and movies...
    Leader badge
    Downloads: 4,247 This Week
    Last Update:
    See Project
  • 22
    wxLame

    wxLame

    wxLame is a free front-end for the LAME MP3 encoder.

    Author website: https://cfgnunes.com Website: https://cfgnunes.com/wxlame GIT repository: https://github.com/cfgnunes/wxlame
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    LAME (Lame Aint an MP3 Encoder)

    LAME (Lame Aint an MP3 Encoder)

    A high quality MP3 encoder

    LAME is an educational tool to be used for learning about MP3 encoding. The goal of the LAME project is to improve the psycho acoustics, quality and speed of MP3 encoding. Note: we provide source code only!
    Leader badge
    Downloads: 19,330 This Week
    Last Update:
    See Project
  • 24
    Accessible-Coconut

    Accessible-Coconut

    A GNU/Linux operating system accessible for visually impaired.

    Accessible-Coconut(AC) is a community driven GNU/Linux operating system which is completely accessible for persons with visual impairment. AC is derived from Ubuntu-MATE. Yes the goal is to make a free and open-source eyes free desktop environment. Forum : https://groups.google.com/forum/#!forum/accessible-coconut Telegram forum : https://telegram.me/accessible_coconut Project home : https://zendalona.com/
    Leader badge
    Downloads: 83 This Week
    Last Update:
    See Project
  • 25
    q4rescue

    q4rescue

    A live linux Rescue toolkit/Emergency OS - based on q4os Trinity

    A live linux system rescue toolkit based on q4os Trinity available as a bootable iso for administrating, repairing and cloning/restoring your system and data. Check wiki for full description : https://sourceforge.net/p/q4rescue/wiki/ Main tools: -Foxclone -Rescuezilla -Clonezilla -DDrescue-gui -qtfsarchiver -G4L -Apart -Testdisk -Photorec -Boot Repair -WoeUSB -Q4OS imager -UNetbootin -usbimager -Kdirstats -Kdiskmark -Rclone & Rclone...
    Downloads: 69 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB