audio processing free download

Showing 336 open source projects for "audio processing"

View related business solutions

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
No-Nonsense Code-to-Cloud Security for Devs | Aikido
Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.

Start for Free
1

Ultimate Vocal Remover (UVR5)

GUI for a Vocal Remover that uses Deep Neural Networks

This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).

Downloads: 153 This Week

Last Update: 2025-01-20
See Project
2

OpenVINO AI Plugins for Audacity

A set of AI-enabled effects, generators, and analyzers for Audacity

A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC, no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system such as CPU, GPU, and NPU.

Downloads: 85 This Week

Last Update: 2024-12-20
See Project
3

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 123 This Week

Last Update: 2025-06-26
See Project
4

MoviePy

Video editing with Python

MoviePy is a Python module for video editing, which can be used for basic operations (like cuts, concatenations, title insertions), video compositing (a.k.a. non-linear editing), video processing, or to create advanced effects. It can read and write the most common video formats, including GIF. MoviePy is an open source software originally written by Zulko and released under the MIT licence. It works on Windows, Mac, and Linux, with Python 2 or Python 3. The code is hosted on Github, where you...

Downloads: 83 This Week

Last Update: 2025-05-21
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 25 This Week

Last Update: 2025-05-27
See Project
6

Faust

Functional programming language for signal processing

Faust (Functional Audio Stream) is a functional programming language for sound synthesis and audio processing with a strong focus on the design of synthesizers, musical instruments, audio effects, etc. Faust targets high-performance signal processing applications and audio plug-ins for a variety of platforms and standards. The core component of Faust is its compiler. It allows to "translate" any Faust digital signal processing (DSP) specification to a wide range of non-domain specific languages...

Downloads: 12 This Week

Last Update: 2025-07-01
See Project
7

downkyi

Bilibili video downloader supporting 8K, batch, and toolbox tools

downkyi is an open-source downloader for Bilibili videos. It features a clean UI, QR-code login, batch downloads, support for 8K, HDR, Dolby Vision, audio/video extraction, watermark removal, and subtitle/danmaku retrieval. It leverages aria2c for multi-threaded downloading and FFmpeg for muxing and processing.

Downloads: 18 This Week

Last Update: 2025-07-07
See Project
8

Librosa

Python library for audio and music analysis

Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.

Downloads: 8 This Week

Last Update: 2025-07-03
See Project
9

SuperCollider

Audio server, programming language, and IDE for sound synthesis

SuperCollider is a platform for audio synthesis and algorithmic composition, used by musicians, artists, and researchers working with sound. It is free and open source software available for Windows, macOS, and Linux. scsynth, a real-time audio server, forms the core of the platform. It features 400+ unit generators (“UGens”) for analysis, synthesis, and processing. Its granularity allows the fluid combination of many known and unknown audio techniques, moving between additive and subtractive...

Downloads: 13 This Week

Last Update: 2025-03-15
See Project
Your top-rated shield against malware and online scams | Avast Free Antivirus
Browse and email in peace, supported by clever AI

Our antivirus software scans for security and performance issues and helps you to fix them instantly. It also protects you in real time by analyzing unknown files before they reach your desktop PC or laptop — all for free.

Free Download
10

audioFlux

A library for audio and music analysis, feature extraction

A library for audio and music analysis, and feature extraction. Can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training and is used...

Downloads: 11 This Week

Last Update: 2024-08-09
See Project
11

Recorder

HTML5 js recording mp3 wav ogg webm amr format

... of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.

Downloads: 6 This Week

Last Update: 2025-01-11
See Project
12

Datasets

Hub of ready-to-use datasets for ML models

Datasets is a library for easily accessing and sharing datasets, and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, process large datasets with zero-copy reads without any memory constraints for optimal speed and efficiency. We also feature a deep integration...

Downloads: 8 This Week

Last Update: 2025-07-09
See Project
13

txtai

Build AI-powered semantic search applications

..., models can understand concepts in documents, audio, images and more. Machine-learning pipelines to run extractive question-answering, zero-shot labeling, transcription, translation, summarization and text extraction. Cloud-native architecture that scales out with container orchestration systems (e.g. Kubernetes). Applications range from similarity search to complex NLP-driven data extractions to generate structured databases. The following applications are powered by txtai.

Downloads: 9 This Week

Last Update: 2025-06-10
See Project
14

TorchAudio

Data manipulation and transformation for audio signal processing

The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...

Downloads: 4 This Week

Last Update: 2025-05-09
See Project
15

miniaudio

Audio playback and capture library written in C,

miniaudio is written in C with no dependencies except the standard library and should compile cleanly on all major compilers without the need to install any additional development packages. All major desktop and mobile platforms are supported. miniaudio gives you complete flexibility. With the low-level API, just initialize a connection to the device and send or receive raw audio data. The modular design of miniaudio allows you to use the low-level API without compromising your ability to make...

Downloads: 5 This Week

Last Update: 2025-02-24
See Project
16

ScreenPipe

AI app store powered by 24/7 desktop history. open source

Screenpipe is an AI app store powered by continuous desktop history recording. It operates entirely locally, offering developers a platform to build, distribute, and monetize AI applications that leverage comprehensive contextual data from users' desktop activities.

Downloads: 6 This Week

Last Update: 2025-04-08
See Project
17

Triton Inference Server

The Triton Inference Server provides an optimized cloud

... real-time, batched, ensembles, and audio/video streaming. Provides Backend API that allows adding custom backends and pre/post-processing operations. Model pipelines using Ensembling or Business Logic Scripting (BLS). HTTP/REST and GRPC inference protocols based on the community-developed KServe protocol. A C API and Java API allow Triton to link directly into your application for edge and other in-process use cases.

Downloads: 6 This Week

Last Update: 2025-06-26
See Project
18

mediasoup

Cutting Edge WebRTC Video Conferencing

mediasoup is a Node.js library that provides a cutting-edge WebRTC server capable of handling real-time communications with efficient media routing and processing.

Downloads: 3 This Week

Last Update: 2025-07-10
See Project
19

Overtone

Collaborative programmable music

Overtone is an open-source audio environment designed to explore new musical ideas from synthesis and sampling to instrument building, live coding and collaborative jamming. We combine the powerful SuperCollider audio engine, with Clojure, a state-of-the-art lisp, to create an intoxicating interactive sonic experience. Synchronize your visuals and noise with ease. Overtone features seamless integration with both Quil, a Clojure front-end to Processing and ShaderTone, a Clojure version...

Downloads: 3 This Week

Last Update: 2024-11-07
See Project
20

SFBAudioEngine

A powerhouse of audio functionality for macOS, iOS, and tvOS

SFBAudioEngine is an advanced audio engine designed for macOS and iOS, focusing on high-quality playback, precise audio control, and support for a wide range of audio formats. Built for modern Apple platforms, it provides developers with a robust tool for integrating sophisticated audio functionalities into their applications. It emphasizes extensibility, performance, and clean API design.

Downloads: 1 This Week

Last Update: 2025-03-14
See Project
21

Competent Audio

Machine graph audio engine for computer games

Competent Audio (CA) is an audio engine suitable for video games. It is written in C, but is designed for interoperability with other languages. Windows and Linux binaries for x86 and amd64 are available. CA uses a machine graph model with support for arbitrary numbers of machines, limited only by the available system resources: - Samplers play back audio clips. - Mixers combine audio signals and optionally perform signal processing. - Sinks send audio signals to an output device...

Downloads: 0 This Week

Last Update: 2024-07-13
See Project
22

AudioKit

Swift audio synthesis, processing, & analysis platform

AudioKit is an entire audio development ecosystem of code repositories, packages, libraries, algorithms, applications, playgorunds, tests, and scripts, built and used by a community of audio programmers, app developers, engineers, researchers, scientists, musicians, gamers, and people new to programming. An important goal for AudioKit is to allow it to grow and be maintainable by a handful of volunteers. For this reason we have extensive tests that are run whenever changes are made to any...

Downloads: 2 This Week

Last Update: 2025-03-11
See Project
23

MATLAB Deep Learning Model Hub

Discover pretrained models for deep learning in MATLAB

Discover pre-trained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images, the output is the predicted label and score.

Downloads: 2 This Week

Last Update: 2024-10-11
See Project
24

BotSharp

AI Multi-Agent Framework in .NET

Conversation as a platform (CaaP) is the future, so it's perfect that we're already offering the whole toolkits to our .NET developers using the BotSharp AI BOT Platform Builder to build a CaaP. It opens up as much learning power as possible for your own robots and precisely control every step of the AI processing pipeline. BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio processing...

Downloads: 2 This Week

Last Update: 2025-06-27
See Project
25

h2oGPT

Private chat with local GPT with document, images, video, etc.

h2oGPT is an open-source platform that allows users to interact with local GPT models in a completely private environment. It supports a variety of document types, including PDFs, Word files, images, video frames, and even audio, enabling users to query and analyze their documents or engage in a private chat with AI. The platform is designed to be secure and offline, ensuring that all data remains private and under the user's control. h2oGPT supports several AI models, including oLLaMa...

Downloads: 1 This Week

Last Update: 2025-02-22
See Project