audio-to-midi free download

Showing 9766 open source projects for "audio-to-midi"

View related business solutions

8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
1

MIDI Visualizer

A small MIDI visualizer tool, using OpenGL

MIDIVisualizer is a cross-platform application that renders real-time visualizations of MIDI files using OpenGL. Inspired by the look of rhythm games and piano roll editors, it offers smooth animations and customizable themes to turn MIDI sequences into captivating graphical performances. It works as both a live visualizer and a tool to export visuals into video, making it ideal for musicians, VJs, and creators who want to produce visually engaging content synced to their compositions.

Downloads: 59 This Week

Last Update: 2025-12-14
See Project
2

NeuralNote

Audio Plugin for Audio to MIDI transcription using deep learning

NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. ...

Downloads: 48 This Week

Last Update: 2026-03-12
See Project
3

A2M — Audio to MIDI

A2M is a desktop app that converts AUDIO TO MIDI in one click.

A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano recording transcription, and works best on solo piano recordings. Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder.

Downloads: 25 This Week

Last Update: 2026-02-20
See Project
4

Giada

Your Hardcore Loop Machine.

...Process samples or audio/MIDI input signals with VST instruments from your plug-ins collection. Control other software or synchronize physical MIDI devices by using Giada as a MIDI master sequencer.

Downloads: 5 This Week

Last Update: 2025-12-05
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
5

Frescobaldi

LilyPond sheet music text editor

...Designed to be powerful yet lightweight and easy-to-use, Frescobaldi offers great functionality and a host of useful features such as music view with advanced two-way Point & Click, Midi capturing to enter music, a Snippet Manager and many more. Frescobaldi is named after Girolamo Frescobaldi (1583-1643), an Italian composer of keyboard music in the late Renaissance and early Baroque period.

2 Reviews

Downloads: 22 This Week

Last Update: 2026-02-11
See Project
6

MLX-Audio

A text-to-speech, speech-to-text and speech-to-speech library

...The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.

Downloads: 5 This Week

Last Update: 2026-03-14
See Project
7

Qwen2-Audio

Repo of Qwen2-Audio chat & pretrained large audio language model

Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models.

Downloads: 1 This Week

Last Update: 2025-09-23
See Project
8

Qwen-Audio

Chat & pretrained large audio language model proposed by Alibaba Cloud

Qwen-Audio is a large audio-language model developed by Alibaba Cloud, built to accept various types of audio input (speech, natural sounds, music, singing) along with text input, and output text. There is also an instruction-tuned version called Qwen-Audio-Chat which supports conversational interaction (multi-round), audio + text input, creative tasks and reasoning over audio.

Downloads: 1 This Week

Last Update: 2025-09-23
See Project
9

Scribbletune

Create music with JavaScript

Scribbletune is a JavaScript library for creating music and sequences using a simple and intuitive syntax, allowing developers to generate MIDI files and integrate music composition into their applications.

Downloads: 0 This Week

Last Update: 2026-02-18
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
10

Kimi-Audio

Audio foundation model excelling in audio understanding

Kimi-Audio is an ambitious open-source audio foundation model designed to unify a wide array of audio processing tasks — from speech recognition and audio understanding to generative conversation and sound event classification — within a single cohesive architecture. Instead of fragmenting work across specialized models, Kimi-Audio handles automatic speech recognition (ASR), audio question answering, automatic audio captioning, speech emotion recognition, and audio-to-text chat in one system, enabling developers to build rich, multimodal audio applications without stitching together disparate components. ...

Downloads: 0 This Week

Last Update: 2026-01-27
See Project
11

Step-Audio

Open-source framework for intelligent speech interaction

Step-Audio is a unified, open-source framework aimed at building intelligent speech systems that combine both comprehension and generation: it integrates large language models (LLMs) with speech input/output to handle not only semantic understanding but also rich vocal characteristics like tone, style, dialect, emotion, and prosody. The design moves beyond traditional separate-component pipelines (ASR → text model → TTS), instead offering a multimodal model that ingests speech or audio and produces speech accordingly, enabling natural dialogue, voice cloning, and expressive speech synthesis. ...

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
12

BlackHole

BlackHole is a modern macOS audio loopback driver

...The driver integrates directly with macOS Core Audio and appears in Audio MIDI Setup and supported audio applications. Designed with performance and stability in mind, BlackHole works on both Intel and Apple Silicon Macs without requiring kernel extensions or system security modifications. As an open-source project, it offers transparency, customization options, and active community-driven development.

Downloads: 91 This Week

Last Update: 2025-02-06
See Project
13

Basic Pitch

A lightweight audio-to-MIDI converter with pitch bend detection

...Provide a compatible audio file and a basic-pitch will generate a MIDI file, complete with pitch bends. The basic pitch is instrument-agnostic and supports polyphonic instruments, so you can freely enjoy transcription of all your favorite music, no matter what instrument is used. Basic pitch works best on one instrument at a time.

Downloads: 26 This Week

Last Update: 2024-08-16
See Project
14

Fun Audio Chat

Large Audio Language Model built for natural interactions

Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. ...

Downloads: 1 This Week

Last Update: 2026-02-27
See Project
15

FluidSynth

Software synthesizer based on the SoundFont 2 specifications

FluidSynth is a real-time software synthesizer based on the SoundFont 2 specifications and has reached widespread distribution. FluidSynth itself does not have a graphical user interface, but due to its powerful API several applications utilize it and it has even found its way onto embedded systems and is used in some mobile apps.

Downloads: 48 This Week

Last Update: 2026-02-21
See Project
16

Audio Priority Bar

A native macOS menu bar app for managing audio device priorities

Audio Priority Bar is a lightweight macOS utility that gives users precise control over how audio output is prioritized across different apps and devices, filling a gap in the system audio stack that Apple doesn’t natively expose. Once installed, it places an always-accessible control in the menu bar that lets you assign priority levels to individual audio sources so that more important sounds (like alerts, calls, or music) can override or duck less important ones (like background noise or game audio). ...

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
17

Step-Audio-EditX

LLM-based Reinforcement Learning audio edit model

Step-Audio-EditX is an open-source, 3 billion-parameter audio model from StepFun AI designed to make expressive and precise editing of speech and audio as easy as text editing. Rather than treating audio editing as low-level waveform manipulation, this model converts speech into a sequence of discrete “audio tokens” (via a dual-codebook tokenizer) — combining a linguistic token stream and a semantic (prosody/emotion/style) token stream — thereby abstracting audio editing into high-level token operations. ...

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
18

Step-Audio 2

Multi-modal large language model designed for audio understanding

Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. ...

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
19

Qtractor

An Audio/MIDI multi-track sequencer

Qtractor is an Audio/MIDI multi-track sequencer application written in C++ around the Qt framework. The target platform will be Linux, where the Jack Audio Connection Kit (JACK) for audio, and the Advanced Linux Sound Architecture (ALSA) for MIDI,

30 Reviews

Downloads: 338 This Week

Last Update: 2 days ago
See Project
20

Mixxx

Mixxx is Free DJ software that gives you everything you need

...Built-in mappings for DJ controller hardware gives you hands-on control of Mixxx's features. Use the programmable mapping system to customize your workflow and add support for any MIDI or HID device. Add your unique spin to tracks with sound effects. Get creative by chaining multiple effects together and twisting all their knobs.

Downloads: 16 This Week

Last Update: 2025-12-14
See Project
21

Sonic Pi

Sonic Pi is your free code-based music creation and performance tool

Sonic Pi is a new kind of musical instrument. Instead of strumming strings or whacking things with sticks - you write code, live. Sonic Pi is a complete open source programming environment originally designed to explore and teach programming concepts within schools through the process of creating new sounds. In addition to being an engaging education resource it has evolved into an extremely powerful and performance-ready live coding instrument suitable for professional artists and DJs....

Downloads: 14 This Week

Last Update: 2025-06-26
See Project
22

EasyRPG Player

RPG Maker 2000/2003 and EasyRPG games interpreter

...The main goal is to play all games created with them as the original game interpreter (RPG_RT) does. This project is being developed. FreeType2 for external font support (+ HarfBuzz for Unicode text shaping) mpg123 for better MP3 audio support. WildMIDI for better MIDI audio support. Libvorbis / Tremor for Ogg Vorbis audio support. opusfile for Opus audio support. libsndfile for better WAVE audio support. libxmp for better tracker music support. SpeexDSP for proper audio resampling.

Downloads: 6 This Week

Last Update: 2025-06-02
See Project
23

It's MyTabs

Open source, web based, self hostable guitar/bass tab viewer

It’s MyTabs is an open-source, web-based and self-hostable guitar/bass tablature viewer and player, built to give musicians their own alternative to subscription services like Songsterr or Soundslice. Users can upload tab files in formats such as GP, GPX, MusicXML, or CAPX, sync them with audio or YouTube videos, and play them back in a browser or mobile device. It supports features like MIDI synth track muting/soloing, mobile-friendly UI, dark/light themes, and a variety of cursor modes (auto-scroll, follow-cursor, highlight bar). Because it is deployable via Docker Compose or as a Windows executable, musicians can self-host their own tab library, keep data private, and share links conveniently. ...

Downloads: 0 This Week

Last Update: 2026-03-02
See Project
24

Overtone

Collaborative programmable music

Overtone is an open-source audio environment designed to explore new musical ideas from synthesis and sampling to instrument building, live coding and collaborative jamming. We combine the powerful SuperCollider audio engine, with Clojure, a state-of-the-art lisp, to create an intoxicating interactive sonic experience. Synchronize your visuals and noise with ease.

Downloads: 1 This Week

Last Update: 2024-11-07
See Project
25

LosslessCut

The swiss army knife of lossless video/audio editing

LosslessCut aims to be the ultimate cross platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting your large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby losing quality. ...

5 Reviews

Downloads: 634 This Week

Last Update: 2026-01-29
See Project