Showing 133 open source projects for "audio visualization"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    MLX-Audio

    MLX-Audio

    A text-to-speech, speech-to-text and speech-to-speech library

    MLX-Audio is a speech library built on Apple’s MLX framework and optimized for Apple Silicon machines (M-series Macs). It focuses on text-to-speech and speech-to-speech workflows, with APIs and a command-line interface that make it easy to generate high-quality audio from text. Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance. The project provides a straightforward CLI...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    wavesurfer.js

    wavesurfer.js

    Navigable waveform built on Web Audio and Canvas

    wavesurfer.js is a customizable audio waveform visualization, built on top of Web Audio API and HTML5 Canvas. With wavesurfer.js you can create anything from an HTML5 audio player to a sophisticated DJ application. wavesurfer.js works only in modern browsers supporting Web Audio. It will fallback to Audio Element without graphics in other browsers (IE 11 and lower).
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ​Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other form of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Librosa

    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Rackula

    Rackula

    Drag and drop rack visualizer

    Rackula is a browser-based rack layout designer aimed at homelabbers, audio/video technicians, and equipment organizers who want a visual way to plan and document physical device racks. It runs entirely client-side with no backend server required, making it lightweight, fast, and easy to self-host or run locally without external dependencies. Users can drag and drop devices into customizable rack spaces, annotate equipment, set unit sizes, and manage complex layouts as their setup evolves....
    Downloads: 405 This Week
    Last Update:
    See Project
  • 6
    Scanopy

    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    ...The system places a premium on extensibility, allowing contributors to add new extractors or analysis modules tailored to specific industries or datasets. Its visualization tools help users explore captured information interactively, revealing patterns or anomalies that might otherwise require significant manual effort.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    pyAudioAnalysis

    pyAudioAnalysis

    Python Audio Analysis Library: Feature Extraction, Classification

    pyAudioAnalysis is an open-source Python library designed for audio signal analysis, machine learning, and music information retrieval tasks. The project provides a collection of tools that allow developers to extract meaningful features from audio files and use those features for classification, segmentation, and analysis. The library supports multiple audio processing workflows, including feature extraction from raw audio signals, training of machine learning models, and automatic audio...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    projectM

    projectM

    Cross-platform Music Visualization Library

    ...Its purpose in life is to read an audio input and to produce mesmerizing visuals, detecting tempo, and rendering advanced equations into a limitless array of user-contributed visualizations.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 9
    Riffusion App

    Riffusion App

    Stable diffusion for real-time music generation (web app)

    Riffusion App Hobby is an open-source interactive web application that enables real-time music generation using stable diffusion models adapted for audio synthesis. Unlike traditional music generation tools, it treats audio as spectrogram images and applies diffusion techniques to generate continuous sound transitions, allowing users to create evolving musical loops and compositions. The application is built with modern web technologies including Next.js, React, and three.js, providing a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    FDWaveformView

    FDWaveformView

    Reads an audio file and displays the waveform

    FDWaveformView is an easy way to display an audio waveform in your app. It is a nice visualization to show a playing audio file or to select a position in a file. To use it, add an FDWaveformView using Interface Builder or programmatically and then just load your audio as per this example.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AudioKit

    AudioKit

    Swift audio synthesis, processing, & analysis platform

    AudioKit is an entire audio development ecosystem of code repositories, packages, libraries, algorithms, applications, playgorunds, tests, and scripts, built and used by a community of audio programmers, app developers, engineers, researchers, scientists, musicians, gamers, and people new to programming. An important goal for AudioKit is to allow it to grow and be maintainable by a handful of volunteers. For this reason we have extensive tests that are run whenever changes are made to any...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    ...It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    TRIBE v2

    TRIBE v2

    A multimodal model for brain response prediction

    TRIBE v2 is a multimodal foundation model developed by Meta AI for predicting human brain activity from naturalistic stimuli such as video, audio, and text. It is designed for in-silico neuroscience, enabling researchers to model how the brain responds to complex real-world inputs. The system integrates state-of-the-art encoders—including LLaMA for text, V-JEPA for video, and Wav2Vec-BERT for audio—into a unified Transformer architecture. This combined representation is mapped onto the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TensorBoardLogger.jl

    TensorBoardLogger.jl

    Easy peasy logging to TensorBoard with Julia

    TensorBoardLogger.jl is a native library for logging arbitrary data to Tensorboard, extending Julia's standard Logging framework. It can also be used to deserialize TensoBoard's .proto files. The fundamental type defined in this package is a TBLogger, which behaves like other standard loggers in Julia such as ConsoleLogger or TextLogger. You can create one by passing it the path to the folder where you want to store the data. You can also pass an optional second argument to specify the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    LTX-2

    LTX-2

    Python inference and LoRA trainer package for the LTX-2 audio–video

    LTX-2 is a powerful, open-source toolkit developed by Lightricks that provides a modular, high-performance base for building real-time graphics and visual effects applications. It is architected to give developers low-level control over rendering pipelines, GPU resource management, shader orchestration, and cross-platform abstractions so they can craft visually compelling experiences without starting from scratch. Beyond basic rendering scaffolding, LTX-2 includes optimized math libraries,...
    Downloads: 57 This Week
    Last Update:
    See Project
  • 16
    Amphion

    Amphion

    Toolkit for audio, music, and speech generation

    Amphion is a toolkit from OpenMMLab dedicated to audio, music, and speech generation, aimed at both reproducible research and helping newcomers get started in generative audio. It provides standardized implementations and recipes for classic and state-of-the-art generative models in audio, including TTS, music generation, and voice conversion. A distinctive feature of Amphion is its emphasis on visualization: it offers interactive visualizations of model architectures and generation processes, making it easier to understand how complex generative audio models work. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    PlayCanvas Engine

    PlayCanvas Engine

    Fast and lightweight JavaScript game engine built on WebGL and glTF

    PlayCanvas is an open-source game engine. It uses HTML5 and WebGL to run games and other interactive 3D content in any mobile or desktop browser. PlayCanvas is used by leading companies in video games, advertising and visualization such as Animech, Arm, BMW, Disney, Facebook, Famobi, Funday Factory, IGT, King, Miniclip, Leapfrog, Mojiworks, Mozilla, Nickelodeon, Nordeus, NOWWA, PikPok, PlaySide Studios, Polaris, Product Madness, Samsung, Snap, Spry Fox, Zeptolab, Zynga. The PlayCanvas Engine...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 18
    TensorBoardX

    TensorBoardX

    tensorboard for pytorch (and chainer, mxnet, numpy, etc.)

    The SummaryWriter class provides a high-level API to create an event file in a given directory and add summaries and events to it. The class updates the file contents asynchronously. This allows a training program to call methods to add data to the file directly from the training loop, without slowing down training. TensorboardX now supports logging directly to Comet. Comet is a free cloud based solution that allows you to automatically track, compare and explain your experiments. It adds a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Selkies-GStreamer

    Selkies-GStreamer

    Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop

    selkies-gstreamer is a GStreamer-based media streaming component used in the Selkies project, a cloud-native platform designed for interactive desktop and application streaming. This module acts as a high-performance media pipeline that captures video, encodes it with low latency, and streams it via WebRTC to client browsers. It is optimized for GPU-accelerated encoding and integrates with Kubernetes-based deployments to enable scalable, real-time remote desktop sessions. This component...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    LARA is software for musical analysis using (new) scientific methods for analysis and visualization. LARA is part of the core research: “Interpretation and performance” of the HSLU – Musik (University of Applied Sciences Luzern – Music depart
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    xoscope is a digital oscilloscope using input from a sound card or EsounD and/or a ProbeScope/osziFOX and will soon support Bitscope hardware. Includes 8 signal displays, variable time scale, math, memory, measurements, and file save/load.
    Leader badge
    Downloads: 18 This Week
    Last Update:
    See Project
  • 22
    Zenamp

    Zenamp

    Multi-format audio player with MIDI, playlists & visualization

    Lightweight GTK3 audio player supporting MIDI, WAV, MP3, OGG, FLAC, AIFF, and Opus formats. Features OPL3 FM synthesis for authentic MIDI playback, drag-and-drop playlist queue, real-time spectrum visualization, 10-band equalizer, and M3U playlist support. Built with SDL2 audio backend for cross-platform compatibility across Linux and Windows. Includes intuitive controls with keyboard shortcuts, 5-second seek buttons, and efficient format conversion.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23

    pysoundanalyser

    a python program to generate, visualize, and manipulate short sounds

    pysoundanalyser is a Python application that can be used to generate, visualize, and manipulate short sounds through a graphical user interface. Visualization functions include visualization of the power spectrum, the spectrogram, the autocorrelation, and the autocorrelogram of a sound. Manipulation functions include filtering, concatenating, cutting, and scaling the level of a sound. Several types of sounds can also be generated including, pure tones, harmonic complex tones, noise of...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    ExpSuite
    ExpSuite is a software framework for applications to perform psychoacoustical experiments. ExpSuite allows acoustic and electric stimulation for normal hearing and cochlear implant listeners, respectively.
    Downloads: 604 This Week
    Last Update:
    See Project
  • 25
    SonicDive-8D-Music-Player

    SonicDive-8D-Music-Player

    SonicDive 8D Music Player v-1.0

    SonicDive is an immersive audio visualization & effects-based music player designed to deliver a next-level listening experience. It combines dynamic spectrums with advanced spatial audio effects like 3D & 8D sound. ✨ Features 🎵 Audio Visual Spectrums SonicDive supports multiple real-time audio visualizations: 💿 Disk Spectrum 📊 Bars Spectrum 🌊 Wave Spectrum 🖼️ Thumbnail Spectrum ⭕ Circle Spectrum Each spectrum reacts dynamically to the music’s frequency and intensity...
    Downloads: 8 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB