Search Results for "matlab projects for audio processing"

Showing 42 open source projects for "matlab projects for audio processing"

View related business solutions
  • Cut Your Data Warehouse Bill by 54% Icon
    Cut Your Data Warehouse Bill by 54%

    Migrate from Snowflake, Redshift, or Databricks with free tools. No SQL rewrites.

    BigQuery delivers 54% lower TCO with serverless scale and flexible pricing. Free migration tools handle the SQL translation automatically.
    Try Free
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    MATLAB Deep Learning Model Hub

    MATLAB Deep Learning Model Hub

    Discover pretrained models for deep learning in MATLAB

    Discover pre-trained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images, the output is the predicted label and score.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    OpenAI Go

    OpenAI Go

    The official Go library for the OpenAI API

    ...It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also offers examples to help developers quickly set up projects and test different API calls. Designed for reliability and ease of use, it is maintained to stay aligned with the evolving OpenAI API specifications.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    Moonshine Voice

    Moonshine Voice

    Fast and accurate automatic speech recognition (ASR) for edge devices

    ...The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. The toolkit also includes specialized model variants, including monolingual options that improve accuracy for specific languages. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    A2M — Audio to MIDI

    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    ...All processing is done locally on your device, no uploads, no accounts, and no telemetry. The app runs on CPU by default, with optional NVIDIA GPU acceleration for faster conversions. Project links: Website: justagwas.com/projects/a2m GitHub: github.com/Justagwas/a2m Documentation: https://github.com/Justagwas/a2m/wiki A2M is fully open source and operates only on the files you choose.
    Downloads: 18 This Week
    Last Update:
    See Project
  • Catch Bugs Before Your Customers Do Icon
    Catch Bugs Before Your Customers Do

    Real-time error alerts, performance insights, and anomaly detection across your full stack. Free 30-day trial.

    Move from alert to fix before users notice. AppSignal monitors errors, performance bottlenecks, host health, and uptime—all from one dashboard. Instant notifications on deployments, anomaly triggers for memory spikes or error surges, and seamless log management. Works out of the box with Rails, Django, Express, Phoenix, Next.js, and dozens more. Starts at $23/month with no hidden fees.
    Try AppSignal Free
  • 5
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    ekho

    ekho

    Chinese text-to-speech engine

    ekho is a project with relatively sparse documentation, but from the repository it appears to be a small-scale tool for audio processing and playback, possibly with features for speech synthesis or manipulation. The repo includes scripts and configuration files suggesting interactions with media/audio handling libraries. Because of limited README detail, it seems targeted at users comfortable reading and modifying code, rather than end users expecting polished UIs. The code structure implies...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Objective-Oriented Directivity

    MATLAB toolbox for processing directivity models

    The project is a framework developed in the form of a MATLAB toolbox, which aims to bring common interface for various directivity representations in acoustics. The legacy version was described in paper 10521 at 151st Audio Engineering Society Convention (https://arxiv.org/abs/2109.14370). The preprint on the current, improved version, can be found here: https://arxiv.org/abs/2206.12283. Currently not submitted anywhere, please refer to the toolbox by citing this website.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    slab3d

    slab3d

    slab3d is a real-time virtual acoustic environment.

    slab3d is a real-time virtual acoustic environment rendering system. It includes two rendering applications, libraries for spatial auditory display and audio signal processing development, and a set of tools for HRTF analysis and visualization.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Piano transcription

    Piano transcription

    Task of transcribing piano recordings into MIDI files

    Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 10
    Vivid 3D

    Vivid 3D

    Vivid is a modern C++ 3D engine using OpenGL4+

    Vivid is a modern C++ 3D engine using OpenGL4+. It is written using Visual C++ 2022, and relies on several open source projects to achieve it's goal of making it easy and run to make modern games with it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    VAD

    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    processing_codes_iEEG

    Functions to analyse intracranial EEG (SEEG and ECoG) recordings

    Here you will find functions and scripts that are useful when analysing intracranial EEG data (from depth SEEG electrodes, or ECoG subdural grids). Functions are for MATLAB. We use these function in FRONT neurolab (RITMO, University of Oslo) to do our analysis. Some are simple modifications from someone else's code. Other's are fully ours. Many functions are designed to work in combination with FieldTrip (http://www.fieldtriptoolbox.org/) and iElectrodes toolbox...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    LiVES

    LiVES

    LiVES is a Video Editing System. It is designed to be simple to use, y

    LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJ's and video editors - mix and switch clips from the keyboard, use dozens of realtime...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14

    QuickerSim CFD Toolbox for MATLAB®

    Fluid flow and heat transfer simulations in MATLAB®

    QuickerSim CFD Toolbox is a powerful application for performing fluid flow and heat transfer simulations in MATLAB ® making CFD analysis more accessible than ever. Our Toolbox provides a selection of solvers and data processing tools, which are compatible with other MATLAB® toolboxes and external CFD software. You can set up your simulations quickly as well as tailor the code to your own specific needs and create a custom multiphysics models.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    IDTE- ID3 Tag Editor

    IDTE- ID3 Tag Editor

    A Smarter Way to Tag FLAC, Vorbis, Lyrics, WMA, WavPack, and ID3 Tags

    IDTE is a full-featured tag editor for Windows that supports tagging of FLAC, APE, ID3v1.x/2.x, WMA, Lyrics, and Vorbis metadata in audio files. It also includes a powerful audio player with support for 40+ lossy and lossless formats, including FLAC, ALAC, OGG, APE, MP3, and more. IDTE offers a wide range of advanced features: • Rename files using tag information • Export and manage tag data • Create and manage playlists • Search online for missing or incomplete tags • Fetch lyrics and album artwork • Full CUE sheet support • Batch processing and multi-file editing • SFX, visualizations, and more A new, minimalistic version of IDTE is also available for download: https://sourceforge.net/projects/idte ⚠️ Important Notice Tagging MP4 video files and WAV audio files using IDTE is not recommended. ...
    Downloads: 39 This Week
    Last Update:
    See Project
  • 16
    MatlabFunc

    MatlabFunc

    Matlab codes for feature learning

    ...The project is intended to provide reusable and adaptable MATLAB code that can save time for researchers and students working on experimental or applied projects. By consolidating these tools in one place, MatlabFunc serves as a practical reference and toolkit for both academic and engineering purposes. Contributions and improvements from the community are encouraged, allowing the repository to grow into a richer resource over time.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Clu-Linux-Live

    Clu-Linux-Live

    Various Processing and Data Rescue Tools over Wired or Wireless Networ

    This Linux Live CD provides Various Processing Command Line Utilities (Clu) and Data Rescue Tools which can be used on a Wired or Wireless Network. On Startup it prompts the user to change password, mount all filesystems available locally, start wireless network ( if wifi interface present ), start network services (samba/ssh/sftp) and present user with a console for executing various utilities i.e Text, Image, Audio, Video, Downloading etc. on their FileSystems that are mounted. Googling on...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    FFmpegYAG

    FFmpegYAG

    FFmpeg GUI for Linux & Windows

    FFmpegYAG is an advanced GUI for the popular FFmpeg audio/video encoding tool. To use all features (split/concat, x264 10 bit, HE-AAC) it is recommend to use the FFmpeg Hi (http://sourceforge.net/projects/ffmpeg-hi/) build. Main features: * batch encoding for multiple tasks * interactive video preview, real-time video/audio playback * multiple streams processing for video/audio/subtitles * trim file to segments (with optional fade in/out filters) and concatenate them
    Leader badge
    Downloads: 76 This Week
    Last Update:
    See Project
  • 19
    GTK+ IOStream

    GTK+ IOStream

    GTK+iostream, Data plots, ORB, Neural Networks, WSOLA

    ...You can also plot like so : Plot figure; figure.plot(x, y, x.size()); Plotting utilises GtkDataBox.sf.net ORB utilises www.zeroc.com Feed forward neural networks with different activation functions. Audio playback/recording and digital signal processing (DSP) - utilising Jack www.jackaudio.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ...Our method can handle the error analysis of a wider-range of adders with negligible computational overhead. To ensure its rapid adoption in industry and academia, we have open-sourced our LabVIEW and MATLAB libraries. Lab Web Page: http://save.seecs.nust.edu.pk/projects/SEALPAA/ Emails: 14mseemayub@seecs.edu.pk, osman.hasan@seecs.edu.pk, muhammad.shafique@tuwien.ac.at
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MOVE - Mixxx Ogre3D Visual Effects

    MOVE - Mixxx Ogre3D Visual Effects

    Tries to "give" to the music some 3D visual effects

    ✿ More projects here: https://sourceforge.net/u/jorgerosa/profile ✿ Join our group: https://www.facebook.com/groups/programadoresdeportugal ✿ At GitHub: https://github.com/jorgerosa/Mixxx-MOVE MOVE displays some synchronized 3D visuals, while you play your audio. (Gets the audio from an input device from your system, such as a microphone, etc...). Great to send to a 2nd monitor or TV set, etc... The ".exe" (executable files) are inside the "bin"...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Xamp

    This is Xamp, a secure and platform independend media player.

    This is Xamp, the next generation of a skinnable and platform independent media player which based on the Doppio media framework: https://sourceforge.net/projects/doppio. The player supports various pluggable video and audio formats. Xamp is primarily intend to run as a native executable but can also run as a Java player version working with Java 1.1 VM or above. To ensure the best user experience, the Java version of Xamp doesn't use the Java Media Framework, the high level part of the Java Sound API, the high level part of the Java Swing API, or other high level API's: http://maikmerten.livejournal.com/2015/05/13 Development goals are: -Superior sound quality -Combines audio player with media player -Processing of insecure media files -Performance -Reliability -Open codec support -Click to ready installation Supports currently: Ogg Sceleton Ogg Comment ID3 Ogg Vorbis WAV, SND, AU, AIFF Ogg Theora MP1, MP2, MP3 Mpeg1, Mpeg2 M3U, PLZ WSZ
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    GamePlay

    GamePlay

    Open-source, cross-platform, C++ game engine for creating 2D/3D games

    GamePlay (also known as Gameplay3D) is an open-source, cross-platform C++ game engine and framework that supports both 2D and 3D game development on desktop and mobile platforms. Designed to be lightweight yet powerful, it provides developers with robust tools for graphics rendering, scene management, audio, physics, input, and more, as well as templates and tools to help jumpstart projects. The framework emphasizes ease of use and portability, offering native support for Windows, macOS, Linux, iOS, and Android so developers can write their game once and deploy it across many platforms. With an Apache 2.0 license, developers have broad flexibility to use, modify, and distribute their projects commercially or personally. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Vibrato Toolbox

    Vibrato Toolbox

    Vibrato Toolbox: Parametric Analysis of Musical Vibrato

    Vibrato Toolbox: Parametric Analysis of Musical Vibrato from High-Precision Magnitude/Frequency Tracking Mingfeng Zhang ECE, Univ. of Rochester Part I Analysis of individual music tones This MATLAB toolbox provides signal analysis tools for time-frequency analysis and for parametric analysis of musical vibrato Part II Analysis of multiple music tones or multiple harmonic partials This toolbox allows you to explore a group of audio files, and obtains its related parameters...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB