Showing 202 open source projects for "pitch"

  • 1
    Basic Pitch

    A lightweight audio-to-MIDI converter with pitch bend detection

    ...Provide a compatible audio file and basic-pitch will generate a MIDI file, complete with pitch bends. Basic Pitch is instrument-agnostic and supports polyphonic instruments, so you can transcribe all your favorite music, no matter what instrument is used. Basic Pitch works best on one instrument at a time.
    Downloads: 32 This Week
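The "MIDI file, complete with pitch bends" in the Basic Pitch entry can be illustrated with a little arithmetic: a detected frequency maps to the nearest MIDI note plus a 14-bit pitch-bend value for the remainder. This is a minimal sketch of that mapping, not Basic Pitch's own code; the function name, the 440 Hz reference, and the 2-semitone bend range are assumptions chosen for illustration.

```python
import math

A4_HZ = 440.0       # reference tuning for MIDI note 69 (assumption)
BEND_RANGE = 2.0    # pitch-bend range in semitones (assumption, a common default)
BEND_CENTER = 8192  # 14-bit MIDI pitch-bend value meaning "no bend"


def freq_to_note_and_bend(freq_hz):
    """Return (midi_note, bend_value) approximating freq_hz (hypothetical helper)."""
    exact = 69 + 12 * math.log2(freq_hz / A4_HZ)  # fractional MIDI note number
    note = round(exact)                           # nearest equal-tempered note
    offset = exact - note                         # leftover, in semitones
    bend = BEND_CENTER + round(offset / BEND_RANGE * 8192)
    return note, max(0, min(16383, bend))         # clamp to the 14-bit range


if __name__ == "__main__":
    print(freq_to_note_and_bend(440.0))  # A4, no bend
```

With these assumptions, 440 Hz maps to note 69 with the bend at its center value of 8192, and a tone a quarter of a semitone sharper keeps note 69 but raises the bend value.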
  • 2
    Qwen3-TTS

    Qwen3-TTS is an open-source series of TTS models

    ...Because it’s part of the broader Qwen ecosystem, it benefits from the model’s understanding of linguistic nuances, enabling more accurate pronunciation, prosody, and contextual delivery than many traditional TTS systems. Developers can customize voice output parameters like speed, pitch, and volume, and combine the TTS stack with other AI components.
    Downloads: 13 This Week
  • 3
    IndexTTS2

    Industrial-level controllable zero-shot text-to-speech system

    ...Compared to many open-source TTS tools, IndexTTS emphasizes efficiency and controllability: it offers faster inference, simpler training pipelines, and controllable speech parameters (like duration, pitch, and prosody), which is critical for production use.
    Downloads: 7 This Week
  • 4
    edge-tts

    Use Microsoft Edge's online text-to-speech service from Python

    ...It also supports generating subtitle files (such as SRT or VTT) alongside the speech, which is handy for video narration, e-learning, or accessibility workflows. From the CLI you can adjust parameters such as speaking rate, volume, and pitch, giving you some control over prosody without diving into SSML. The library is asynchronous under the hood, which makes it efficient for batch jobs or web services that need to synthesize many utterances concurrently.
    Downloads: 12 This Week
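As a rough illustration of the rate, volume, and pitch controls mentioned for edge-tts, the helper below assembles a command line. The helper name is hypothetical, and the flags (`--rate`, `--volume`, `--pitch`, `--text`, `--write-media`) are given as recalled from the project's README, so check them against `edge-tts --help` before relying on them.

```python
def edge_tts_command(text, out_path, rate_pct=0, volume_pct=0, pitch_hz=0):
    """Build an argument list for the edge-tts CLI (hypothetical helper).

    Values are signed offsets: percent for rate and volume, Hz for pitch.
    The "--flag=value" form keeps negative values from being read as options.
    """
    return [
        "edge-tts",
        f"--rate={rate_pct:+d}%",
        f"--volume={volume_pct:+d}%",
        f"--pitch={pitch_hz:+d}Hz",
        "--text", text,
        "--write-media", out_path,
    ]


if __name__ == "__main__":
    # Print the command instead of running it; the real CLI needs network access.
    print(" ".join(edge_tts_command("Hello, world!", "hello.mp3", rate_pct=-50)))
```

The list form can be passed to `subprocess.run` without shell quoting concerns.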
  • 5
    Nerd Fonts

    Iconic font aggregator, collection, & patcher. 3,600+ icons

    ...There is also a font patcher available if your desired font isn't already patched. For more high-level information see the wiki. Includes an option to create Monospaced (fixed-pitch, fixed-width) or double-width (non-monospaced) glyphs.
    Downloads: 110 This Week
  • 6
    Spark TTS

    Spark-TTS Inference Code

    ...The project supports zero-shot voice cloning, meaning it can imitate a new speaker’s voice without dedicated training for that specific voice, and works across languages, including English and Chinese, even in cross-lingual code-switching scenarios. Spark-TTS allows users to control speech characteristics like gender, pitch, and speaking rate to customize synthesized output and support virtual speaker creation.
    Downloads: 0 This Week
  • 7
    EasyVoice

    Open source text-to-speech tool, supports extra-long text

    ...The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure parameters such as rate, pitch, and volume per role. It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. Under the hood, EasyVoice uses a modern stack with Vue 3 and Element Plus on the front end, Node.js and Express on the back end, and TTS engines such as Microsoft Azure TTS and OpenAI-compatible APIs, orchestrated through ffmpeg.
    Downloads: 1 This Week
  • 8
    Harmonoid

    Plays & manages your music library. Looks beautiful & juicy

    Plays & manages your music library. Looks beautiful & juicy. Playlists, visuals, synced lyrics, pitch shift, volume boost & more.
    Downloads: 4 This Week
  • 9
    NeuralNote

    Audio Plugin for Audio to MIDI transcription using deep learning

    ...NeuralNote supports polyphonic transcription, meaning it can detect multiple notes played simultaneously, making it useful for instruments such as piano or guitar. The system relies on neural network models to analyze audio signals and infer pitch, timing, and other musical attributes that can be represented as MIDI data. The resulting MIDI output can be edited, quantized, or exported to other instruments within a music production workflow.
    Downloads: 49 This Week
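The NeuralNote entry mentions that transcribed MIDI can be quantized. One simple way to picture that step is snapping note boundaries to a rhythmic grid. The sketch below is a generic illustration, not NeuralNote's implementation; the `(pitch, start, end)` tuple layout, the beat-based time unit, and the function name are assumptions.

```python
def quantize_notes(notes, grid=0.25):
    """Snap note start/end times (in beats) to the nearest grid line.

    notes: iterable of (midi_pitch, start, end) tuples (assumed layout).
    Each note keeps a minimum length of one grid step so it never collapses.
    """
    quantized = []
    for pitch, start, end in notes:
        q_start = round(start / grid) * grid
        q_end = max(q_start + grid, round(end / grid) * grid)
        quantized.append((pitch, q_start, q_end))
    return quantized


if __name__ == "__main__":
    played = [(60, 0.05, 0.98), (64, 1.1, 1.15)]  # slightly off-grid input
    print(quantize_notes(played))
```

With a sixteenth-note grid (0.25 beats), the first note snaps to the span 0.0 to 1.0 and the very short second note is stretched to one full grid step.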
  • 10
    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 0 This Week
  • 11
    StyleTTS 2

    Towards Human-Level Text-to-Speech through Style Diffusion

    ...StyleTTS2 supports both single-speaker and multi-speaker configurations, with the ability to sample or transfer styles from reference audio, making it powerful for expressive TTS and character voices. The repository includes training scripts, configuration files, and pre-trained auxiliary modules such as a text aligner, pitch extractor, and PL-BERT-based linguistic encoder.
    Downloads: 3 This Week
  • 12
    Recorder

    HTML5 JavaScript recording in MP3, WAV, OGG, WebM, and AMR formats

    Supports microphone recording and real-time processing in most mobile and PC browsers that implement getUserMedia, including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, the Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView), uni-app (App, H5), and the built-in browsers of most Android phones updated after 2021. Not supported: UC-based kernels (typically Alipay), the built-in browsers of most older domestic phones that have not been updated, and, on older iOS versions (11.0-14.2), any browser other than Safari (including PWA, WebClip, and pages inside any app). Provides multiple plug-ins: rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, and more. With its real-time processing support, it can be used in a wide range of web applications, from simple recording to complex real-time speech recognition (ASR) and even audio-related games.
    Downloads: 3 This Week
  • 13
    MuseGAN

    An AI for Music Generation

    ...The system focuses specifically on generating multi-track polyphonic music, meaning that it can simultaneously produce multiple instrument parts such as drums, bass, piano, guitar, and strings. Instead of generating raw audio, the model operates on piano-roll representations of music, which encode notes as time-pitch matrices for each instrument track. This representation allows the neural network to capture rhythmic patterns, harmonic relationships, and structural dependencies across instruments. The architecture is based on convolutional GAN models that learn temporal musical structure and inter-track relationships from training data. The project was trained using the Lakh Pianoroll Dataset, a large collection of multitrack musical sequences derived from MIDI files.
    Downloads: 0 This Week
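The piano-roll representation described in the MuseGAN entry, a time-pitch matrix per instrument track, can be sketched in a few lines. This is a generic illustration rather than MuseGAN's data pipeline; the `(pitch, start_step, end_step)` note layout, the binary cell values, and the function name are assumptions.

```python
def notes_to_pianoroll(notes, n_steps, n_pitches=128):
    """Build a binary time-pitch matrix for one track.

    notes: iterable of (midi_pitch, start_step, end_step), end exclusive
           (assumed layout).
    Returns a list of n_steps rows with n_pitches columns each;
    a cell is 1 while that pitch is sounding at that time step.
    """
    roll = [[0] * n_pitches for _ in range(n_steps)]
    for pitch, start, end in notes:
        for step in range(start, min(end, n_steps)):
            roll[step][pitch] = 1
    return roll


if __name__ == "__main__":
    # C4 held for two steps, E4 entering one step later.
    roll = notes_to_pianoroll([(60, 0, 2), (64, 1, 3)], n_steps=4)
    for step, row in enumerate(roll):
        print(step, [p for p, on in enumerate(row) if on])
```

A multi-track piece would simply be one such matrix per instrument, stacked along an extra axis.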
  • 14
    PitchBlack Recovery Project

    An Open Source Custom Recovery For Android

    Pitch Black Recovery is a fork of TWRP with many improvements to make your experience better. It's more flexible and easier to use. Pitch Black Recovery was started on March 18, 2018, with the aim of enhancing TWRP with better customizations, themes, and features.
    Downloads: 1,139 This Week
  • 15
    DragonianVoice

    C++ inference library for multiple SVC/TTS

    ...The project supports a wide range of model families: TTS models such as Tacotron2, VITS, EmotionalVITS, BERTVits2, and GPT-SoVITS; SVC systems such as SoVitsSvc (v2/v3/v4), RVC, DiffSvc, DiffusionSvc, FishDiffusion, and ReflowSvc; singing systems such as DiffSinger; and related pitch/feature extractors such as FCPE and RMVPE. It uses ONNX Runtime and other backends to accelerate inference, with notes on how different execution providers such as CUDA or DirectML affect operator support and numerical stability. Recent versions integrate with fish-speech via a dedicated fish-speech.cpp subproject using ggml.
    Downloads: 0 This Week
  • 16
    SpectrumNotes

    Live analysis of pitches, harmonics, chords, and keys.

    Windows 10+ 64-bit desktop application that analyzes live audio (mic or output) and displays it as color-coded pitches. It also analyzes chords, keys, and harmonics, and includes a built-in instrument tuner.
    Downloads: 1 This Week
  • 17
    432Hz Batch Converter

    Converts and re-encodes music to 432Hz

    ...Play a piece of music at 440Hz and at 432Hz, and see which one you prefer. The author reports that most people choose the 432Hz version and find it hard to go back to 440Hz. This application re-encodes your audio files while shifting the pitch to 432Hz. It uses a very high-quality pitch-shifting algorithm. Supports Windows (Win 7 SP1 or later) and Linux (all distros). Installation instructions: https://github.com/mysteryx93/HanumanInstituteApps/wiki/432hz-Batch-Converter By Etienne Charland, aka Hanuman, a lightworker working in his free time. https://www.spiritualselftransformation.com/
    Downloads: 60 This Week
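To put "shifting the pitch to 432Hz" in numbers: retuning from an A4 reference of 440 Hz to 432 Hz scales every frequency by 432/440, a shift of a little under a third of a semitone downward. The sketch below only works out that arithmetic; it is not the converter's pitch-shifting algorithm, and the function names are illustrative.

```python
import math

OLD_REF_HZ = 440.0  # source tuning reference (A4)
NEW_REF_HZ = 432.0  # target tuning reference (A4)


def retune(freq_hz, old_ref=OLD_REF_HZ, new_ref=NEW_REF_HZ):
    """Scale a frequency by the ratio between the two tuning references."""
    return freq_hz * new_ref / old_ref


def shift_in_cents(old_ref=OLD_REF_HZ, new_ref=NEW_REF_HZ):
    """Size of the retuning in cents (100 cents = 1 equal-tempered semitone)."""
    return 1200 * math.log2(new_ref / old_ref)


if __name__ == "__main__":
    print(retune(880.0))               # A5 after retuning
    print(round(shift_in_cents(), 2))  # negative value: pitch goes down
```

The shift comes out at roughly -31.8 cents, so every note moves down by the same small interval and the relationships between notes are unchanged.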
  • 18
    Five-Limit InterFace

    Five-limit pitch fractal, navigated via motion tracking

    Downloads: 0 This Week
  • 19
    432Hz Player

    Plays music in 432Hz

    ...Play a piece of music at 440Hz and at 432Hz, and see which one you prefer. The author reports that most people choose the 432Hz version and find it hard to go back to 440Hz. This application uses a very high-quality pitch-shifting algorithm with low CPU usage. Supports Windows (Win 7 SP1 or later) and Linux (all distros). Installation instructions: https://github.com/mysteryx93/HanumanInstituteApps/wiki/432hz-Player By Etienne Charland, aka Hanuman, a lightworker working in his free time. https://www.spiritualselftransformation.com/
    Downloads: 58 This Week
  • 20
    byzorgan

    Specialized sound synthesizer with Byzantine Church music scales

    This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse, or touch screen. MIDI input is also available. Voice functions include pitch highlighting, synthesizer control by voice, pitch correction, and voice-to-ison conversion. On-screen labels show the symbols of Byzantine notes. There is a metronome. The program is based on the Chrysanthos tuning of the diatonic scale: 9/8 - 12/11 - 88/81. The octave is divided into 17 (or 19) unequal parts, which makes it possible to obtain all the Byzantine modes.
    Downloads: 24 This Week
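The three ratios quoted for the Chrysanthos tuning in the byzorgan entry can be checked with exact fractions. Reading "9/8 - 12/11 - 88/81" as the frequency ratios of three successive scale steps is an assumption here; under that reading the steps multiply out to exactly 4/3, a perfect fourth. The names below are illustrative and not taken from byzorgan.

```python
import math
from fractions import Fraction

# Successive step ratios as quoted in the description (interpretation assumed).
STEPS = [Fraction(9, 8), Fraction(12, 11), Fraction(88, 81)]


def cents(ratio):
    """Size of a frequency ratio in cents (1200 cents = one octave)."""
    return 1200 * math.log2(ratio)


if __name__ == "__main__":
    total = math.prod(STEPS)  # exact product of the three steps
    for step in STEPS:
        print(step, round(cents(step), 2))
    print("total:", total, round(cents(total), 2))
```

The three steps measure roughly 204, 151, and 143 cents, which shows how far the two smaller steps sit from the 100-cent and 200-cent steps of equal temperament.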
  • 21
    Midi router

    Midi router

    ...Midi Router Client is an advanced, cross-platform tool designed to route and manipulate MIDI signals in real time. It allows users to create virtual MIDI ports, map MIDI events (e.g., CC to pitch wheel), monitor inputs, and build custom controls like program change dropdowns, all with minimal latency. Documentation: https://shemeshg.github.io/MidiRouterClient-mdbook GitHub and support: https://github.com/shemeshg/MidiRouterClient Step-by-step install: https://shemeshg.github.io/MidiRouterClient-mdbook/guide/installation.html
    Downloads: 142 This Week
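The "CC to pitch wheel" mapping mentioned for Midi Router Client amounts to rescaling a 7-bit controller value (0-127) to the 14-bit pitch-bend range (0-16383) and re-packing it as a pitch-bend message. The sketch below shows one straightforward linear mapping; how Midi Router Client actually scales values is not stated in the description, and the function name is hypothetical.

```python
def cc_to_pitch_bend(cc_value, channel=0):
    """Convert a 7-bit CC value into a 3-byte MIDI pitch-bend message.

    Linear scaling: 0 -> 0 and 127 -> 16383 (16383 / 127 is exactly 129).
    Note that CC 64 lands slightly above the 8192 "no bend" center.
    """
    if not 0 <= cc_value <= 127:
        raise ValueError("cc_value must be in 0..127")
    if not 0 <= channel <= 15:
        raise ValueError("channel must be in 0..15")
    bend = cc_value * 129          # 14-bit pitch-bend value
    lsb = bend & 0x7F              # low 7 bits are sent first
    msb = (bend >> 7) & 0x7F       # high 7 bits are sent second
    return bytes([0xE0 | channel, lsb, msb])


if __name__ == "__main__":
    for value in (0, 64, 127):
        print(value, cc_to_pitch_bend(value).hex(" "))
```

A router that needs CC 64 to mean "no bend" exactly would use a piecewise mapping around the center instead of this single linear scale.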
  • 22
    Subs Media Player
    Media player for Windows for encoding and decoding subtitles, based on Vlc.DotNet and CSCore
    Downloads: 82 This Week
  • 23
    Nyquist

    Nyquist is a language for sound synthesis and music composition.

    Nyquist is a language for sound synthesis and music composition. It is implemented in C and C++ and runs on Win32, OSX, and Linux. Nyquist combines a powerful functional programming style with efficient signal-processing primitives. Nyquist is also embedded as a scripting language in Audacity.
    Downloads: 40 This Week
  • 24
    InstrumentalMusic

    Application which detects musical notes from the microphone.

    ...It listens to the microphone and plays the detected notes to the output (as MIDI). Features include multilanguage support, zoom, a dark mode option, and JDK 17 compatibility. As of v1.2 it includes a pitch shifter (making the voice lower or higher through a slider). A demo video shows how it works (it can be opened from the application's Help menu). You can also see the pitch-shifter demo here: https://frojasg1.com:8443/resource_counter/resourceCounter?operation=countAndForward&url=https%3A%2F%2Ffrojasg1.com%2Fdemos%2Faplicaciones%2FInstrumentalMusic%2Fpitch.shifter.mp4%3Forigin%3Dsourceforge&origin=web
    Downloads: 5 This Week
  • 25
    Orange Juice Audio Enhancer for Linux
    ...sudo apt install easyeffects lsp-plugins calf-plugins. Download the tar.gz, unarchive it, and put the 3 files into /home/{users folder}/.config/easyeffects/output. Note: MX-23 uses "rubberband" for the pitch effect, while MX-25 uses "soundtouch". The two are completely incompatible, so each needs a different preset, which I have prepared.
    Downloads: 2 This Week