Showing 67 open source projects for "music voice"

  • 1
    Mumble

    Mumble is an open-source, low-latency, high-quality voice chat

    Mumble is open-source, low-latency, high-quality voice chat software written on top of Qt and Opus. It has two modules: the client (mumble) and the server (murmur). The client works on Windows, Linux, FreeBSD, OpenBSD, and macOS, while the server should work on anything Qt can be installed on. Administrators appreciate Mumble because they can self-host it and keep control over data security and privacy.
    Downloads: 15 This Week
  • 2
    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    ...It includes features such as flexible multi-turn chat, audio understanding and reasoning, music appreciation, and tool usage (e.g. voice editing).
    Downloads: 1 This Week
  • 3
    Kaset

    The missing YouTube Music macOS app

    Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio...
    Downloads: 2 This Week
  • 4
    FusionPBX

    A domain based multi-tenant PBX & voice switch for FreeSWITCH

    ...It also offers unlimited extensions, device provisioning, music on hold, call parking, automatic call distribution and many other superb features. Give your company the business functionality and corporate level phone system it needs with FusionPBX!
    Downloads: 5 This Week
  • 5
    MiniMax-MCP

    Official MiniMax Model Context Protocol (MCP) server

    MiniMax-MCP is the official Model Context Protocol (MCP) server for accessing MiniMax’s multimodal generative APIs from MCP-compatible clients. It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license, with a pyproject.toml and uv-based workflow that makes installation and execution reproducible. Configuration is handled through JSON files that tell MCP clients how to launch the server (typically via uvx minimax-mcp) and which environment variables to use for the API key, host, and output directory. ...
    Downloads: 0 This Week
  • 6
    YuE

    Open source AI model for generating full songs from lyrics prompts

    YuE is an open source project that provides a foundation model designed for full-song music generation using artificial intelligence. It focuses on transforming text inputs such as lyrics and genre prompts into complete musical compositions that include both vocal and instrumental tracks. Unlike many shorter audio generators, the model is capable of producing songs that last several minutes while maintaining coherent musical structure and alignment with the provided lyrics. YuE introduces a...
    Downloads: 4 This Week
  • 7
    YandexStation

    Management of Yandex Station and other smart home devices

    ...It supports both local and cloud control, depending on the device type, with Yandex speakers often supporting both modes and third-party speakers typically limited to cloud control. The integration exposes playback and volume controls, as well as text-to-speech capabilities that send spoken messages in Alice’s voice directly to the speakers. It also lets you send arbitrary text commands as if you were talking to Alice, enabling scenarios such as “play my music,” launching routines, or querying information via Home Assistant automations. In local control mode, the component can read back what is currently playing, including album art, and supports seeking and track skipping, which is more limited in cloud-only mode.
    Downloads: 1 This Week
  • 8
    Echo Voice Recorder

    Simple Voice Recorder Application for Debian Based Linux Systems

    A lightweight audio recording app with minimal dependencies. It records your microphone and, if desired, your system sounds (https://github.com/shampuan/Echo-Voice-Recorder). It supports 5 audio formats (wav, mp3, ogg, flac, aac) and 7 languages: Azerbaijani, German, English, Spanish, French, Turkish (default), and Russian.
    Downloads: 14 This Week
  • 9
    TTS WebUI

    A single Gradio + React WebUI with extensions for ACE-Step

    TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all necessary dependencies, so users can focus on experimenting with voices instead of managing tooling. ...
    Downloads: 10 This Week
  • 10
    Whisper-WebUI

    A web UI for easy subtitle generation using the Whisper model

    ...It supports multiple input sources including local files, YouTube content, and microphone input, making it versatile for different workflows. Whisper WebUI also includes advanced preprocessing and postprocessing features such as voice activity detection, background music separation, and speaker diarization, enabling more accurate and structured outputs.
    Downloads: 6 This Week
  • 11
    Audio Priority Bar

    A native macOS menu bar app for managing audio device priorities

    ...Once installed, it places an always-accessible control in the menu bar that lets you assign priority levels to individual audio sources so that more important sounds (like alerts, calls, or music) can override or duck less important ones (like background noise or game audio). This becomes especially useful in multitasking situations, for example keeping voice calls audible while muting or lowering other playback automatically when needed. The tool maintains simple but powerful state management so priorities persist across app launches and device changes, plus it supports per-device profiles so your rules can differ between headphones, speakers, and external monitors.
    Downloads: 0 This Week
  • 12
    Amphion

    Toolkit for audio, music, and speech generation

    Amphion is a toolkit from OpenMMLab dedicated to audio, music, and speech generation, aimed at both reproducible research and helping newcomers get started in generative audio. It provides standardized implementations and recipes for classic and state-of-the-art generative models in audio, including TTS, music generation, and voice conversion. A distinctive feature of Amphion is its emphasis on visualization: it offers interactive visualizations of model architectures and generation processes, making it easier to understand how complex generative audio models work. ...
    Downloads: 0 This Week
  • 13
    Personal A.I Assistant
    An open-source personal AI assistant based on the Google Gemini API that is fully customizable for your needs. Ask questions, request real-time data and information, play music, launch programs, and open websites on your PC with voice commands. ***Requires your Google Gemini API key to work.***
    Downloads: 13 This Week
  • 14
    Free Karaoke File Maker

    Free Karaoke File Maker

    You can remove the singer's voice from music files that do not otherwise let you mute the vocals on a computer. By default, the output is saved with 2 audio tracks (singer + melody). To save only the melody without the singer's voice, select the No Vocal option. To save the output file, click Save Folder and choose a location (default: Desktop).
    Downloads: 4 This Week
  • 15
    byzorgan

    Specialized sound synthesizer with Byzantine Church music scales

    This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse, or touch screen. MIDI input is also available. Voice functions include pitch highlighting, synthesizer control by voice, pitch correction, and voice-to-ison conversion. On-screen labels show the symbols of Byzantine notes, and a metronome is included. The program is based on the Chrysanthos tuning of the diatonic scale:...
    Downloads: 23 This Week
  • 16
    Mechvibes

    Play mechanical keyboard sounds, typewriter sound as you type

    It's a side project I made for myself: when I use my mechanical keyboard late at night or in the office, my parents and coworkers hate it because the keyboard is very loud (especially in quiet places). If you face the same issue, this product is for you! - Plays typewriter sound - You can add more keyboard sound sets by recording any sound you like and adding it to the app in a few easy steps. - Use your laptop keyboard...
    Downloads: 1,631 This Week
  • 17
    Audio Satanifier 666

    Easily apply cool gnarly voice filters to your audio files

    Transform pure, innocent audio files (speech, music, etc.) into unholy demonic abominations. Audio Satanifier 666 is a fun, easy-to-use browser-based tool forged in the pits of hell for voice actors, musicians, sound designers, memes, creative projects, or anyone else who wants to twist their sound into something absolutely diabolical! Layperson friendly: you'll be able to apply cool effects to your audio file even if you know nothing about audio engineering. ...
    Downloads: 0 This Week
  • 18
    Kytner

    A simple PC tool suite for easy access to PC tasks

    ...Effortlessly personalize your wallpaper with Kytner Settings ✨. Secure your digital life with KytnerSec 🔒. Tackle daily math with Kytner Calc ➕➖✖️➗. Enjoy your media with Kytner Film ▶️, Kytner Picture 🖼️, and Kytner Music 🎶. Capture your screen with KytnerScreen 📸. Browse simply with Kytner Web 🌐. Snap photos with Kytner Cam 😊. Manage archives easily with Kytner Pack 📦. And Save Backup and Copy your Files with Kytner SaveIT!😊 Simply record your Voice with Kytner Voicerecorder 😊 Intuitive desktop UX for Windows 10*/11 <3. English only. Discover the simplicity. 🚀 !!!...
    Downloads: 3 This Week
  • 19
    Xiaomi.eu Multilang MIUI ROMs

    OFFICIAL mirror of MIUI Multilang ROM by https://xiaomi.eu/

    Included languages: English (en.miui.com, xiaomi.eu) Chinese (miui.com) Polish (Acid, miuipolska.pl) Slovak (ingbrzy, https://miuios.cz) Czech (VMach, https://miuios.cz) Bulgarian (Global) Hungarian (vagyula, miui.hu) French (Global) Swedish (Global) Russian (Global) Ukrainian (Global) Belorussian (Global) Croatian (Masva, MIUI Adria) Vietnamese (Belmont-Gabriel) Italian (miui.it) Greek (Global) Norwegian (Global) Dutch (cueie, mannyribo, redmaner) Spanish (nosijf982)...
    Downloads: 19,955 This Week
  • 20
    TURGEN

    Create your own Atari tapes

    With TURGEN, you can record your own tapes with software for Atari 8-bit computers, and transfer data from your PC or Mac to your Atari 8-bit computer using your data recorder and a cassette adapter. TURGEN converts digital images of Atari software to the original standard Atari tape records and also various "turbo" systems. You can create WAVE files, tape images, or send the signal directly to the sound card. Tools such as Wizard for files, Tape Side Creator, Dual Track Tape creator...
    Downloads: 34 This Week
  • 21
    cerberuscms2

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-like model. It is a custom-written Web Application Framework ( W.A.F. ) with a consistent, custom-written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project's aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text, Voice and Video Communications Platform and Content...
    Downloads: 4 This Week
  • 22
    Cerberus Content Management System 6

    Cerberus Content Management System

    Cerberus Content Management System is a dynamic, secure and infinitely expandable CMS designed after a Unix-like model, complete with a Unix-like kernel file named Cerberus. It is a custom-written Web Application Framework ( W.A.F. ) with a consistent, custom-written Pre-Hyper-Text-Post-Processor Programming Code Framework ( P.C.F. ). This Web Application Software Project's aim is to be the fastest and most secure Web Application Framework, Web Application Programming Code Framework, Text,...
    Downloads: 2 This Week
  • 23
    Audio Webui

    A webui for different audio related Neural Networks

    Audio Webui is a Gradio-based web user interface that unifies a wide range of audio-related neural networks under a single, accessible front end. It is designed as an “all-in-one” environment where users can experiment with text-to-speech, voice cloning, generative music, and other neural audio models without writing boilerplate code. The project supports multiple back-end models and toolchains (such as Bark, RVC, AudioLDM, Audiocraft, and other text-to-audio or voice-cloning tools), exposing them through a consistent UI for inference and experimentation. Installation is streamlined through automatic installers and platform-specific scripts that create a virtual environment, install dependencies, and launch the web app with minimal manual setup. ...
    Downloads: 0 This Week
  • 24
    DiffSinger

    Singing Voice Synthesis via Shallow Diffusion Mechanism

    DiffSinger is an open-source PyTorch implementation of a diffusion-based acoustic model for singing-voice synthesis (SVS) and also text-to-speech (TTS) in a related variant. The core idea is to view generation of a sung voice (mel-spectrogram) as a diffusion process: starting from noise, the model iteratively “denoises” while being conditioned on a music score (lyrics, pitch, musical timing). This avoids some of the typical problems of prior SVS models, such as over-smoothing or unstable GAN training, and produces more realistic, expressive, and natural-sounding singing. ...
    Downloads: 42 This Week
  • 25
    Jarvis Python AI Assistant

    Python AI assistant

    Jarvis is a voice-command assistant service written in Python 3.8. It can recognize human speech, talk to the user, and execute basic commands. It opens a web page (e.g. 'Jarvis open youtube'); plays music on YouTube (e.g. 'Jarvis play mozart'); increases or decreases the speakers' master volume, and can also set it to max or mute (e.g. 'Jarvis volume up!'); opens LibreOffice suite applications (calc, writer, impress) (e.g. 'Jarvis open calc'); tells you about something by searching the internet (e.g. 'Jarvis tell me about oranges'); tells the weather for a place (e.g. 'Jarvis tell me the weather in London'); tells the current time and/or date (e.g. 'Jarvis tell me time or date'); sets an alarm (e.g. 'Jarvis create a new alarm'); tells the internet speed (ping, uplink and downlink) (e.g. 'Jarvis tell me the internet speed'); tells the internet availability (e.g. 'Jarvis is the internet connection ok?') ...
    Downloads: 11 This Week