Showing 142 open source projects for "microphone"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    HaishinKit

    HaishinKit

    Camera and Microphone streaming library via RTMP and SRT for iOS, Mac

    Camera and Microphone streaming library via RTMP and SRT for iOS, macOS, tvOS and visionOS.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    ...The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 3
    Hyprnote

    Hyprnote

    Local-first AI Notepad for Private Meetings

    Hyprnote is an open-source, privacy-first AI notepad app designed for taking notes during meetings—transcribing audio (microphone and system) and generating context-rich summaries using on-device AI models like Whisper and HyprLLM, all without any data leaving your machine.(turn0search7, turn0search1). Listens to your meetings while you write. Crafts smart summaries based on your quick notes. Runs completely offline using open-source models like Whisper or HyprLLM.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    Vision Camera

    Vision Camera

    The Camera library that sees the vision

    VisionCamera was designed from the ground up to provide all features a camera app should have. You have full control over what device is used, and can even configure options such as frame rate, colorspace, and more. While having a lot of features, VisionCamera makes sure you don't get overwhelmed from the beginning. It provides hooks and functions to help you get started faster, and if you need full control, you can easily do that. Every functionality has been thoroughly documented and even...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    WhisperLive

    WhisperLive

    A nearly-live implementation of OpenAI's Whisper

    ...The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and network streams such as RTSP and HLS, making it flexible for live events, monitoring, or accessibility workflows. Configuration options let you control the number of clients, maximum connection time, and threading behavior so the server can be tuned for different deployment environments. On the client side, you can set the language, whether to translate into English, model size, voice activity detection, and output recording behavior.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    ScreenPipe

    ScreenPipe

    AI app store powered by 24/7 desktop history. open source

    Screenpipe is an AI app store powered by continuous desktop history recording. It operates entirely locally, offering developers a platform to build, distribute, and monetize AI applications that leverage comprehensive contextual data from users' desktop activities. ​
    Downloads: 36 This Week
    Last Update:
    See Project
  • 7
    Kooha

    Kooha

    Elegantly record your screen

    Capture your screen in an intuitive and straightforward way without distractions. Kooha is a simple screen recorder with a minimal interface. You can simply click the record button without having to configure a bunch of settings.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 8
    WebCord

    WebCord

    A Discord and SpaceBar :electron:-based client

    ...WebCord does a lot to improve the privacy of the users. It blocks known tracing and fingerprinting methods, but it does not end on it. It also manages the permissions to sensitive APIs like camera or microphone, sets its own user agent to the one present in Chromium browsers and spoof web API modifications in order to prevent distinguishing it from the real Chrome/Chromium browsers.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 9
    CAVA

    CAVA

    Cross-platform Audio Visualizer

    ...Choose from several preset settings of incredible colors or create your own. CAVA is a bar spectrum audio viewer based on my own open source project with the same name. Take the audio from the device's microphone and visualize the amplitude of the different frequencies as bars on the screen. Each bar represents a certain bandwidth of low to high frequencies. The leftmost bar starts at 50 Hz and the rightmost bar ends at 10 kHz. Although the frequencies outside this spectrum are audible, they do not contribute much to the overall sound image. ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    RealtimeSTT

    RealtimeSTT

    A robust, efficient, low-latency speech-to-text library

    RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Mumble

    Mumble

    Mumble is an open-source, low-latency, high quality voice chat

    Mumble is an open-source, low-latency, high-quality voice chat software. There are two modules in Mumble; the client (mumble) and the server (murmur). The client works on Windows, Linux, FreeBSD, OpenBSD, and macOS, while the server should work on anything Qt can be installed on. Low-latency and high-quality voice-chat program written on top of Qt and Opus. Administrators appreciate Mumble for being able to self-host and have control over data security and privacy. Some make use of the...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 12
    Snapcast

    Snapcast

    Synchronous multiroom audio player

    Snapcast is a multiroom client-server audio player, where all clients are time synchronized with the server to play perfectly synced audio. It's not a standalone player, but an extension that turns your existing audio player into a Sonos-like multiroom solution. Audio is captured by the server and routed to the connected clients. Several players can feed audio to the server in parallel and clients can be grouped to play the same audio stream. One of the most generic ways to use Snapcast is...
    Downloads: 44 This Week
    Last Update:
    See Project
  • 13
    WO Mic

    WO Mic

    Transform your smartphone into a PC microphone

    WO Mic is a free utility that turns your smartphone into a functional microphone for your Windows PC. It eliminates the need to buy a separate microphone, offering a convenient and cost-effective solution for voice chat, recording, or wireless voice control. The app supports multiple connection types including Wi-Fi, Bluetooth, and USB, giving users flexible options to suit their setup. Setup involves installing the mobile app and the PC client with drivers, which is straightforward and fast. ...
    Downloads: 339 This Week
    Last Update:
    See Project
  • 14
    ESP32-CAM_MJPEG2SD

    ESP32-CAM_MJPEG2SD

    ESP32 Camera motion capture application to record JPEGs to SD card

    Application for ESP32 / ESP32S3 with OV2640 / OV5640 camera to record JPEGs to SD card as AVI files and playback to the browser as an MJPEG stream. The AVI format allows recordings to replay at the correct frame rate on media players. If a microphone is installed then a WAV file is also created and stored in the AVI file. The ESP32 cannot support all of the features as it will run out of heap space. For better functionality and performance, use one of the new ESP32S3 camera boards, eg Freenove ESP32S3 Cam, and ESP32S3 XIAO Sense, but avoid no-name boards marked ESPS3 RE:1.0.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 15
    Background Music

    Background Music

    Automatically pause your music, set individual apps' volumes, etc.

    ...With Background Music running, launch QuickTime Player and select File > New Audio Recording (or New Screen Recording, New Movie Recording). Then click the dropdown menu next to the record button and select Background Music as the input device. You can record system audio and a microphone together by creating an aggregate device that combines your input device (usually Built-in Input) with the Background Music device.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 16
    Whisper-WebUI

    Whisper-WebUI

    A Web UI for easy subtitle using whisper model

    ...The platform integrates optimized implementations such as faster-whisper, significantly improving transcription speed and reducing memory usage compared to standard models. It supports multiple input sources including local files, YouTube content, and microphone input, making it versatile for different workflows. Whisper WebUI also includes advanced preprocessing and postprocessing features such as voice activity detection, background music separation, and speaker diarization, enabling more accurate and structured outputs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Note67

    Note67

    A private, local meeting notes assistant

    ...Built with a cross-platform architecture using Rust (via Tauri) for backend logic and a TypeScript/React frontend, it prioritizes privacy by performing audio transcription locally with Whisper models and generating summaries with locally-hosted AI, eliminating the need to send sensitive meeting content to external servers. Users can record meetings directly from their microphone, view live transcriptions, filter by speaker, and export structured summaries, making it useful for professionals who need searchable, organized records of discussions. It also features thoughtful signal processing such as voice activity detection and echo deduplication to improve transcription accuracy, and provides standard note-taking features.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Screenity

    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    ...Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show controls only on hover, and many other customization options. Export as mp4, gif, and webm, or save the video directly to Google Drive. Make sure the language you want to translate Screenity into is supported by the Chrome Store. Before submitting the pull request, it would be helpful if you tried running the extension in the new language, to make sure everything looks right.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    clone-voice

    clone-voice

    A sound cloning tool with a web interface, using your voice

    ...The tool supports around sixteen languages, including Chinese, English, Japanese, Korean, French, German, Italian, and others, and can capture reference voices directly from a microphone or from uploaded audio.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ​Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other form of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Ring

    Ring

    Unofficial packages for Ring Doorbells, Cameras, Alarm System

    ...The ring-client-API is a TypeScript package designed to be used by developers to create their own apps/programs which interact with Rings API. homebridge-ring allows users to integrate Ring products into Apple HomeKit via homebridge easily. With Ring you can control your home from your smartphone, tablet or PC. Each Ring device includes a camera, speakers, and an integrated microphone so you can view, listen, and speak to anyone on your property from anywhere. Ring's customizable motion sensors allow you to focus on the most important areas of your home. You will receive instant warnings as soon as your Ring device detects movement, so you are always the first to know if someone has gotten too close to your property. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Amazon Chime SDK React Components

    Amazon Chime SDK React Components

    Chime React Component Library with integrations with the Amazon SDK

    ...The Amazon Chime SDK React Component Library supplies client-side state management and reusable UI components for common web interfaces used in audio and video conferencing applications, including: video tile grids, microphone activity indicators, and call controls. All components come with a simple, modern design, and can be used as-is or restyled with a custom theme. In addition to UI components, the library leverages Reacts' state management tools such as Providers and Hooks to connect to the Amazon Chime SDK for JavaScript and pass data to the UI layer, simplifying state synchronization so that developers can concentrate on building engaging experiences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    Live API Web Console is a React starter that demonstrates how to use Gemini’s Live API over WebSockets to build real-time, multimodal experiences. The app includes modules for streaming audio playback, recording user media from the microphone, webcam, or even screen capture, and it surfaces a unified event log so you can debug the session as it flows. Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    eduMEET WebRTC meeting service

    eduMEET WebRTC meeting service

    Multiparty web-meetings using mediasoup and WebRTC

    edumeet, multiparty web meetings using mediasoup and WebRTC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    AcousticSoundboard

    AcousticSoundboard

    Free, easy to use, lightweight soundboard for Windows

    Acoustic Soundboard is a free, open source program for Windows which strives to be easy to use and lightweight. The main purpose is to conveniently play audio files to others while allowing the user to hear what is playing and still use their microphone normally. There is no installation required, simply unzip the contents wherever you like. On closing, the soundboard will automatically save the user's last devices, sounds, and hotkeys to a database file. View a demo video here https://www.youtube.com/watch?v=cPdncc-q1oM If you want others to hear your microphone and the sounds you play, you will first need to install a virtual audio cable such as this one: https://vb-audio.com/Cable/ IMPORTANT: Windows may warn you that this program contains a virus because it hooks the keyboard. ...
    Downloads: 8 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB