Showing 11 open source projects for "speaker detection"

  • 1
    sherpa-onnx

    Speech-to-text, text-to-speech, and speaker recognition

    Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.
    Downloads: 164 This Week
  • 2
    Whisper-WebUI

    A web UI for easy subtitle generation using the Whisper model

    ...It supports multiple input sources including local files, YouTube content, and microphone input, making it versatile for different workflows. Whisper WebUI also includes advanced preprocessing and postprocessing features such as voice activity detection, background music separation, and speaker diarization, enabling more accurate and structured outputs.
    Downloads: 5 This Week
  • 3
    Note67

    A private, local meeting notes assistant

    ...Built with a cross-platform architecture using Rust (via Tauri) for backend logic and a TypeScript/React frontend, it prioritizes privacy by performing audio transcription locally with Whisper models and generating summaries with locally-hosted AI, eliminating the need to send sensitive meeting content to external servers. Users can record meetings directly from their microphone, view live transcriptions, filter by speaker, and export structured summaries, making it useful for professionals who need searchable, organized records of discussions. It also features thoughtful signal processing such as voice activity detection and echo deduplication to improve transcription accuracy, and provides standard note-taking features.
    Downloads: 4 This Week
  • 4
    Glint Translator
    ...It supports 240+ languages using DeepL, Google, OpenAI, Azure, and Google Gemini models. The interface is available in 18 languages. Features: 3 translation modes: Fluent (parallel), Area (overlay), Full Screen (smart detection); speaker detection with color-coding; Glint AI custom terminology control; game-based profile system; low CPU/RAM usage, optimized for Windows 10/11. Live Subtitle (Real-Time Voice Translation): real-time speech-to-text translation for games, movies, and voice chats. Automatically detects audio, converts speech to text, and translates it instantly. ...
    Downloads: 20 This Week
  • 5
    wukong-robot

    Chinese voice dialogue robot/smart speaker project

    wukong-robot is a Chinese voice assistant / smart speaker project built to let makers and hackers design highly customizable voice-controlled devices. It combines wake-word detection, automatic speech recognition, natural language understanding, and text-to-speech into a single framework aimed at the Chinese-speaking ecosystem. The project is positioned as a simple, flexible, and elegant platform that can run on devices like Raspberry Pi and other Linux-based boards, making it suitable for DIY smart speakers and home-automation hubs. ...
    Downloads: 0 This Week
  • 6
    footswitch2

    Audio transcription software for Linux (VLC) with a foot pedal

    Footswitch 2 is a media player for transcribers on Linux. Written in Python using the Python bindings for VLC, it lets a transcriber control audio or video with a USB foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/footswitch. Control of the media player from LibreOffice can be via hotkeys or an integrated...
    Downloads: 6 This Week
  • 7
    footswitch3

    Audio transcription software for Linux (GStreamer) with a foot pedal

    Footswitch 3 is a media player for transcribers on Linux. Written in Python using the Python bindings for GStreamer, it lets a transcriber control audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/footswitch. Control of the media player from LibreOffice can be via hotkeys or an integrated...
    Downloads: 0 This Week
  • 8
    rims-arduino-library

    Recirculation infusion mash system library for Arduino

    This library implements RIMS controls for home brewers. For a definition of RIMS, see https://tinyurl.com/j3lyuyc. For me, an Arduino microcontroller plus an LCD Keypad shield was cheaper and a lot more customizable than a commercial PID controller, so with this library a commercial PID controller is unnecessary. An automatic PID tuning toolkit is also included. Temperature can be read with a thermistor, a resistance temperature detector (RTD), or any custom temperature probe. Heater is...
    Downloads: 0 This Week
  • 9
    Automatic Volume Mixer

    A tool for automating the Windows Volume Mixer.

    Automatic Volume Mixer is a tool that automates the Windows Volume Mixer based on user-defined rules. You can open the Volume Mixer by right-clicking on the speaker icon in the system tray and selecting Open Volume Mixer. This application is an automatic version of that applet. Common usage examples: pausing your audio player (e.g. foobar2000) whenever any other application makes a noise, and resuming playback once the noise is gone. This enables you to keep your audio player...
    Downloads: 7 This Week
  • 10

    Bero iOS Open Source Control App

    iOS app development for Bero

    This open source project is about controlling the Bero (Be The Robot) device from an iOS device. Bero is a five-motor humanoid robot that also comes with an SD card, a speaker, and infrared detection, so it has a lot of potential for developers to explore. We are now making the app open source so that developers can use and customize their own Bero app to make it more impressive!
    Downloads: 0 This Week
  • 11
    DefendLineII
    A powerful, multifunctional, reliable, expandable, and extremely flexible hardware platform based on the ATMEL ATmega1280, for home and industrial process automation, robotic toys, security systems, education, and enjoyment.
    Downloads: 1 This Week