Showing 20 open source projects for "whisper desktop"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Meetily

    Meetily

    Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper

    This project is a privacy-first AI meeting assistant that captures meeting audio, produces real-time transcripts, and generates summaries while keeping processing entirely on your own machine or infrastructure. It’s built for organizations that want meeting intelligence without sending recordings or transcripts to third-party cloud services, which helps address compliance and data sovereignty requirements. The app supports live transcription with local model options (including Whisper- and...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 2
    Speech Note

    Speech Note

    Speech Note Linux app. Note taking, reading and translating

    Speech Note is a Linux desktop and Sailfish OS application for taking, reading, and translating notes with integrated offline speech technology. It combines speech-to-text, text-to-speech, and machine translation in a single interface, allowing users to dictate notes, listen back to them, and translate them without ever sending data to the cloud. All processing is done locally, which means audio, text, and translations never leave the device, emphasizing strong privacy guarantees. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3

    Whisper-Transcriber-Tool

    Desktop application that converts video files into accurate text

    WhisperTranscriber is a free, powerful desktop application that converts video files into accurate text using OpenAI's Whisper AI model. Perfect for journalists, researchers, students, content creators, and anyone who needs reliable transcription. KEY FEATURES: - High-accuracy AI transcription with 99+ language support - Works completely offline - no internet required, total privacy - Supports all common video formats (mp4) - Batch processing for multiple files - Automatic language detection - Drag & drop interface - Export as SRT formats - No file size limits PORTABLE VERSION: - No installation needed - Run from USB or any folder - FFmpeg and AI models included - Lightweight and fast WHY CHOOSE WHISPERTRANSCRIBER: ✓ 100% free forever - no subscriptions or hidden costs ✓ Complete privacy - all processing happens on your computer ✓ No account or registration required ✓ Professional-grade accuracy ✓ Works offline
    Downloads: 15 This Week
    Last Update:
    See Project
  • 4
    Whisper Batch Transcriber

    Whisper Batch Transcriber

    Unlimited, private and free Speech-To-Text program

    ## About: Automatically transcribe all of your voice recordings into clean, organized, neat text files. It's free, fully automated, unlimited, using state-of-the-art speech-to-text technology. Works 100% offline on your computer, privately and locally. ## Usecases: Convert speeches, podcasts, webinars, monologues, storytellings and other audio speech into a formatted .txt file. One sentence per new line. ## Notes: - Its 2GB in size and requires 2-6GB of GPU VRAM too. (basically...
    Downloads: 20 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    web3.js

    web3.js

    Ethereum JavaScript API

    web3.js is the Ethereum JavaScript API that connects to the Generic JSON-RPC spec. It is composed of a selection of libraries that make it possible to interact with a local or remote ethereum node, using a HTTP or IPC connection. The node may be local, hosted by the DApp provider, or a public gateway such as Infura, which operates free Ethereum access points. It is necessary to run a local or remote Ethereum node to be able to use this library. web3.js is directly usable on web technology...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Transcripciones con Whisper Esta aplicación de escritorio basada en web permite transcribir (o transcribir y traducir al ingles), archivos de audio o video utilizando el modelo Whisper de OpenAI. Transcriptions with Whisper This web-based desktop application allows you to transcribe—or both transcribe and translate into English—audio or video files using OpenAI's Whisper model.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    AutoSubs

    AutoSubs

    Instantly generate AI-powered subtitles on your device

    AutoSubs is an open-source, AI-powered subtitle generation tool that enables users to automatically transcribe audio and video content into accurate, editable subtitles directly on their device. It supports both standalone usage and integration with professional video editing software such as DaVinci Resolve, allowing creators to generate and edit subtitles within their existing workflows. The tool leverages speech-to-text models, including OpenAI Whisper, to produce high-quality...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    stt

    stt

    Voice Recognition to Text Tool

    stt is a standalone speech recognition tool that locally converts spoken content in audio or video files into textual formats without requiring internet access, giving users control over their data and reducing reliance on external APIs. It leverages open-source speech models such as Faster-Whisper to recognize and transcribe human speech into plain text, structured JSON objects, or subtitle files with time codes, making it suitable for both personal and professional transcription tasks. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    RunAnywhere

    RunAnywhere

    Production ready toolkit to run AI locally

    RunAnywhere SDKs are a set of cross-platform development tools that enable applications to run artificial intelligence models directly on user devices instead of relying on cloud infrastructure. The toolkit allows developers to integrate language models, speech recognition, and voice synthesis capabilities into mobile or desktop applications while keeping all computation local. By running models entirely on device, the platform eliminates network latency and protects user data because...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    clipwise
    Search through images and videos backed by local LLMs. Private, fast, offline-first — powered by bge-large embeddings, moondream vision, and Whisper transcription.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Amica

    Amica

    Amica is an open source interface for interactive communication

    ...Users can import VRM character models, adjust their appearance, tune the voice to match the character, and define behavior using different large language models and TTS backends. Under the hood, Amica leverages modern web and desktop technologies: three.js and three-vrm for 3D rendering, Transformers.js for running models in the browser, Whisper and Silero VAD for speech recognition and voice-activity detection, and a variety of LLM backends such as llama.cpp servers, ChatGPT-compatible APIs, Ollama, KoboldCpp, and others. It also integrates multiple text-to-speech providers, including ElevenLabs, OpenAI, Coqui, RVC, and AllTalkTTS.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    KoboldCpp

    KoboldCpp

    Run GGUF models easily with a UI or API. One File. Zero Install.

    KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.
    Leader badge
    Downloads: 413 This Week
    Last Update:
    See Project
  • 13
    VATSG

    VATSG

    Video automatic transcribe and translated subtitle generator

    It generates srt format subtitle from videofile which can be any source language that whisper support , and then make translated subtitle file of your target language which deepl support. This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    SmartFink

    SmartFink

    The Best Asterisk Desktop Managing and Monitoring App

    SmartFink is the best Asterisk Monitoring and Managing App for your Desktop, It has many features like Drag & Drop, Extensions Status, Queue Status, Number Dialing, Recording, Barge & Whisper ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    PFurc is a plugin for the multimessager Pidgin allowing you to connect to the MMOSG Furcadia. It allows you to add buddies to your buddy list and chat with them using Furcadia's whisper system, similar to other solutions provided by Furcadia Proxies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A light-weight yet feature-rich Second Life chat client.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This is just a viewer I'm making for my own use. It's based off of SG 2.0. It's not a griffer client or l337 script kiddie client or anything like that. It has UI tweaks, shift+click/minimap/nearby people teleport functionality, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    WHISPER is a modular software that handles either local or remote data streams. For now, it comes with a VoIP application, using two core libraries. Recents developments (WHISPER+) intend to provide a interface to Guile and a GUI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Smoke is a C++ Mac/Win game library built on top of OpenGL and parts of the Whisper application framework.The first test app will be a 3D WorldForge client.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Whisper is for keeping your private communications private. Whisper is designed to be easy to use (no PKI). Also Whispers can be written on paper if you have to. You don't need your correspondent to generate a key before you can Whisper.
    Downloads: 34 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB