Search Results for "whisper-bin-x64.zip"

Sort By:

Showing 2479 open source projects for "whisper-bin-x64.zip"

View related business solutions

Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection.

Downloads: 69 This Week

Last Update: 2025-06-26
See Project
2

Whisper-WebUI

A Web UI for easy subtitle using whisper model

Whisper WebUI is an open-source browser-based interface that simplifies the use of Whisper speech recognition models by providing an intuitive graphical environment for transcription, translation, and subtitle generation. Built with Gradio, it allows users to upload audio or video files, process them locally, and generate accurate text outputs without relying on command-line tools.

Downloads: 6 This Week

Last Update: 2026-03-18
See Project
3

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. ...

Downloads: 463 This Week

Last Update: 2026-03-19
See Project
4

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps

Multilingual Automatic Speech Recognition with word-level timestamps and confidence. Whisper is a set of multi-lingual, robust speech recognition models trained by OpenAI that achieve state-of-the-art results in many languages. Whisper models were trained to predict approximate timestamps on speech segments (most of the time with 1-second accuracy), but they cannot originally predict word timestamps. This repository proposes an implementation to predict word timestamps and provide a more accurate estimation of speech segments when transcribing with Whisper models. ...

Downloads: 0 This Week

Last Update: 2025-09-09
See Project
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
5

Zip.js

JavaScript library to zip and unzip files

Zip.js is a JavaScript library for creating, reading, and editing ZIP files directly in the browser or in Node.js environments, supporting various compression methods and encryption.

Downloads: 1 This Week

Last Update: 2026-03-05
See Project
6

Insanely Fast Whisper

An opinionated CLI to transcribe Audio files w/ Whisper on-device

Insanely Fast Whisper is a high-performance command-line tool designed to dramatically accelerate speech-to-text transcription using OpenAI’s Whisper models on local hardware. It leverages modern optimizations such as batch processing, mixed precision, and advanced attention mechanisms like Flash Attention to significantly reduce inference time while maintaining high transcription accuracy.

Downloads: 0 This Week

Last Update: 11 hours ago
See Project
7

sharp for AWS Lambda Layers

npm sharp for AWS Lambda layers, separated builds for arm64 and x64

The prebuilt sharp node module for AWS Lambda layer. npm sharp for AWS Lambda layers, separated builds for arm64 and x64, minified and bundled with esbuild.

Downloads: 0 This Week

Last Update: 2025-11-07
See Project
8

get-bin-path

Get the current package's binary path

Get the current package's binary path (using the package.json bin field). This is useful when testing a package's binary. Using get-bin-path (as opposed to hard-coding the path to the binary). Validates that the package.json bin field is correctly setup. Decouples the binary path from the tests, which allows moving the file without rewriting the tests. This package is an ES module and must be loaded using an import or import() statement, not require().

Downloads: 0 This Week

Last Update: 2025-03-23
See Project
9

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently.

Downloads: 12 This Week

Last Update: 2026-03-17
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
10

7-Zip

A free file archiver for extremely high compression

7-Zip is a file archiver with a high compression ratio. 7-Zip works in Windows 11 / 10 / 8 / 7 / Vista / XP / 2022 / 2019 / 2016 / 2012 / 2008 / 2003 / 2000. You can use 7-Zip on any computer, including a computer in a commercial organization. You don't need to register or pay for 7-Zip. And there is command line version for Linux and macOS.

829 Reviews

Downloads: 11,346 This Week

Last Update: 2026-02-19
See Project
11

Final2x

2^x Image Super-Resolution

...The tool is available in English, Chinese, and Japanese, allowing users from different countries to enjoy the benefits of super-resolution. The tool is available for Windows x64/arm64, MacOS x64/arm64, and Linux x64, allowing users to enjoy the benefits of super-resolution regardless of their operating system.

Downloads: 25 This Week

Last Update: 2025-10-05
See Project
12

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 25 This Week

Last Update: 2025-12-05
See Project
13

Go OpenAI

OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go

This library provides Go clients for OpenAI API. OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go.

Downloads: 0 This Week

Last Update: 2025-08-29
See Project
14

WhisperKit

On-device Speech Recognition for Apple Silicon

WhisperKit is a Swift package that integrates OpenAI's popular Whisper speech recognition model with Apple's CoreML framework for efficient, local inference on Apple devices. Whisper has pulled the future forward when fast, free and virtually error-free translation and transcription will be ubiquitous. It inspired numerous developers to improve and deploy it with minimal friction and maximum performance.

Downloads: 2 This Week

Last Update: 2026-03-13
See Project
15

Meetily

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper

...It’s built for organizations that want meeting intelligence without sending recordings or transcripts to third-party cloud services, which helps address compliance and data sovereignty requirements. The app supports live transcription with local model options (including Whisper- and Parakeet-based workflows) and presents the transcript as the meeting happens, making it useful both for note-taking and accessibility. After or during the session, it can produce structured, AI-generated summaries, and it’s designed to be flexible about where that summarization comes from, supporting local providers as well as external endpoints when allowed by policy.

Downloads: 16 This Week

Last Update: 2026-02-11
See Project
16

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper

WhisperSpeech is an open-source text-to-speech system created by “inverting” OpenAI’s Whisper, reusing its strengths as a semantic audio model to generate speech instead of only transcribing it. The project aims to be for speech what Stable Diffusion is for images: powerful, hackable, and safe for commercial use, with code under Apache-2.0/MIT and models trained only on properly licensed data. Its architecture follows a token-based, multi-stage pipeline inspired by AudioLM and SPEAR-TTS: Whisper is used to produce semantic tokens, EnCodec compresses the waveform into acoustic tokens, and Vocos reconstructs high-fidelity audio from those tokens. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
17

swords for whisper

Downloads: 6 This Week

Last Update: 2025-12-15
See Project
18

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

...WhisperJAV introduces a specialized pipeline that separates text generation from timestamp alignment, allowing the system to generate transcripts and then align them with audio using forced alignment techniques. The framework supports several speech recognition models, including Qwen-based ASR systems and fine-tuned Whisper models trained on domain-specific dialogue.

Downloads: 6 This Week

Last Update: 2026-03-19
See Project
19

Whisper-Studio

Another whisper wrapper, built fully in C++, with some neat features.

a native lightweight C++ application for OpenAI's Whisper, with a few new things like transcribing audio in real-time, identifying speakers, auto-paste transcriptions, and a few other things. Its not the prettiest app, I suck at design, but it gets the job done.

Downloads: 2 This Week

Last Update: 2026-02-07
See Project
20

Handy STT

A free, open source, and extensible speech-to-text application

...Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. Its backend leverages OpenAI’s Whisper models for GPU-accelerated speech recognition and Parakeet V3 for efficient CPU-only transcription with automatic language detection. To further refine accuracy and responsiveness, Handy integrates Silero’s Voice Activity Detection (VAD) for silence filtering, ensuring only speech segments are processed.

Downloads: 68 This Week

Last Update: 1 day ago
See Project
21

wsldl

Advanced WSL launcher / installer. (Win10 FCU x64/arm64 or later.)

Advanced WSL launcher/installer. (Win10 FCU x64/arm64 or later.) Requires Windows 10 1709 Fall Creators Update or later(x64/arm64). Windows Subsystem for Linux feature is enabled.

Downloads: 2 This Week

Last Update: 7 days ago
See Project
22

Zip-Ada

A standalone, portable Ada library for .zip archives

Zip-Ada is a library for .zip archives. Full sources are in Ada and are unconditionally portable. Input and output can be any stream (file, buffer,...) for archive creation as well as data extraction. Task safe and endian-neutral. More information on: http://unzip-ada.sf.net Alire crate: https://alire.ada.dev/crates/zipada Mirror: https://github.com/zertovitch/zip-ada/

1 Review

Downloads: 72 This Week

Last Update: 2026-03-14
See Project
23

Hyprnote

Local-first AI Notepad for Private Meetings

Hyprnote is an open-source, privacy-first AI notepad app designed for taking notes during meetings—transcribing audio (microphone and system) and generating context-rich summaries using on-device AI models like Whisper and HyprLLM, all without any data leaving your machine.(turn0search7, turn0search1). Listens to your meetings while you write. Crafts smart summaries based on your quick notes. Runs completely offline using open-source models like Whisper or HyprLLM. Use approved third-party APIs like Gemini, Claude, or Azure-hosted GPT.

Downloads: 17 This Week

Last Update: 2 days ago
See Project
24

emuiibo

Virtual amiibo (amiibo emulation) system for Nintendo Switch

...If (for whatever reason) you would like to disable a virtual amiibo from being recognised by emuiibo, just remove the flag file, and create it again to enable it. While old emuiibo formats are supported and converted to the current format (see above), it is strongly suggested to, unless bin dumps might be indispensable, emuiigen be used, our PC utility designed to create and edit virtual amiibos.

Downloads: 67 This Week

Last Update: 2026-03-14
See Project
25

HeartMuLa

A Family of Open Sourced Music Foundation Models

...The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. It also introduces HeartCLAP, which aligns audio and text into a shared embedding space.

Downloads: 14 This Week

Last Update: 2026-03-05
See Project