Showing 56 open source projects for "text to talk"

  • 1
    Talk to Figma MCP

    AI bridge enabling Cursor agents to read and modify Figma designs

    ...Developers can automate tasks such as creating UI elements, updating text, organizing layout structures, or managing annotations inside a design file. It also provides strategies and helper prompts that guide AI agents in performing more complex design-related workflows.
    Downloads: 1 This Week
  • 2
    Handy STT

    A free, open source, and extensible speech-to-text application

    Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active...
    Downloads: 68 This Week
  • 3
    dockserver-talk

    Python module to interface with a Slocum glider dockserver

    ...Surfalarm: a program that can send text messages or make direct phone calls when a glider surfaces and connects to a dockserver, is in a mission, or has aborted. Serial2rudics: a helper program that lets a glider simulator connected to a serial port appear as a glider connecting and disconnecting via RUDICS/Iridium when at the surface or underwater, respectively.
    Downloads: 0 This Week
  • 4
    PyGPT

    Open source personal AI Assistant for Linux, Windows and Mac

    PyGPT is a desktop application that lets you talk to OpenAI's LLMs, such as GPT-4 and GPT-3, using your own computer and the OpenAI API. It supports both chat mode and completion mode, and can generate images using DALL-E 2. PyGPT also gives GPT access to the Internet via the Google Custom Search API and the Wikipedia API, and includes voice synthesis using the Microsoft Azure Text-to-Speech API.
    Downloads: 9 This Week
  • 5
    SillyTavern

    LLM Frontend for Power Users

    Mobile-friendly, Multi-API (KoboldAI/CPP, Horde, NovelAI, Ooba, OpenAI, OpenRouter, Claude, Scale), VN-like Waifu Mode, Horde SD, System TTS, WorldInfo (lorebooks), customizable UI, auto-translate, and more prompt options than you'd ever want or need. Optional Extras server for more SD/TTS options + ChromaDB/Summarize. SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters...
    Downloads: 485 This Week
  • 6
    Polyglot

    Cross-platform AI language practice app

    Polyglot is a cross platform AI language practice application that runs as a desktop app and also offers a web version. It is built around conversational large language models and Azure based text to speech services, turning them into an interactive environment for speaking practice in multiple languages. Users can define custom AI personas, choose languages, and configure their own OpenAI and Azure keys so they retain control over which backends they use. The app supports speech recognition...
    Downloads: 3 This Week
  • 7
    Certbot

    Get free HTTPS certificates forever from Let's Encrypt

    Certbot is a fully-featured, easy-to-use, extensible client for the Let's Encrypt CA. It fetches a digital certificate from Let’s Encrypt, an open certificate authority launched by the EFF, Mozilla, and others. This certificate then lets browsers verify the identity of web servers and ensures secure communication over the Web. Obtaining and maintaining a certificate is usually such a hassle, but with Certbot and Let’s Encrypt it becomes automated and hassle-free. With just a few simple...
    Downloads: 112 This Week
  • 8
    ELisp Tree-sitter

    Tree-sitter bindings for Emacs Lisp

    ...Structural editing (like Paredit, or even better) for non-Lisp code. More informative indexing for imenu. The author of Tree-sitter articulated its merits a lot better in this Strange Loop talk. The minor mode tree-sitter-mode provides a buffer-local syntax tree, which is kept up-to-date with changes to the buffer’s text. Run M-x tree-sitter-hl-mode to replace the regex-based highlighting provided by font-lock-mode with tree-based syntax highlighting.
    Downloads: 0 This Week
  • 9
    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. ...
    Downloads: 1 This Week
  • 10
    Dicio assistant

    Dicio assistant app for Android

    Dicio is a free and open source voice assistant for Android that focuses on strong privacy by running its understanding and response generation directly on the device whenever possible. It supports multiple input and output methods, including hotword-based voice input using the Vosk speech-to-text engine and a graphical interface for users who prefer to tap instead of talk. The assistant is built around a flexible “skills” system that lets it respond to a wide variety of requests such as search, weather, navigation, calculator, timers, media control, and more. It offers rich multilingual support and is already available in a wide range of languages including English, Spanish, German, French, Italian, Russian, and others. ...
    Downloads: 3 This Week
  • 11
    OpenAI-Compatible Edge-TTS API

    Free, high-quality text-to-speech API endpoint to replace OpenAI

    OpenAI-Compatible Edge-TTS API is a local, OpenAI-compatible text-to-speech API that uses edge-tts—Microsoft Edge’s online TTS service—as the backend. The project emulates the /v1/audio/speech endpoint used by OpenAI, so any client that can talk to the OpenAI TTS API can be redirected to this service with minimal changes. It exposes parameters for input text, voice selection, audio format, and playback speed, mirroring the OpenAI interface while mapping popular OpenAI voice names to equivalent Edge voices. ...
    Downloads: 1 This Week
  • 12
    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    ...Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show controls only on hover, and many other customization options. Export as mp4, gif, and webm, or save the video directly to Google Drive. ...
    Downloads: 7 This Week
  • 13
    asciinema

    Open source terminal session recorder

    ...Forget old screen recording methods and resulting blurry videos. asciinema lets you record your terminal sessions the right way, which is right where you work, in the terminal. Recording is as easy as running one command, and since it’s purely text-based you can copy and paste any content you want, simply pause the recording! You can also easily share your recordings on the web, embed an asciicast player in your blog post, project documentation page or in your conference talk slides. See plenty of example sessions recorded with asciinema here: https://asciinema.org/
    Downloads: 0 This Week
  • 14
    VideoChat

    Real-time voice interactive digital human

    VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head. It is built as a Gradio Python demo, exposing a web interface where users can talk to an animated avatar that lip-syncs to synthesized speech while responding intelligently. ...
    Downloads: 0 This Week
  • 15
    Rivatuner

    RivaTuner Statistics Server

    Built around a statistics server, these applications are designed to expose the full functionality of your graphics card and to provide controls that help you get the most out of your GPU. RivaTuner also supports overclocking and hardware debugging for NVIDIA video cards...
    Downloads: 1,354 This Week
  • 16
    texttalk

    Talk by typing text

    Speaks typed text in a generated voice. The audio of the spoken text is obtained from the Google Translate web service. Useful for remote support, as a voice proxy for privacy, etc.
    Downloads: 0 This Week
  • 17
    ViaVoip

    A portable peer to peer voice-chat/walkie-talkie.

    ViaVoip is a simple Voice over IP application for when you need to talk, chat, or send files over the internet but can't or don't want to use any third-party services. Its peer-to-peer design lets the two endpoints connect directly to each other, without any central server or account registration. It runs on Windows, Linux, Mac OS X and Android, and is portable; that is, you don't need any setup, just get a copy and run it from any storage...
    Downloads: 8 This Week
  • 18
    Obsidian MEDILIG

    Obsidian MEDILIG (Medical Life Guard) app: local-first, offline-first

    Obsidian MEDILIG (Medical Life Guard): an easy-to-use, flexible, secure, cross-platform medical knowledge management system for patients' Electronic Health Records (EHR). It is built on top of the Obsidian application and supports the design, implementation, and use of autonomous, multilingual clinical documents from primary care to continuing care.
    Downloads: 3 This Week
  • 19
    PyCAPGE

    PyCAPGE - Python Classic Adventure Point and Click Game Engine

    PyCAPGE (Python Classic Adventure Point and Click Game Engine) is a versatile, open-source framework designed for creating retro-style 2D graphic adventures using Python and Pygame. Inspired by the golden age of SCUMM games, it features a customizable 9-verb interface and robust inventory management. Key features include a Scene Manager supporting parallax scrolling, walk-behind masks, and depth-based character scaling. It implements intelligent Pathfinding to navigate complex...
    Downloads: 0 This Week
  • 20
    Feishu ChatGPT

    Voice dialogue, role-playing, multi-topic discussion, picture creation

    Feishu × (GPT-3.5 + DALL·E + Whisper) = a work experience that feels like flying. Voice dialogue, role-playing, multi-topic discussion, picture creation, table analysis, document export. Written in Golang, naturally. Proficient with the gin framework, so backend development comes as naturally as breathing. Familiar with the SDKs of DingTalk, Feishu, Qiwei and other platforms, and able to develop and integrate a range of features. Proficient in platform-based detail thinking, let...
    Downloads: 1 This Week
  • 21
    ChatFred

    Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting

    Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more. Access ChatGPT, DALL·E 2, and other OpenAI models. Language models often give wrong information. Verify answers if they are important. Talk with ChatGPT via the cf keyword. Answers will show as Large Type. Alternatively, use the Universal Action, Fallback Search, or Hotkey. To generate text with InstructGPT models and see results in-line, use the cft keyword. ⤓ Install on the Alfred Gallery or download it over GitHub and add your OpenAI API key. If you have used ChatGPT or DALL·E 2, you already have an OpenAI account. ...
    Downloads: 0 This Week
  • 22
    Google Translate

    Google Translate is used to translate any text to any language.

    Dynamically translate between multiple languages with Google machine learning. Get a fast and dynamic translation that adapts to your content needs. Translation enables organizations to dynamically translate from one language to another. To do this, use pre-trained Google machine learning models or custom models. Talk to the world connecting with people, places, and cultures across language barriers. The Translator file can help you with long texts, difficult pronunciations, and even upload...
    Downloads: 690 This Week
  • 23
    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 19 This Week
  • 24
    AIML_chung

    An AIML chatbot engine with 3D avatars, maths parser, speech, and DLL

    AIML chung is a full AIML 1.0-based standalone chatbot engine trial with DLL, TTS/eSpeak speech voices, synonym substitution, a maths parser, and photorealistic 3D OpenGL avatars, written in compiled FreeBASIC. It comes with GUI window and console examples, a 3D world mode, and a DLL version to use with other programming languages such as C++ or Liberty BASIC, or to embed easily in your applications. Talk with your A.I. computer.
    Downloads: 2 This Week
  • 25
    Camvoice

    CamVoice: free live video chat rooms and live text chat

    CamVoice is a live video streaming social network with millions of active users in thousands of video chat rooms. Video chat with anyone in the CamVoice community, whether they're on an iPhone, iPod Touch, iPad, Android, Mac or PC. It lets you live stream your special moments, talk live with your friends, or make a video call. Features at a glance: - With CamVoice you can video chat with millions of people from different countries and make free video calls with your friends or...
    Downloads: 0 This Week