Showing 17 open source projects for "speaker design"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    VoxCPM

    VoxCPM

    TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

    ...It supports zero-shot voice cloning from a short reference audio clip, capturing timbre, accent, and pacing to closely mimic a target speaker without per-speaker fine-tuning.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    jekyll-theme-conference

    jekyll-theme-conference

    Jekyll template for a conference website containing program

    This is a responsive Jekyll theme based on Bootstrap 4 for conferences. All components such as talks, speakers or rooms are represented as collection of files. The schedule is given is defined via a simple structure stored in a YAML file. There is no need for databases and once generated the website consists only of static files. A script and workflows are available for easy import, e.g. of frab-compatible schedules. The design is easily customizable and is adapted for mobile uses and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PowerPoint-ist

    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    PPTist is a web-based presentation editing application designed to replicate many of the commonly used features found in traditional slide presentation software. It allows users to create, edit, and present slide decks directly within a web browser while maintaining a desktop-like editing experience. PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. PPTist supports a wide...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    Coqui TTS

    Coqui TTS

    A deep learning toolkit for Text-to-Speech, battle-tested in research

    TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality. TTS comes with pre-trained models, tools for measuring dataset quality and is already used in 20+ languages for products and research projects. High-performance Deep Learning models for Text2Speech tasks. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings...
    Downloads: 20 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    vits_chinese

    vits_chinese

    Best practice TTS based on BERT and VITS

    vits_chinese is an implementation of the VITS end-to-end text-to-speech (TTS) architecture tailored for Chinese (and possibly multilingual) speech synthesis. VITS is a model combining variational autoencoders (VAEs), normalizing flows, adversarial learning, and a stochastic duration predictor — a design that enables generation of natural, expressive speech, capturing variations in rhythm and prosody. By customizing or porting VITS for Chinese, this project aims to produce high-quality TTS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    wukong-robot

    wukong-robot

    Chinese voice dialogue robot/smart speaker project

    wukong-robot is a Chinese voice assistant / smart speaker project built to let makers and hackers design highly customizable voice-controlled devices. It combines wake-word detection, automatic speech recognition, natural language understanding, and text-to-speech into a single framework aimed at the Chinese-speaking ecosystem. The project is positioned as a simple, flexible, and elegant platform that can run on devices like Raspberry Pi and other Linux-based boards, making it suitable for DIY smart speakers and home-automation hubs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Loudspeaker Design Calculations Toolkit

    Loudspeaker Design Calculations Toolkit

    LDCT is a cross-platform tool for common speaker design calculations.

    LDCT is an open source, cross-platform application containing many calculations commonly used in loudspeaker design.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 8

    ShiftRC

    Simple cheap DIY RC receiver/handler.

    ShiftRC is a opensource design for a very cheap, small and simplistic rc driver/rc statemashine. mimics Nikola Tesla's boat it uses a shift chip with multiple outputs and one input for the radio or IR signal. This project is something I wanted to do for a while to make cheap and light weight custom rc chips, and I decided to make it a real project when I found a wooden single use spoon which I wanted to turn into a boat.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    reveal-jekyll

    reveal-jekyll

    Online presentation for GitHub Pages and Jekyll in Markdown

    Transforms Markdown files into presentation slides using reveal.js and Jekyll. The theme is based on Solarized Colors (by Ethan Schoonover) containing a light and a dark theme. reveal-jekyll is ready for GitLab Pages as well as GitHub Pages. Jekyll is a simple, blog-aware, static site generator perfect for personal, project, or organization sites. Think of it like a file-based CMS, without all the complexity. Jekyll takes your content, renders Markdown and Liquid templates, and spits out a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    DC-TTS

    DC-TTS

    TensorFlow Implementation of DC-TTS: yet another text-to-speech model

    ...It follows the “Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention” paper, but the author adapts and extends the design to make it practical for real experiments. The model is split into two networks: Text2Mel, which maps text to mel-spectrograms, and SSRN (spectrogram super-resolution network), which converts low-resolution mel-spectrograms into high-resolution magnitude spectrograms suitable for waveform synthesis. Training scripts, data loaders, and hyperparameter configurations are provided to reproduce results on several datasets, including LJ Speech for English, a Korean single-speaker dataset, and audiobook data from Nick Offerman and Kate Winslet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    ServerGRIS

    Spatilization tool in in 2D and 3D

    The ServerGRIS is an external multichannel sound spatializer for different configurations of speakers in 2D (plane mode: X and Y axis) or 3D (vertical mode: X, Y and Z axis). It is based on the JackRouter HAL plugin which is installed at the same time as the Server. The Server may include up to 128 inputs and outputs. The movements are sent from the SpatGRIS in OSC mode to the Server. The audio spatialization itself is made by the ServerGRIS and sent to the audio interface. VBAP, BINAURAL...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MultiMedia Logic

    MultiMedia Logic

    A MultiMedia Logic Design System

    ...The MultiMedia Logic Simulator has taken this one step further and introduced devices that connect directly to your computers’ real devices (e.g. Keyboard, Screen, Serial Ports) including MultiMedia ones (PC speaker, Wave, Bitmaps). The intent of this system is not to necessarily build the logic circuit you design. The intent is to use what you build, to allow experimentation, to learn and to have fun.
    Leader badge
    Downloads: 223 This Week
    Last Update:
    See Project
  • 13
    Java apple computer emulator

    Java apple computer emulator

    Cross-platform Apple //e emulator

    NOTE: This is the old Swing version. The current version of Jace can be found here: https://github.com/badvision/jace Because of malware injections in other Sourceforge projects, the author has removed all binaries from Sourceforge. You may look at the legacy source still hosted here but are highly encouraged to check out the new version hosted in Github for recent updates on this project. Thank you for your interest in this project! ][ Infinitum!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    codec2demo

    real-time demonstration of the early Codec2 codec

    This project was created in 2011 on Google Code to provide a PC-based real-time demonstration of the then new Codec2. A user could speak into a microphone and hear the encode->decode result on the speaker. Codec2 has since gone through subsequent design iterations and there are PC software tools for ham radio operators.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Speaker Workshop is an application for testing speaker drivers using a Windows sound card, then designing and simulating loudspeakers and enclosures and creating and optimizing crossovers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Control your pc-speaker in Delphi in design- and run-time mode!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SDAK is a small app for making the formulas for making speaker cabinets simple.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB