Showing 37 open source projects for "voice analysis"

View related business solutions
  • The All-in-One Commerce Platform for Businesses - Shopify Icon
    The All-in-One Commerce Platform for Businesses - Shopify

    Shopify offers plans for anyone that wants to sell products online and build an ecommerce store, small to mid-sized businesses as well as enterprise

    Shopify is a leading all-in-one commerce platform that enables businesses to start, build, and grow their online and physical stores. It offers tools to create customized websites, manage inventory, process payments, and sell across multiple channels including online, in-person, wholesale, and global markets. The platform includes integrated marketing tools, analytics, and customer engagement features to help merchants reach and retain customers. Shopify supports thousands of third-party apps and offers developer-friendly APIs for custom solutions. With world-class checkout technology, Shopify powers over 150 million high-intent shoppers worldwide. Its reliable, scalable infrastructure ensures fast performance and seamless operations at any business size.
    Learn More
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 1
    Qwen2-Audio

    Qwen2-Audio

    Repo of Qwen2-Audio chat & pretrained large audio language model

    Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models. It is evaluated on many benchmarks (speech recognition, translation, sound...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    NVIDIA NeMo

    NVIDIA NeMo

    Toolkit for conversational AI

    NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Eliza

    Eliza

    Autonomous agents for everyone

    Build and deploy autonomous AI agents with consistent personalities across Discord, Twitter, and Telegram. Full support for voice, text, and media interactions. Built-in RAG memory system, document processing, media analysis, and autonomous trading capabilities. Supports multiple AI models including Llama, GPT-4, and Claude. Create custom actions, add new platform integrations, and extend functionality through a modular plugin system. Full TypeScript support.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Qwen-Audio

    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    ... without task-specific fine‐tuning. It includes features such as flexible multi-run chat, audio understanding/reasoning, music appreciation, and also tool usage (e.g. voice editing).
    Downloads: 2 This Week
    Last Update:
    See Project
  • Get the most trusted enterprise browser Icon
    Get the most trusted enterprise browser

    Advanced built-in security helps IT prevent breaches before they happen

    Defend against security incidents with Chrome Enterprise. Create customizable controls, manage extensions and set proactive alerts to keep your data and employees protected without slowing down productivity.
    Download Chrome
  • 5
    Feishu ChatGPT

    Feishu ChatGPT

    Voice dialogue, role-playing, multi-topic discussion, picture creation

    Feishu × (GPT-3.5 + DALL·E + Whisper) = flying-like work experience. Voice dialogue, role-playing, multi-topic discussion, picture creation, table analysis, document export. Golang language, it goes without saying! Master the gin framework proficiently, developing the backend is as natural as breathing! Familiar with the SDKs of DingTalk, Feishu, Qiwei and other platforms, and be able to develop and integrate a series of amazing functions! Proficient in platform-based detail thinking, let...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Open Interpreter

    Open Interpreter

    A natural language interface for computers

    Open Interpreter is an open-source tool that provides a natural-language interface for interacting with your computer. It lets large language models (LLMs) run code locally (Python, JavaScript, shell, etc.), enabling you to ask your computer to do tasks like data analysis, file manipulation, browsing, etc. in human terms (“chat with your computer”), with safeguards. Runs locally or via configured remote LLM servers/inference backends, giving flexibility to use models you trust or have locally...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ... of browser (including PWA, WebClip, any App) on low-version iOS (11.0-14.2) except Safari inside page). Provides multiple plug-in function support. Rich audio visualization, variable speed and pitch processing, speech recognition, audio stream playback, etc.; with powerful real-time processing support, it can be used in various web applications: from simple recording to complex real-time voice Recognition (ASR), and even audio-related games, are handled with ease.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    MiniCPM-o

    MiniCPM-o

    A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming

    ... text and audio inputs to generate outputs in various forms, including voice cloning, emotion control, and interactive role-playing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DoSA-3D

    DoSA-3D

    3D open source actuator simulation software

    DoSA-3D is a 3D open source software for magnetic force analysis of actuators and solenoids. Not only individuals but also companies can use the program for free and participate in the development of it themselves. The program environment is developed to be similar to that of product development, so even product developers who have not majored in analysis can easily analyze the magnetic force of actuators. In DoSA-3D, three programs are connected and operated as follows. - DoSA-3D...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 10
    InstrumentalMusic

    InstrumentalMusic

    Application which detects musical notes from the microphone.

    Application which detects musical notes from the microphone. It allows listening to the microphone and play the detected notes to output (in midi). Multilanguage support. Zoom Dark mode option JDK-17 compatibility With v1.2 it includes a pitch shifter (making voice lower or sharper through a slider) There is a demo video which shows how it works (the demo video can be visited from Help menu of the application) You can also see the pitch-shifter demo version here: https...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    DoSA-2D

    DoSA-2D

    2D open source actuator simulation software

    DoSA-2D is a two-dimensional open source software for magnetic force analysis of actuators and solenoids. Not only individuals but also companies can use the program for free and participate in the development of it themselves. The program environment is developed to be similar to that of product development, so even product developers who have not majored in analysis can easily analyze the magnetic force of actuators or solenoids. DoSA-2D is responsible for an easy working environment...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    vocoder_chung
    vocoder chung is a small educational vocoder using discrete fourier transform FFT spectrum written in easy fast compiled freebasic . (24/12/2019) uses fast and accurate FFTdll.dll (28/03/2020) algorythmic voice cloning / change / morphing experiment added
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Psygraph

    Psygraph

    Code for the Psygraph mobile application

    Psygraph is a Personal Data Collector (PDC) and activity timer. It includes a stopwatch, timer, counter, and note taker (voice recorder), each of which collects data from the device’s sensors (e.g. the device velocity and location (via GPS)). Although the interface is simple (a button or two on each screen), the data is saved for later analysis and display (you can store and view the data on WordPress). It is a scientific instrument that is easy to use. It makes an excellent general-purpose...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ZuppaGCS V 1.5.4 Alpha

    ZuppaGCS V 1.5.4 Alpha

    Fully Functional GCS(Ground Control System) for Zuppa Autopilot

    This is Fully Functional Ground Control Station Designed for the Zuppa Autopilot , Zuppa FLAME (Farm and Large Area Mapping Solution) and the Zuppa DUSTER (Pesticide Spraying Drone). It has data analyzer tools as well as playback and post flight analysis features. It is compatible for a typical Window Laptop or Windows tab (8" and above) . It has Both offline and online map operation modes , additionally you can also download maps using the ZUPPA GCS software. Very Ergonomically...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Thérémine

    Thérémine

    Thérémine generates high quality audio from an USB Arduino Theremin

    ... is to extract stationnary, looping samples from recorded wave files. It allows importing new instrumental tones into Thérémine, as long as the source recording contains reasonable data. Two demo sample banks are provided with Thérémine featuring violin and voice tones. Both were made using Loop bank manager.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    voice enpoint detect

    voice enpoint detect study project

    This project contains a pitch detect and a voice endpoint detect algorithm. Also codes for a digital filter and drawing font on bitmap file. It is just for study, nothing more, so be have fun!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Speech Sentiment Analysis

    Voice to Text Sentiment Analysis

    Voice to text Sentiment analysis converts the audio signal to text to calculate appropriate sentiment polarity of the sentence. The code currently works on one sentence at a time. Sentiment scoring is done on the spot using a speaker. The Speech to text processing system currently being used is the MS Windows speech to text converter. However significant modifications can be made for audio recognition by a refined signal processing system. The sentiment operator in textblob is used...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    SIP Anonymization Tool (SiAnTo)

    Small and effective program for SIP traces anonymization

    The Session Initiation Protocol (SIP) is a signaling communications protocol widely used nowadays for controlling multimedia communication sessions such as voice and video calls over Internet Protocol (IP) networks. A good way to design optimization techniques for SIP deployment would be to analyze SIP traffic from existing networks. However, publicly available analyses of SIP traffic are rare and thus not a lot of knowledge exists about typical behavior of a SIP server (as opposed...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    ReconnaissanceVocale

    Reconnaissance vocale puis restitution avec synthèse vocale

    Ce programme est une démonstration de la reconnaissance vocale de windows 7 en Français N.B il est très important de faire l'apprentissage de votre voix pour avoir de bons résultats. Ce programme reconnait votre voix puis transmet à un logiciel de synthèse vocale l'ordinateur répète ce que vous dites. voir la vidéo http://youtu.be/_bGJxT1ulLY This program is a demonstration of the speech recognition in windows 7 in French. Note it is very important to learn your voice to have good...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    1.) Investigation with cosine transform, and anti transform algorithm, with some voice recognition code. 2.) Translator: Croatian, English. 3.) 2D to 3D picture algorithm (principle) and new 2Dto3D video conversion code with AviSynth video scripting
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    The open source, multimodal interactive "Sensitive Artificial Listener" dialogue system created by the EU project SEMAINE.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    C4 is a C++ class library for analyzing sound files, particularly spoken and sung phonations. C4 provides features such as frequency analysis, pitch extraction, or calculation of voice quality parameters (e.g. alpha ratio, HNR, jitter, etc.).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    DawNLITE is a Natural-Language-based Image Transmoding Engine. The software transforms an image to a video as recorded by a virtual camera panning and zooming over the image, following a natural language text description of the image.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.