Repo of Qwen2-Audio chat & pretrained large audio language model
Toolkit for conversational AI
Autonomous agents for everyone
Chat & pretrained large audio language model proposed by Alibaba Cloud
Voice dialogue, role-playing, multi-topic discussion, picture creation
A natural language interface for computers
HTML5 JavaScript audio recording in MP3, WAV, OGG, WebM, and AMR formats
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
3D open source actuator simulation software
Application that detects musical notes from the microphone
2D open source actuator simulation software
Code for the Psygraph mobile application
Fully functional GCS (Ground Control System) for Zuppa Autopilot
Thérémine generates high-quality audio from a USB Arduino Theremin
Voice endpoint detection study project
Voice to Text Sentiment Analysis
Small and effective program for anonymizing SIP traces
Speech recognition followed by playback using speech synthesis