Repo of Qwen2-Audio chat & pretrained large audio language model
Curated collection of Amazing Python scripts
Fast multimodal LLM for real-time voice interaction and AI apps
Virtual AI anchor that combines state-of-the-art technology
AI suite powered by state-of-the-art models and providing advanced AI
Chat & pretrained large audio language model proposed by Alibaba Cloud
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Toolkit for conversational AI
Autonomous agents for everyone
A natural language interface for computers
A Claude skill that automatically posts personalized comments
HTML5 JavaScript audio recording in MP3, WAV, OGG, WebM, and AMR formats
Convert VoIP calls to text and analyze them with AI
Toolkit for audio, music, and speech generation
Live analysis of pitches, harmonics, chords, and keys
2D open source actuator simulation software
3D open source actuator simulation software
Application that detects musical notes from the microphone
Voice dialogue, role-playing, multi-topic discussion, picture creation
General Speech Restoration
Code for the Psygraph mobile application
Fully functional GCS (Ground Control System) for Zuppa Autopilot
Thérémine generates high-quality audio from a USB Arduino Theremin