AudioMuse-AI is an Open Source Dockerized environment
A set of AI-enabled effects, generators, and analyzers for Audacity
AI tool converting video/audio into structured documents instantly
Audio foundation model excelling in audio understanding
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Sample code and notebooks for Generative AI on Google Cloud
Open-source framework for intelligent speech interaction
Large Audio Language Model built for natural interactions
Curated AI engineering notes on LLMs, generative models, and tools
LLM-based Reinforcement Learning audio edit model
A gallery that showcases on-device ML/GenAI use cases
Multi-modal large language model designed for audio understanding
A blazing fast AI Gateway with integrated guardrails
Use API to call the music generation AI of suno.ai
Official Python inference and LoRA trainer package
Spring AI Alibaba examples for building and testing AI apps
A python tool that uses GPT-4, FFmpeg, and OpenCV
Framework for building real-time voice and multimodal AI agents
Implementation of AudioLM audio generation model in Pytorch
Multimodal Diffusion with Representation Alignment
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Family of Open Sourced Music Foundation Models
Secure open source cloud runtime for AI apps & AI agents
The official Python library for the OpenAI API