Capable of understanding text, audio, vision, video
End-to-end stack for WebRTC. SFU media server and SDKs
A multimodal model for brain response prediction
DistroAV (formerly OBS-NDI): NDI integration for OBS Studio
Make videos programmatically with React
Command line video player
Streamlink is a CLI utility which pipes video streams
GenAI Processors is a lightweight Python library
A python tool that uses GPT-4, FFmpeg, and OpenCV
Open source Spotify client that doesn't require Premium
Multimodal Diffusion with Representation Alignment
Collection of publicly available IPTV channels from all over the world
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Download video and audio from over 1,000+ websites with one click
Qwen3-omni is a natively end-to-end, omni-modal LLM
Media Player
Free video downloader for YouTube and hundreds of other websites
Video editing with Python
Subtitle Editor derived from 6.0c, but with VLC and Hunspell checker
Generate blog articles from video or audio
WebTorrent, the streaming torrent client. For the command line
Cross platform GUI tool for downloading videos from Bilibili sites
The missing YouTube Music macOS app
Simple, Flexible & Powerful H.265/HEVC & H266/VVC video encoder!
HunyuanVideo: A Systematic Framework For Large Video Generation Model