Search Results for "audio and video stream" - Page 3

Sort By:

Showing 1742 open source projects for "audio and video stream"

View related business solutions

Windows Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
1

Qwen2.5-Omni

Capable of understanding text, audio, vision, video

Qwen2.5-Omni is an end-to-end multimodal flagship model in the Qwen series by Alibaba Cloud, designed to process multiple modalities (text, images, audio, video) and generate responses both as text and natural speech in streaming real-time. It supports “Thinker-Talker” architecture, and introduces innovations for aligning modalities over time (for example synchronizing video/audio), robust speech generation, and low-VRAM/quantized versions to make usage more accessible. It holds state-of-the-art performance in many multimodal benchmarks, particularly spoken language understanding, audio reasoning, image/video understanding, etc. ...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
2

LiveKit

End-to-end stack for WebRTC. SFU media server and SDKs

LiveKit is an open-source project that provides a scalable, multi-user conferencing system based on WebRTC, designed to offer real-time video, audio, and data capabilities for developers.

Downloads: 5 This Week

Last Update: 4 days ago
See Project
3

TRIBE v2

A multimodal model for brain response prediction

TRIBE v2 is a multimodal foundation model developed by Meta AI for predicting human brain activity from naturalistic stimuli such as video, audio, and text. It is designed for in-silico neuroscience, enabling researchers to model how the brain responds to complex real-world inputs. The system integrates state-of-the-art encoders—including LLaMA for text, V-JEPA for video, and Wav2Vec-BERT for audio—into a unified Transformer architecture. This combined representation is mapped onto the cortical surface to predict fMRI responses across thousands of brain regions. ...

Downloads: 22 This Week

Last Update: 7 days ago
See Project
4

DistroAV

DistroAV (formerly OBS-NDI): NDI integration for OBS Studio

DistroAV is an open-source integration plugin for OBS Studio that provides Network Device Interface (NDI) support so users can send and receive live audio and video over IP networks directly within OBS, formerly known as the OBS-NDI project before being renamed. By implementing NDI input sources, dedicated output transports, and special filter modes, it allows creativity-oriented workflows such as capturing remote cameras, sharing scenes between machines, or distributing live feeds without capture cards or physical cabling. ...

Downloads: 70 This Week

Last Update: 2026-01-20
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

Remotion

Make videos programmatically with React

...The framework supports exporting to standard video formats, audio synchronization, frame callbacks, and powerful tooling for previewing and debugging, so teams can iterate quickly and reliably.

Downloads: 6 This Week

Last Update: 3 days ago
See Project
6

mpv

Command line video player

mpv is a free (as in freedom) media player for the command line. It supports a wide variety of media file formats, audio and video codecs, and subtitle types. Powerful scripting capabilities can make the player do almost anything. There is a large selection of user scripts on the wiki. While mpv strives for minimalism and provides no real GUI, it has a small controller on top of the video for basic control. mpv has an OpenGL, Vulkan, and D3D11 based video output that is capable of many features loved by videophiles, such as video scaling with popular high quality algorithms, color management, frame timing, interpolation, HDR, and more. mpv can leverage most hardware decoding APIs on all platforms. ...

Downloads: 63 This Week

Last Update: 2025-12-22
See Project
7

Streamlink

Streamlink is a CLI utility which pipes video streams

Streamlink is a command-line utility that pipes video streams from various services into a video player, such as VLC. The main purpose of Streamlink is to avoid resource-heavy and unoptimized websites, while still allowing the user to enjoy various streamed content. There is also an API available for developers who want access to the stream data. Streamlink is built upon a plugin system that allows support for new services to be easily added.

Downloads: 16 This Week

Last Update: 2026-03-05
See Project
8

GenAI Processors

GenAI Processors is a lightweight Python library

GenAI Processors is a lightweight Python library for building modular, asynchronous, and composable AI pipelines around Gemini. Its central abstraction is the Processor, a unit of work that consumes an asynchronous stream of parts (text, images, audio, JSON) and produces another stream, making it natural to chain operations and keep everything streaming end-to-end. Processors can be composed sequentially (to build multi-step flows) or in parallel (to fan-out work and merge results), which makes sophisticated agent behaviors easy to express with simple operators. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
9

AI YouTube Shorts Generator

A python tool that uses GPT-4, FFmpeg, and OpenCV

AI-YouTube-Shorts-Generator is a Python-based tool that automates the creation of short-form vertical video clips (“shorts”) from longer source videos — ideal for adapting content for platforms like YouTube Shorts, Instagram Reels, or TikTok. It analyzes input video (whether a local file or a YouTube URL), transcribes audio (with optional GPU-accelerated speech-to-text), uses an AI model to identify the most compelling or engaging segments, and then crops/resizes the video and applies subtitle overlays, producing a polished short video without manual editing. ...

Downloads: 9 This Week

Last Update: 2026-02-05
See Project
Add Two Lines of Code. Get Full APM.
AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.

Start Free
10

Spotube

Open source Spotify client that doesn't require Premium

An open source, cross-platform Spotify client compatible across multiple platforms utilizing Spotify's data API and YouTube, Piped video or JioSaavn as an audio source, eliminating the need for Spotify Premium. It is still recommended to support creators by engaging with their YouTube channels/Spotify tracks (or preferably by buying their merch/concert tickets/physical media).

Downloads: 66 This Week

Last Update: 2026-02-24
See Project
11

HunyuanVideo-Foley

Multimodal Diffusion with Representation Alignment

HunyuanVideo-Foley is a multimodal diffusion model from Tencent Hunyuan for high-fidelity Foley (sound effects) audio generation synchronized to video scenes. It is designed to generate audio that matches both visual content and textual semantic cues, for use in video production, film, advertising, games, etc. The model architecture aligns audio, video, and text representations to produce realistic synchronized soundtracks. Produces high-quality 48 kHz audio output suitable for professional use. ...

Downloads: 0 This Week

Last Update: 2025-09-28
See Project
12

iptv

Collection of publicly available IPTV channels from all over the world

iptv-org/iptv is a community-maintained index of publicly available IPTV channels organized into a giant, continuously updated set of M3U playlists. Instead of hosting streams, the project curates links already available on the public internet and classifies them by country, language, category, and channel metadata. The repository’s structure makes it easy to consume at different granularities: you can grab a global list, a country-specific list, or a themed list such as news, sports, or...

Downloads: 123 This Week

Last Update: 2026-03-27
See Project
13

HunyuanVideo-Avatar

Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

...Emotion control by extracting emotion reference images and transferring emotional style into video sequences.

Downloads: 0 This Week

Last Update: 2025-12-16
See Project
14

MediaCrate — Video/Audio Downloader

Download video and audio from over 1,000+ websites with one click

MediaCrate is a lightweight desktop application for downloading video and audio from various websites, including YouTube, Instagram, TikTok, Facebook and many others. It's rather simple to use. Paste a link, select format and quality, and download. MediaCrate is designed with performance and simplicity in mind, maintaining minimal CPU usage while idle and a small memory footprint during downloads.

2 Reviews

Downloads: 7 This Week

Last Update: 2026-02-28
See Project
15

Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM

...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.

Downloads: 6 This Week

Last Update: 2026-01-08
See Project
16

jfMedia Player

Media Player

Media Player. Plays video and audio files. Supports RTSP network streams (auto reconnecting). Includes CLI to play file or network stream.

Downloads: 4 This Week

Last Update: 2025-12-04
See Project
17

Video Download++

Free video downloader for YouTube and hundreds of other websites

Video Download++ is a Windows app that lets you download videos from various websites, including YouTube, Tiktok, Vimeo, Bilibili and various media channels. Just copy a video URL, paste it in Vividl and download the video in different qualities, convert it to other formats or to an audio track. You can also manage the parallel download of multiple videos and retrieve additonal meta information using the simple UI.

1 Review

Downloads: 26 This Week

Last Update: 2025-02-04
See Project
18

MoviePy

Video editing with Python

MoviePy is a Python module for video editing, which can be used for basic operations (like cuts, concatenations, title insertions), video compositing (a.k.a. non-linear editing), video processing, or to create advanced effects. It can read and write the most common video formats, including GIF. MoviePy is an open source software originally written by Zulko and released under the MIT licence. It works on Windows, Mac, and Linux, with Python 2 or Python 3. The code is hosted on Github, where...

Downloads: 27 This Week

Last Update: 2025-05-21
See Project
19

Subtitle-Workshop-Classic-v6.3.4

Subtitle Editor derived from 6.0c, but with VLC and Hunspell checker

Audio waveform, VLC Video Renderer, UTF8 coding, Audio stream detection and Selection, Resizeable screens, Hunspell spellcheck, Easy shortcut editing, user profiles and more than 70 filetypes supported.

Downloads: 74 This Week

Last Update: 2025-02-23
See Project
20

BlogWizard

Generate blog articles from video or audio

BlogWizard is a demo/utility project built on top of Groq’s LLM infrastructure that converts video or audio content into well-structured blog posts, enabling creators to repurpose multimedia content into text — useful for SEO, accessibility, or reaching audiences that prefer reading. The tool uses transcription (e.g. via Whisper) to extract text from audio/video, then runs an LLM-based generation pipeline to transform that content into coherent, readable blog-format posts — with sections, formatting, and possibly metadata. ...

Downloads: 0 This Week

Last Update: 2025-12-19
See Project
21

WebTorrent CLI

WebTorrent, the streaming torrent client. For the command line

WebTorrent CLI is a command-line interface for the WebTorrent protocol, enabling users to stream, download, and seed torrents directly from the terminal using Node.js. It supports both traditional BitTorrent and WebTorrent peers, bridging the gap between desktop clients and browser-based peers. With a simple and intuitive interface, WebTorrent CLI allows users to instantly stream videos (especially with built-in support for VLC, MPV, and Chromecast), making it ideal for media consumption...

Downloads: 2 This Week

Last Update: 2025-04-09
See Project
22

Bili23 Downloader

Cross platform GUI tool for downloading videos from Bilibili sites

...It can parse different types of links such as standard video pages, short links, and collection or activity pages to automatically retrieve downloadable media. It also allows users to choose video resolution, audio quality, and encoding format based on the available sources. Additional features include downloading subtitles, comments, metadata, and artwork associated with videos.

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
23

Kaset

The missing YouTube Music macOS app

Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio...

Downloads: 4 This Week

Last Update: 6 days ago
See Project
24

Internet Friendly Media Encoder

Simple, Flexible & Powerful H.265/HEVC & H266/VVC video encoder!

A versatile, future-proof, and expandable multimedia encoder designed for ease of use. Internet Friendly Media Encoder supports adding subtitles and attachments, and can function as a remuxing tool with a Copy Stream option. It allows you to combine multiple video, audio, subtitle, and attachment streams into a single file, remove unwanted streams, or incorporate subtitles from another video without extracting. The software also supports AviSynth for advanced video processing. The video and audio encoders are implemented in a plug-in style, enabling users to add their own compiled, optimized CPU architectures for increased speed. ...

9 Reviews

Downloads: 31 This Week

Last Update: 2025-11-29
See Project
25

HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

HunyuanVideo is a cutting-edge framework designed for large-scale video generation, leveraging advanced AI techniques to synthesize videos from various inputs. It is implemented in PyTorch, providing pre-trained model weights and inference code for efficient deployment. The framework aims to push the boundaries of video generation quality, incorporating multiple innovative approaches to improve the realism and coherence of the generated content. Release of FP8 model weights to reduce GPU...

1 Review

Downloads: 8 This Week

Last Update: 2025-09-23
See Project