Showing 48 open source projects for "video clip"

  • 1
    ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    ...This integration empowers non-programmers and rapid-iteration teams to harness the performance of LTX-Video while maintaining the clarity and flexibility of a dataflow graph model. It supports nodes for common video operations like trimming, layering, color grading, and generative augmentations, making it suitable for everything from simple clip edits to complex sequences with conditional behavior.
    Downloads: 3 This Week
  • 2
    Auto Synced & Translated Dubs

    Automatically translates the text of a video based on a subtitle file

    ...Using the timestamps of each subtitle line, it computes the required duration of each spoken segment and synthesizes audio via neural TTS services, producing one audio clip per subtitle entry. The tool then time-stretches or compresses each TTS clip to match the original speech duration exactly, which preserves lip-sync and rhythm as closely as possible without manual editing. Finally, it combines all the clips into a single dubbed audio track that can be muxed with the original video, along with new translated subtitle files.
    Downloads: 0 This Week
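
The duration-matching step described above can be sketched as follows. This is a minimal illustration, not the project's own code: it assumes SRT-style timestamps, and the names `SubtitleLine`, `parse_srt_timestamp`, and `stretch_factor` are hypothetical. It only computes the speed multiplier each TTS clip would need; the actual audio stretching is left to whatever audio tool is used.

```python
from dataclasses import dataclass


@dataclass
class SubtitleLine:
    """One subtitle entry (hypothetical structure for this sketch)."""
    start_ms: int
    end_ms: int
    text: str


def parse_srt_timestamp(ts: str) -> int:
    """Convert an SRT timestamp such as '00:01:02,500' to milliseconds."""
    hms, millis = ts.split(",")
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + int(millis)


def stretch_factor(line: SubtitleLine, tts_duration_ms: int) -> float:
    """Speed multiplier that makes a TTS clip fit the subtitle slot.

    A value above 1.0 means the clip must be sped up (compressed);
    a value below 1.0 means it must be slowed down (stretched).
    """
    target_ms = line.end_ms - line.start_ms
    if target_ms <= 0:
        raise ValueError("subtitle line has a non-positive duration")
    return tts_duration_ms / target_ms
```

For example, a 2.5-second TTS clip that has to fit a 2-second subtitle slot gets a factor of 1.25, meaning it must play 25% faster.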
  • 3
    Bili23 Downloader

    Cross-platform GUI tool for downloading videos from Bilibili sites

    Bili23-Downloader is an open source desktop application designed for downloading video content from the Bilibili platform. It provides a graphical interface that allows users to download various types of media including user-uploaded videos, series episodes, movies, and other hosted content. It focuses on ease of use with a zero-configuration setup, making it accessible to both beginners and experienced users. It supports high-performance downloads through multi-threading and includes resume...
    Downloads: 5 This Week
  • 4
    Vidi2

    Large Multimodal Models for Video Understanding and Editing

    Vidi is a family of large multimodal models developed for deep video understanding and editing tasks, integrating vision, audio, and language to allow sophisticated querying and manipulation of video content. It’s designed to process long-form, real-world videos and answer complex queries such as “when in this clip does X happen?” or “where in the frame is object Y during that moment?” — offering temporal retrieval, spatio-temporal grounding (i.e. locating objects over time + space), and even video question answering. ...
    Downloads: 0 This Week
  • 5
    Perception Models

    State-of-the-art Image & Video CLIP, Multimodal Large Language Models

    Perception Models is a state-of-the-art framework developed by Facebook Research for advanced image and video perception tasks. It introduces two primary components: the Perception Encoder (PE) for visual feature extraction and the Perception Language Model (PLM) for multimodal decoding and reasoning. The PE module is a family of vision encoders designed to excel in image and video understanding, surpassing models like SigLIP2, InternVideo2, and DINOv2 across multiple benchmarks. Meanwhile,...
    Downloads: 3 This Week
  • 6
    ComfyUI

    The most powerful and modular diffusion model GUI, API, and backend

    The most powerful and modular diffusion model GUI and backend. This UI will let you design and execute advanced Stable Diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, CLI, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...
    Downloads: 183 This Week
  • 7
    comfyui-mixlab-nodes

    Workflow and speech recognition app

    ...It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints and Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, and CLIP interrogation.
    Downloads: 2 This Week
  • 8
    UForm

    Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion

    ...This type of model is well suited to retrieval in large collections. The most famous example of such models is CLIP by OpenAI. Early-fusion models encode both modalities jointly so they can take into account fine-grained features. Usually, these models are used for re-ranking relatively small retrieval results. Mid-fusion models are the golden midpoint between the previous two types. Mid-fusion models consist of two parts: unimodal and multimodal.
    Downloads: 0 This Week
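
As a rough illustration of why independently encoded embeddings (the CLIP-style models mentioned above) suit retrieval over large collections: once every item has a precomputed vector, a query only needs one similarity score per item. The sketch below is not UForm's API. The function names and the toy two-dimensional vectors are assumptions made for illustration, and real embeddings would come from a trained encoder.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query_vec, item_vecs, top_k=2):
    """Return the names of the top_k items most similar to the query.

    item_vecs maps an item name to its precomputed embedding.
    """
    scored = [(cosine_similarity(query_vec, vec), name)
              for name, vec in item_vecs.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:top_k]]


# Toy embeddings standing in for encoder output (hypothetical values).
items = {"cat": [1.0, 0.0], "dog": [0.0, 1.0], "kitten": [0.9, 0.1]}
top = retrieve([1.0, 0.0], items)
```

A slower model that scores query and item jointly could then re-rank only the short list in `top`, which matches the re-ranking role the description gives to early-fusion models.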
  • 9
    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. A door to the multimodal world: a super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, and 3D mesh data. It is the foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt, etc. ...
    Downloads: 0 This Week
  • 10
    HunyuanWorld 1.0

    Generating Immersive, Explorable, and Interactive 3D Worlds

    ...HunyuanWorld-1.0 surpasses existing open-source methods in visual quality and geometric consistency, demonstrated by superior scores in BRISQUE, NIQE, Q-Align, and CLIP metrics.
    Downloads: 3 This Week
  • 11
    ClipCutter

    Simple Video Cutter

    Navigate to the start point you want for the new video, click start, do the same for the end, then save the new video file. Free forever, no personal info needed. Sometimes you want to snip a part of a video and don't want to pay someone or open a bloated program to do so.
    Downloads: 6 This Week
  • 12
    VCT - Video Converter & Transcoder

    Transcode or convert files to H.264 or H.265 HEVC MKV/MP4 - FFmpeg GUI

    Open source video/audio converter GUI based on FFmpeg (included). Works natively on Windows and Linux (with mono-complete installed). Features: batch convert an audio/video collection to H.264 or H.265 HEVC, AAC/MP3, etc.; transcode MKV to MP4 or MP4/M4V to MKV in 1 minute; rotate video; resize video; extract audio from a movie clip; extract a specific part of a video/audio file; convert various audio files to MP3 or AAC; select the audio stream to encode with video; add a subtitle stream; easily create conversion tasks; pause/resume the encoding queue; play files; record AV streams from the Internet (Record tab). The FFmpeg command is exposed and fully editable. The 64-bit version is now the default; the 32-bit and XP versions are separate downloads. For more details on usage, please check the README. Short video demonstration: https://user-images.githubusercontent.com/10812743/103661971-a568b900-4f6f-11eb-8c11-ac92ea41944b.mp4 NOTE: Windows will warn you about security. ...
    Downloads: 54 This Week
  • 13
    Auto Movie Assembler

    Automatically make many trailer-like videos with a single click!

    This program can mass-create multiple promotional movies at once using only these elements:
    - Pre-recorded .mp4 video clips.
    - Title card .png image file.
    - Ending card .png image file.
    - Sound effect 1 that plays during the Title card.
    - Sound effect 2 that plays during the Ending card.
    It joins the video clips in alphabetical order, applies a Fade from Black transition to each of them individually, and places a Title Card with its sound effect after the first clip and an Ending Card with a stylish Fade from White effect, also with its own sound effect. ...
    Downloads: 1 This Week
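
The assembly order described above can be sketched as a plain timeline builder. This is an illustrative sketch only, not the program's own code: `build_timeline`, the dictionary keys, and the `sfx1`/`sfx2` labels are hypothetical names, and no video is actually rendered here.

```python
def build_timeline(clips, title_card, ending_card):
    """Return the ordered list of segments for one promotional movie.

    Clips are joined in alphabetical order, each with a fade from black.
    The title card follows the first clip; the ending card closes the movie.
    """
    ordered = sorted(clips)
    if not ordered:
        raise ValueError("at least one video clip is required")

    timeline = []
    for index, clip in enumerate(ordered):
        timeline.append({"source": clip, "transition": "fade_from_black"})
        if index == 0:
            # Title card and its sound effect go right after the first clip.
            timeline.append({"source": title_card, "transition": None, "sound": "sfx1"})

    # Ending card with fade from white and its own sound effect.
    timeline.append({"source": ending_card, "transition": "fade_from_white", "sound": "sfx2"})
    return timeline


segments = build_timeline(["b.mp4", "a.mp4"], "title.png", "end.png")
order = [segment["source"] for segment in segments]
```

Here `order` lists the sources in playback sequence: the alphabetically first clip, the title card, the remaining clips, then the ending card.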
  • 14
    xjadeo

    The X Jack Video Monitor

    X JAck viDEo mOnitor: xjadeo is a program that displays a video clip in sync with an external time source, most notably jack-transport, LTC, and MTC. Applications include soundtrack composition/editing, video monitoring, and video installations.
    Downloads: 11 This Week
  • 15
    Image Optimizer
    ...It utilizes open-source tools such as ImageMagick, with a modified binary search tree algorithm. Inspired by HandBrake, it has been compiled for Windows 11 with Python 3.13.0. It works best with JPEG image files. See the intro video clip for how it works.
    Downloads: 2 This Week
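
The description does not spell out its modified algorithm, so the following is only a sketch of the general idea of searching for a compression setting: a plain binary search over a JPEG quality value to find the highest quality that still fits a size budget. It is not the project's method. `best_quality` is a hypothetical name, and `size_at` is a stand-in callback that a real tool would implement by encoding the image at the given quality and measuring the resulting size.

```python
def best_quality(size_at, max_bytes, lo=1, hi=95):
    """Return the highest quality in [lo, hi] whose output fits max_bytes.

    size_at(quality) must return the encoded size in bytes and is assumed
    to be non-decreasing in quality. Returns None if nothing fits.
    """
    best = None
    while lo <= hi:
        mid = (lo + hi) // 2
        if size_at(mid) <= max_bytes:
            best = mid      # fits: remember it and try a higher quality
            lo = mid + 1
        else:
            hi = mid - 1    # too large: try a lower quality
    return best


def fake_size(quality):
    """Toy size model for demonstration: 1000 bytes per quality step."""
    return quality * 1000


chosen = best_quality(fake_size, max_bytes=50_500)
```

The search needs only about seven encodes to cover the 1 to 95 range, instead of one encode per quality level.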
  • 16
    Animation Compression Library

    Animation Compression Library

    Animation compression is a fundamental aspect of modern video game engines. Not only is it important to keep the memory footprint down but it is also critical to keep the animation clip sampling performance fast. The more memory an animation clip consumes, the slower it will be to sample it and extract a character pose at runtime. For these reasons, any game that attempts to push the boundaries of what the hardware can achieve will at some point need to implement some form of animation compression. ...
    Downloads: 0 This Week
  • 17

    Clipstitch

    Utility to make home movies from your digital camera files

    Full documentation: download clipstitchX.Y.html. Clipstitch makes movies from your camera (or phone) video files. FFmpeg is a professional-quality, free, open-source program for video editing, with the ability to implement a huge number of operations and handle every data format! This kind of ability comes at a cost: its commands are quite complex-looking and difficult to use and remember. Clipstitch runs as a front-end to ffmpeg so that you use only the subset of ffmpeg commands necessary for making a home movie from your digital camera, puts them in easier-to-read form, and internally combines multiple ffmpeg commands to do certain tasks. ...
    Downloads: 0 This Week
  • 18
    Aphantasia

    CLIP + FFT/DWT/RGB = text to image/video

    This is a collection of text-to-image tools, evolved from the artwork of the same name. Based on the CLIP model and the Lucent library, with FFT/DWT/RGB parameterizations (no-GAN generation). Illustrip (text-to-video with motion and depth) has been added, as has DWT (wavelet) parameterization. See also the Colabs below, with VQGAN and SIREN+FFM generators. Tested on Python 3.7 with PyTorch 1.7.1 or 1.8. Features: generation of massive detailed textures à la DeepDream, full HD/4K resolutions and above, various CLIP models (including multi-language from SBERT), a continuous mode to process phrase lists (e.g. illustrating lyrics), and pan/zoom motion with smooth interpolation. ...
    Downloads: 0 This Week
  • 19
    Pot-O MusiQT

    Official Repository for Pot-O MusiQT

    Pot-O MusiQT is a lightweight yet feature-rich desktop music player built with Python and PyQt5, designed for users who want a clean interface, strong playlist control, and practical everyday playback features without unnecessary complexity. It focuses on local media playback, fast interaction, and keyboard-friendly operation, while still offering modern conveniences such as metadata handling, lyrics viewing, and smooth playback transitions.
    Downloads: 0 This Week
  • 20
    Pearl Desktop (PDE) 12

    The Stable Solid Multimedia Workhorse Powerful OS with Eye Candy

    Pearl Linux Desktop (PDE) 12 is based on Ubuntu 24.04 LTS. This is your go-to workhorse daily driver for advanced as well as new Linux users. We say YES to APT, Flatpak, and AppImages but NO to Snaps. It features Firefox ESR instead of Firefox and uses PulseAudio by default; install the package pearl-pipewire-config from our repo to make PipeWire your default sound server. Very smooth and easy configs. Compiz is the default window manager and you may switch window managers without...
    Downloads: 20 This Week
  • 21
    Linux Air Combat

    Combat Flight Simulator for LINUX. WW2. Network and VOIP. Open-source.

    ...Control flight and views with mouse and keyboard, console game controller, or joystick, all easily configured from prominent configuration menus. Learn all about LAC from our primary, official Web Page: http://askmisterwizard.com/2019/LinuxAirCombat/LacOnlineDocs.htm Recent video clip: https://youtu.be/C77C-e8axWs
    Downloads: 79 This Week
  • 22
    Quick Subtitles

    HTML5 Based Subtitle Creation Tool

    Quick Subtitles is an HTML5-based solution for rapid creation and syncing of subtitles while playing your video. It is designed around the concept that you should minimize the need to take your hands off the keyboard while performing data entry.
    Downloads: 1 This Week
  • 23
    Motionity

    The web-based motion graphics editor for everyone

    Motionity is an open-source, browser-based motion-graphics editor designed to bring animation and video editing power to the web, blending the intuitiveness of drag-and-drop tools with advanced capabilities akin to desktop motion editors. It allows users to assemble video or image assets, layer them, and apply animation via keyframes, with customizable easing, giving fine-grained control over motion timing. In addition to keyframing, Motionity supports layer masking, filters (color...
    Downloads: 2 This Week
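
Keyframe animation with easing, as mentioned above, generally means interpolating a property between two keyframes and reshaping the progress with an easing curve. The sketch below shows that general technique only. It is not Motionity's code, and `sample`, `linear`, and `ease_in_out_quad` are names chosen for this illustration.

```python
def linear(t):
    """No easing: progress maps straight through."""
    return t


def ease_in_out_quad(t):
    """Quadratic ease-in-out: slow start, fast middle, slow end."""
    return 2 * t * t if t < 0.5 else 1 - ((-2 * t + 2) ** 2) / 2


def sample(keyframes, time, easing=linear):
    """Interpolate a value at `time` from (time, value) keyframes.

    keyframes must be sorted by time. Values are clamped to the first
    and last keyframe outside the animated range.
    """
    if not keyframes:
        raise ValueError("at least one keyframe is required")
    if time <= keyframes[0][0]:
        return keyframes[0][1]
    if time >= keyframes[-1][0]:
        return keyframes[-1][1]
    for (t0, v0), (t1, v1) in zip(keyframes, keyframes[1:]):
        if t0 <= time < t1:
            progress = (time - t0) / (t1 - t0)
            return v0 + (v1 - v0) * easing(progress)
    return keyframes[-1][1]


# A layer's x position moving from 0 to 100 over two seconds.
x_keys = [(0.0, 0.0), (2.0, 100.0)]
```

With linear easing the layer is at 25.0 after half a second, while `ease_in_out_quad` places it at 12.5, since the eased motion starts more slowly.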
  • 24
    AI Atelier

    AI art creation software based on Disco Diffusion

    Based on Disco Diffusion, we have developed a Chinese and English version of the AI art creation software "AI Atelier". We offer both text-to-image models (Disco Diffusion and VQGAN+CLIP) and text-to-text models (GPT-J-6B and GPT-NeoX-20B) as options. The complete source code of licensed works and modifications, including larger works using a licensed work, must be made available under the same license. Copyright and license notices must be preserved. When a modified version is used to provide a...
    Downloads: 0 This Week
  • 25
    LiVES

    LiVES is a Video Editing System. It is designed to be simple to use, y

    LiVES mixes realtime video performance and non-linear editing in one professional-quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJs and video editors: mix and switch clips from the keyboard, use dozens of realtime effects, trim and edit your clips in the clip editor, and bring them together using the multitrack timeline. ...
    Downloads: 6 This Week