HunyuanCustomTencent
|
Wan2.1Alibaba
|
|||||
Related Products
|
||||||
About
HunyuanCustom is a multi-modal customized video generation framework that emphasizes subject consistency while supporting image, audio, video, and text conditions. Built upon HunyuanVideo, it introduces a text-image fusion module based on LLaVA for enhanced multi-modal understanding, along with an image ID enhancement module that leverages temporal concatenation to reinforce identity features across frames. To enable audio- and video-conditioned generation, it further proposes modality-specific condition injection mechanisms, an AudioNet module that achieves hierarchical alignment via spatial cross-attention, and a video-driven injection module that integrates latent-compressed conditional video through a patchify-based feature-alignment network. Extensive experiments on single- and multi-subject scenarios demonstrate that HunyuanCustom significantly outperforms state-of-the-art open and closed source methods in terms of ID consistency, realism, and text-video alignment.
|
About
Wan2.1 is an open-source suite of advanced video foundation models designed to push the boundaries of video generation. This cutting-edge model excels in various tasks, including Text-to-Video, Image-to-Video, Video Editing, and Text-to-Image, offering state-of-the-art performance across multiple benchmarks. Wan2.1 is compatible with consumer-grade GPUs, making it accessible to a broader audience, and supports multiple languages, including both Chinese and English for text generation. The model's powerful video VAE (Variational Autoencoder) ensures high efficiency and excellent temporal information preservation, making it ideal for generating high-quality video content. Its applications span across entertainment, marketing, and more.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Digital content creators and filmmakers wanting a solution to generate personalized, subject-consistent videos using multi-modal inputs
|
Audience
AI researchers, content creators, and developers looking to integrate high-performance, scalable video generation tools into their projects, with applications across entertainment, marketing, and multimedia
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationTencent
Founded: 1998
China
hunyuancustom.github.io
|
Company InformationAlibaba
Founded: 1999
China
wan.video
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Auralume AI
CUDA
Everlyn
Hugging Face
Hunyuan T1
HunyuanVideo
Promptus
SiliconFlow
VidFlux AI
WaveSpeedAI
|
Integrations
Auralume AI
CUDA
Everlyn
Hugging Face
Hunyuan T1
HunyuanVideo
Promptus
SiliconFlow
VidFlux AI
WaveSpeedAI
|
|||||
|
|
|