UniVL is a video-language pre-training model. It is designed with four modules and five objectives to support both video-language understanding and generation tasks. It is also flexible enough for most multimodal downstream tasks, balancing efficiency and effectiveness.
Features
- Finetune on YoucookII
- Documentation available
- Examples available
- Run caption task on YoucookII
- Pretrain on HowTo100M
- Licensed under the MIT License
License
MIT License