ERNIE-4.5-VL-424B-A47B-Base-PT is a multimodal Mixture-of-Experts (MoE) model developed by Baidu and post-trained for both text and visual tasks. It builds on the pretraining of ERNIE 4.5, applying modality-specific post-training techniques to optimize for general-purpose natural language processing and vision-language reasoning. The model employs a heterogeneous MoE architecture with modality-isolated routing and loss-balancing mechanisms so that experts activate efficiently and specialize per modality. Of its 424 billion total parameters, 47 billion are active per token, and it supports large context windows and deep cross-modal understanding. Key training strategies include FP8 mixed precision, fine-grained recomputation, and advanced quantization methods for efficient inference. It supports both "thinking" and "non-thinking" visual modes, allowing it to handle tasks ranging from pure text generation to image-aware reasoning.
Features
- Multimodal model supporting both text and image inputs
- 424B parameters with 47B activated per token
- Fine-tuned for cross-modal comprehension and generation
- Heterogeneous MoE architecture with modality-isolated routing
- Trained using FP8 mixed precision and hybrid parallelism
- Supports context lengths up to 131,072 tokens
- Supports supervised fine-tuning (SFT), Direct Preference Optimization (DPO), and Unified Preference Optimization (UPO) post-training techniques
- Apache 2.0 license for commercial and research use
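The modality-isolated routing listed above can be sketched in a few lines: text tokens are gated only over the text expert pool and vision tokens only over the vision expert pool, with top-k selection per token. The expert names, dimensions, gate weights, and top-k value below are illustrative assumptions for the sketch, not ERNIE's actual configuration.

```python
import math
import random

random.seed(0)

DIM = 8  # toy hidden size (illustrative only)
TEXT_EXPERTS = [f"text-{i}" for i in range(4)]
VISION_EXPERTS = [f"vision-{i}" for i in range(4)]

# Random gate vectors, one per expert (stand-ins for learned router weights).
gates = {name: [random.gauss(0, 1) for _ in range(DIM)]
         for name in TEXT_EXPERTS + VISION_EXPERTS}

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route(hidden, modality, top_k=2):
    """Modality-isolated routing: a text token can only reach text experts,
    a vision token only vision experts. Returns (expert, weight) pairs."""
    pool = TEXT_EXPERTS if modality == "text" else VISION_EXPERTS
    logits = [sum(h * g for h, g in zip(hidden, gates[e])) for e in pool]
    probs = softmax(logits)
    ranked = sorted(zip(pool, probs), key=lambda p: p[1], reverse=True)
    return ranked[:top_k]

token = [random.gauss(0, 1) for _ in range(DIM)]
print(route(token, "text"))    # only text-* experts are ever selected
print(route(token, "vision"))  # only vision-* experts are ever selected
```

Because the pools are disjoint, one modality's tokens can never crowd out the other's experts, which is the motivation for isolating the routing; the loss-balancing mechanisms mentioned above additionally keep load spread evenly within each pool.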