ERNIE-4.5-VL-424B-A47B-PT is a large-scale multimodal mixture-of-experts (MoE) model developed by Baidu that integrates advanced language and vision capabilities. With 424 billion total parameters and 47 billion activated per token, it builds on the ERNIE 4.5 MoE foundation and adds strong image-text interaction for complex reasoning and generation tasks. A structured post-training process, combining Supervised Fine-tuning (SFT) with Reinforcement Learning with Verifiable Rewards (RLVR), improves alignment and performance across diverse use cases.

The model supports both thinking and non-thinking inference modes, enabling flexible and interpretable outputs in real-world applications. Its heterogeneous MoE architecture uses modality-isolated routing and a token-balanced loss so that the text and visual components can be trained jointly and efficiently.
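Modality-isolated routing means a token is scored only against the expert pool for its own modality. The following is a minimal toy sketch of that idea; the expert counts, hidden size, and scoring function here are illustrative assumptions, not ERNIE 4.5's actual router.

```python
# Toy sketch of modality-isolated top-k MoE routing (illustrative only;
# expert counts, dimensions, and scoring are assumptions, not the real model).
import numpy as np

rng = np.random.default_rng(0)

N_TEXT_EXPERTS = 4      # hypothetical text-expert pool size
N_VISION_EXPERTS = 4    # hypothetical vision-expert pool size
HIDDEN = 8
TOP_K = 2

# Separate router weights per modality: text tokens are scored only
# against text experts, vision tokens only against vision experts.
W_text = rng.standard_normal((HIDDEN, N_TEXT_EXPERTS))
W_vision = rng.standard_normal((HIDDEN, N_VISION_EXPERTS))

def route(token: np.ndarray, modality: str) -> list:
    """Return top-k expert indices within the token's own modality pool."""
    W = W_text if modality == "text" else W_vision
    scores = token @ W
    return list(np.argsort(scores)[-TOP_K:][::-1])

text_token = rng.standard_normal(HIDDEN)
vision_token = rng.standard_normal(HIDDEN)
print(route(text_token, "text"))      # indices into the text-expert pool
print(route(vision_token, "vision"))  # indices into the vision-expert pool
```

Because the two pools never compete for the same tokens, the token-balanced loss can balance load within each modality independently.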
## Features
- 424B total parameters with 47B activated per token
- Multimodal input: supports both vision and text tasks
- Post-trained with SFT and RLVR for improved alignment
- Switchable thinking mode for flexible reasoning depth
- Built with PaddlePaddle and supports FastDeploy
- Uses modality-isolated routing and a token-balanced loss
- Compatible with vLLM and supports 4-bit/8-bit quantization
- Handles long-context sequences up to 131,072 tokens
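Given the vLLM compatibility and 131,072-token context noted above, a deployment could look like the configuration fragment below. This is a sketch, not tested guidance: the model identifier, tensor-parallel degree, and quantization choice are assumptions you should adjust for your hardware and the model's published serving instructions.

```shell
# Hypothetical vLLM launch (model id, parallelism, and quantization are
# assumptions; a 424B MoE model requires a large multi-GPU node).
vllm serve baidu/ERNIE-4.5-VL-424B-A47B-PT \
    --trust-remote-code \
    --max-model-len 131072 \
    --tensor-parallel-size 8
```

Lowering `--max-model-len` reduces KV-cache memory if you do not need the full context window.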