ERNIE-4.5-VL-424B-A47B-Base-Paddle is a multimodal Mixture-of-Experts (MoE) model developed by Baidu, designed to understand and reason over both text and images. It uses a heterogeneous MoE architecture with modality-isolated routing and specialized loss functions so that each modality is learned effectively without interfering with the other. Pretrained on trillions of tokens, the model activates 47B of its 424B total parameters per token, balancing capacity against compute cost. Training follows a staged approach: the model is first trained on language, then extended to vision with additional modules such as a ViT encoder and visual experts. It supports ultra-long contexts of up to 131,072 tokens, enabling complex reasoning and long-form narrative generation. Built on the PaddlePaddle framework, it leverages FP8 mixed precision, hybrid parallelism, and quantization techniques for efficient training and inference.
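The modality-isolated routing described above can be illustrated with a small sketch: each token is routed only to experts of its own modality, so text and vision tokens never compete for the same expert capacity. This is a toy NumPy illustration under assumed names (`moe_route`, the `gate`/`w` expert fields), not ERNIE's actual implementation.

```python
import numpy as np

def moe_route(tokens, modality, text_experts, vision_experts, top_k=2):
    """Toy modality-isolated top-k MoE routing (illustrative only)."""
    outputs = []
    for x, m in zip(tokens, modality):
        # Modality isolation: pick the expert pool for this token's modality.
        experts = text_experts if m == "text" else vision_experts
        # Router: score each expert with a learned gating vector.
        scores = np.array([e["gate"] @ x for e in experts])
        top = np.argsort(scores)[-top_k:]  # indices of the top-k experts
        weights = np.exp(scores[top]) / np.exp(scores[top]).sum()  # softmax
        # Combine only the selected experts' outputs, weighted by the router.
        y = sum(w * (experts[i]["w"] @ x) for w, i in zip(weights, top))
        outputs.append(y)
    return np.stack(outputs)
```

With `top_k=2` out of, say, 64 experts per modality, only a small fraction of expert parameters is active for any given token, which is the mechanism behind the 47B-of-424B activation figure.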
Features
- 424B total parameters with 47B activated per token
- Trained for both language and visual understanding
- Multimodal heterogeneous MoE architecture
- Supports ultra-long context length (131,072 tokens)
- Includes modality-specific experts and visual adapters
- Trained using FP8 mixed precision and efficient pipeline scheduling
- Optimized for cross-modal reasoning and generation
- Built with PaddlePaddle for wide hardware compatibility