ERNIE-4.5-VL-28B-A3B-Base-PT is a large-scale multimodal Mixture-of-Experts (MoE) model developed by Baidu, with 28 billion total parameters of which about 3 billion are activated per token. It is pretrained on both text and image inputs, enabling image-to-text and conversational AI tasks. Training proceeds in stages: text-only pretraining first, followed by integration of the vision components (a ViT encoder, adapters, and dedicated visual experts) for robust cross-modal understanding. A heterogeneous MoE design, combined with advanced routing techniques and token-balancing strategies, maintains high efficiency and minimizes interference between modalities.

The model is built on PaddlePaddle and incorporates efficiency techniques such as intra-node expert parallelism, FP8 mixed precision, and 2-bit/4-bit quantization for efficient inference. In the repository name, "Base" indicates the pretrained checkpoint intended for further fine-tuning on downstream multimodal tasks, and the "PT" suffix denotes weights packaged for PyTorch-based inference. The model supports English and Chinese and is released under the Apache 2.0 license.
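The routing idea described above can be sketched in a few lines: each token is scored by a router, only the top-k experts of the token's own modality are activated, and their outputs are combined by renormalized gate weights. This is a minimal illustration, not Baidu's implementation; the expert counts, router, and top-k value here are toy assumptions.

```python
# Minimal top-k MoE routing sketch with separate text and vision expert
# pools (illustrative only; not ERNIE's actual configuration or code).
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route_token(logits, top_k=2):
    """Pick the top_k experts by router probability and renormalize gates."""
    probs = softmax(logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]

def moe_layer(token, modality, router, experts, top_k=2):
    """Dispatch one token only to the experts of its own modality."""
    logits = router[modality](token)
    out = 0.0
    for idx, gate in route_token(logits, top_k):
        out += gate * experts[modality][idx](token)
    return out

# Toy setup: 4 text experts and 4 vision experts, each just scaling its input.
experts = {
    "text":   [lambda x, k=k: (k + 1) * x for k in range(4)],
    "vision": [lambda x, k=k: (k + 1) * x for k in range(4)],
}
# Toy routers with fixed logits (a real router is a learned projection).
router = {
    "text":   lambda x: [0.1, 2.0, 0.1, 1.0],
    "vision": lambda x: [1.0, 0.1, 2.0, 0.1],
}
print(moe_layer(1.0, "text", router, experts))
```

Because only the selected experts run, the per-token compute scales with the activated parameter count (3B here) rather than the total (28B); the modality-specific pools keep text and vision tokens from interfering in the same experts.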
Features
- Pretrained multimodal MoE model with 28B total parameters
- 3B activated parameters per token for efficient inference
- Supports both text and vision inputs with 64 text and 64 vision experts
- Staged training for stable multimodal learning
- Long context window up to 131,072 tokens
- Built with PaddlePaddle; PT weights support PyTorch-based inference
- Includes visual experts and adapters for image processing
- Commercial-use friendly under Apache 2.0 license
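The low-bit quantization mentioned above generally works by storing weights as small integers plus a per-group scale. The sketch below shows group-wise symmetric 4-bit quantization as a general illustration of the idea; the group size and rounding scheme are assumptions, not ERNIE's actual quantization recipe.

```python
# Group-wise symmetric 4-bit quantization sketch (illustrative only).
# Each group of weights shares one float scale; values are stored as
# integers in [-7, 7] and recovered by multiplying back by the scale.

def quantize_4bit(weights, group_size=4):
    """Quantize weights to ints in [-7, 7] with one scale per group."""
    q, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        scale = max(abs(w) for w in group) / 7 or 1.0  # avoid zero scale
        scales.append(scale)
        q.extend(max(-7, min(7, round(w / scale))) for w in group)
    return q, scales

def dequantize_4bit(q, scales, group_size=4):
    """Reconstruct approximate float weights from ints and group scales."""
    return [v * scales[i // group_size] for i, v in enumerate(q)]

w = [0.12, -0.53, 0.07, 0.31, 1.4, -0.9, 0.05, -0.2]
q, s = quantize_4bit(w)
w_hat = dequantize_4bit(q, s)
```

Storing 4-bit integers instead of 16-bit floats shrinks weight memory by roughly 4x, at the cost of a small per-weight reconstruction error bounded by half the group scale.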