ERNIE-4.5-VL-28B-A3B-Base-Paddle is a multimodal Mixture-of-Experts (MoE) model designed to understand and generate content from both text and images. With 28 billion total parameters and 3 billion activated per token, it strikes a balance between performance and efficiency. Its heterogeneous MoE architecture uses modality-isolated routing and token-balanced losses to avoid cross-modality interference.

The model undergoes staged pretraining: it first focuses on textual understanding, then incorporates visual capabilities through Vision Transformers, adapters, and dedicated visual experts. It supports context lengths up to 131,072 tokens, making it suitable for long-form reasoning and extended image-text interactions.

Built on PaddlePaddle and pretrained on trillions of tokens, the model is optimized for conversational, generative, and reasoning tasks. It supports English and Chinese and is released under the Apache 2.0 license.
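Modality-isolated routing can be illustrated with a small toy sketch: text tokens consult only a text router and text experts, while vision tokens consult only a vision router and vision experts, so neither modality's gradients can disturb the other's routing statistics. This is an illustrative approximation, not the actual ERNIE 4.5 implementation; expert counts, top-k, and dimensions below are placeholder values.

```python
import numpy as np

# Toy sketch of modality-isolated MoE routing (illustrative only; the real
# model uses 64 experts per modality plus shared experts and learned routers).
rng = np.random.default_rng(0)

N_TEXT_EXPERTS = 4
N_VISION_EXPERTS = 4
TOP_K = 2          # experts activated per token (assumed for this sketch)
D_MODEL = 8

def route(tokens, router_weights, top_k=TOP_K):
    """Return (expert indices, gate weights) for the top-k experts per token."""
    logits = tokens @ router_weights                  # (n_tokens, n_experts)
    top = np.argsort(-logits, axis=-1)[:, :top_k]     # top-k expert ids
    picked = np.take_along_axis(logits, top, axis=-1)
    gates = np.exp(picked) / np.exp(picked).sum(-1, keepdims=True)
    return top, gates

# Each modality has its own router matrix and its own expert pool.
text_router = rng.normal(size=(D_MODEL, N_TEXT_EXPERTS))
vision_router = rng.normal(size=(D_MODEL, N_VISION_EXPERTS))

text_tokens = rng.normal(size=(5, D_MODEL))
vision_tokens = rng.normal(size=(3, D_MODEL))

# Modality-isolated routing: tokens never cross into the other modality's pool.
text_idx, text_gates = route(text_tokens, text_router)
vis_idx, vis_gates = route(vision_tokens, vision_router)

assert text_idx.max() < N_TEXT_EXPERTS   # text lands only on text experts
assert vis_idx.max() < N_VISION_EXPERTS  # vision lands only on vision experts
```

Activating only `TOP_K` experts per token is what keeps the per-token compute (3B activated parameters) far below the total parameter count (28B).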
## Features
- Multimodal support for text and vision tasks
- 28B total parameters with 3B activated per token
- 64 text and 64 vision experts with 2 shared experts
- Staged training with dedicated visual and textual phases
- Long context window up to 131,072 tokens
- Supports English and Chinese
- Built on PaddlePaddle with scalable inference support
- Released under Apache 2.0 for commercial use
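The token-balanced loss mentioned above can be sketched as a standard load-balancing auxiliary term in the style of Switch Transformer; the exact formulation used by ERNIE 4.5 may differ, and all names below are placeholders.

```python
import numpy as np

def load_balance_loss(router_probs, expert_indices, n_experts):
    """Auxiliary loss encouraging tokens to spread evenly across experts.

    router_probs:   (n_tokens, n_experts) softmax router probabilities
    expert_indices: (n_tokens,) top-1 expert assignment per token
    """
    # f_i: fraction of tokens dispatched to expert i
    counts = np.bincount(expert_indices, minlength=n_experts)
    f = counts / len(expert_indices)
    # p_i: mean router probability mass assigned to expert i
    p = router_probs.mean(axis=0)
    # Minimized (value 1.0) when both distributions are uniform.
    return n_experts * float(np.dot(f, p))

rng = np.random.default_rng(0)
n_tokens, n_experts = 1024, 8
logits = rng.normal(size=(n_tokens, n_experts))
probs = np.exp(logits) / np.exp(logits).sum(-1, keepdims=True)
loss = load_balance_loss(probs, probs.argmax(-1), n_experts)
```

Adding this term to the training objective penalizes routers that collapse onto a few experts, which matters doubly in a multimodal MoE where an imbalanced modality could otherwise starve the other's experts.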