ERNIE-4.5-VL-28B-A3B-Paddle is a multimodal Mixture-of-Experts (MoE) chat model for complex image-text tasks, with 28 billion total parameters and 3 billion activated per token. Built on PaddlePaddle, it targets visual question answering, image description, and multimodal reasoning, and its heterogeneous MoE architecture supports both thinking and non-thinking inference modes. Post-training combines supervised fine-tuning (SFT), DPO, UPO, and Reinforcement Learning with Verifiable Rewards (RLVR) to improve alignment and performance. The model supports long contexts of up to 131,072 tokens and can be deployed with FastDeploy or the Hugging Face Transformers library, making it well suited to developers who need high-performance, scalable multimodal capabilities in chat or image-based reasoning systems.
Features
- 28B parameter multimodal MoE with 3B active per token
- Handles image-text chat, reasoning, and description tasks
- Supports thinking and non-thinking inference modes
- Uses RLVR, SFT, DPO, and UPO for robust posttraining
- PaddlePaddle-based for optimized performance and deployment
- FastDeploy-ready with GPU-efficient quantization support
- Long context support up to 131,072 tokens
- Transformers-compatible with Python inference examples
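As a concrete illustration of multimodal chat input, the sketch below builds a request payload mixing an image and a text question. This is a minimal, hypothetical example assuming an OpenAI-style messages format (the shape commonly used by OpenAI-compatible serving endpoints such as FastDeploy's); the helper name `build_messages` and the exact payload fields are assumptions, not part of the official API, so consult the model's deployment documentation for the authoritative schema.

```python
def build_messages(image_url: str, question: str) -> list[dict]:
    """Build a single-turn multimodal chat message list.

    Hypothetical sketch: pairs an image reference with a text prompt
    in the OpenAI-style "content parts" format often accepted by
    OpenAI-compatible multimodal serving endpoints.
    """
    return [
        {
            "role": "user",
            "content": [
                # Image part: referenced by URL (a local path or base64
                # data URI may also be accepted, depending on the server).
                {"type": "image_url", "image_url": {"url": image_url}},
                # Text part: the question about the image.
                {"type": "text", "text": question},
            ],
        }
    ]


messages = build_messages(
    "https://example.com/cat.png",
    "What is in this image?",
)
```

The resulting `messages` list would then be passed to the serving endpoint's chat-completions call; whether the model runs in thinking or non-thinking mode is typically controlled by a separate request or template option, whose name depends on the deployment stack.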