ERNIE-4.5-300B-A47B-Base-Paddle is a large language model from Baidu built on a 300B-parameter Mixture-of-Experts (MoE) architecture that activates 47B parameters per token. Part of the ERNIE 4.5 series, it uses a heterogeneous MoE structure to balance quality and efficiency and is optimized for text generation and reasoning. Training proceeded in stages, starting with language understanding before expanding to vision capabilities; this variant is text-only. Built with PaddlePaddle, it supports infrastructure features such as FP8 mixed-precision training, hybrid parallelism, and 4-bit/2-bit quantization for scalable deployment, and it handles long-context tasks with a maximum sequence length of 131,072 tokens. ERNIEKit enables fine-tuning via SFT, LoRA, or DPO, while FastDeploy and Transformers provide flexible deployment options across environments.
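As a rough illustration of the Transformers deployment path, the sketch below shows standard Hugging Face text generation. It is a minimal sketch, not a verified recipe: the repo id is assumed to point at a Transformers-compatible ERNIE 4.5 checkpoint (the Paddle-format weights in this repo may instead require the PaddlePaddle/FastDeploy stack), and `trust_remote_code` is an assumption about how the checkpoint is published.

```python
# Minimal text-generation sketch using the Transformers API.
# Assumptions: the repo id below is illustrative and the checkpoint is
# Transformers-compatible; the Paddle-format weights may require FastDeploy instead.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "baidu/ERNIE-4.5-300B-A47B-Base-PT"  # assumed Transformers-compatible variant

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",   # shard across available GPUs
    torch_dtype="auto",  # keep the checkpoint's native precision
    trust_remote_code=True,
)

prompt = "Large language models are"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```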
Features
- 300B total parameters, 47B activated per token
- Designed for high-performance text generation and reasoning
- Heterogeneous Mixture-of-Experts (MoE) structure (illustrated in the sketch after this list)
- Supports long-context processing (up to 131,072 tokens)
- Trained with FP8 mixed precision and hybrid parallelism
- Compatible with ERNIEKit for SFT, LoRA, and DPO fine-tuning
- Deployable via FastDeploy and Transformers libraries
- Built on PaddlePaddle with multi-GPU and quantization support
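To make the "47B activated out of 300B total" idea concrete, here is a toy, framework-agnostic sketch of top-k expert routing, the general mechanism by which an MoE layer sends each token to a small subset of experts. The expert count, k, and dimensions are illustrative only, and ERNIE 4.5's heterogeneous MoE routing is more involved than this toy version.

```python
# Toy illustration of top-k MoE routing: each token is dispatched to only a few
# experts, so only a fraction of the total parameters is active per token.
# Numbers (64 experts, k=2, d=16) are illustrative, not ERNIE 4.5's real config.
import numpy as np

rng = np.random.default_rng(0)
num_experts, top_k, d_model = 64, 2, 16

# Router: a linear map from the token representation to one logit per expert.
router_w = rng.standard_normal((d_model, num_experts))

# Each "expert" here is just a small feed-forward weight matrix.
experts = rng.standard_normal((num_experts, d_model, d_model))

def moe_layer(token: np.ndarray) -> np.ndarray:
    logits = token @ router_w                                 # (num_experts,)
    top = np.argsort(logits)[-top_k:]                         # indices of the k winning experts
    gates = np.exp(logits[top]) / np.exp(logits[top]).sum()   # softmax over the winners
    # Only the selected experts run; the other experts stay idle for this token.
    return sum(g * (token @ experts[i]) for g, i in zip(gates, top))

out = moe_layer(rng.standard_normal(d_model))
print(out.shape)  # (16,)
```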