ERNIE-4.5-300B-A47B-W4A8C8-TP4-Paddle download

ERNIE-4.5-300B-A47B-W4A8C8-TP4-Paddle is a 300B-parameter Mixture-of-Experts (MoE) language model by Baidu, optimized with 4-bit weights and 8-bit activations for highly efficient inference. This quantized variant significantly reduces memory requirements while preserving output quality, enabling deployment on systems with limited GPU capacity. The model activates 47 billion parameters per token and is trained for high-performance text generation, supporting both Chinese and English. It leverages PaddlePaddle with TP4 (tensor parallelism across 4 GPUs), fine-grained scheduling, and expert parallelism for scalable, modular performance. The model includes long context support up to 131,072 tokens and integrates easily with FastDeploy for real-time applications. Like other ERNIE 4.5 variants, it was trained using supervised fine-tuning (SFT), DPO, and UPO to align with complex reasoning and generative tasks.

Features

4-bit weights and 8-bit activations for optimized efficiency
300B parameters with 47B active per token
Tensor parallelism across 4 GPUs (TP4 configuration)
Built on PaddlePaddle with FastDeploy support
Context window up to 131,072 tokens
Multilingual support (English and Chinese)
Pretrained and post-trained for advanced language generation
Open-source under Apache 2.0 license

Project Samples

ERNIE-4.5-300B-A47B-W4A8C8-TP4-Paddle Screenshot 1

Project Activity

See All Activity >

Follow ERNIE-4.5-300B-A47B-W4A8C8-TP4-Paddle

ERNIE-4.5-300B-A47B-W4A8C8-TP4-Paddle Web Site

Other Useful Business Software

Picsart Enterprise Background Removal API for Stunning eCommerce Visuals

Instantly remove the background from your images in just one click.

With our Remove Background API tool, you can access the transformative capabilities of automation , which will allow you to turn any photo asset into compelling product imagery. With elevated visuals quality on your digital platforms, you can captivate your audience, and therefore achieve higher engagement and sales.

Learn More

Rate This Project

User Reviews

Be the first to post a review of ERNIE-4.5-300B-A47B-W4A8C8-TP4-Paddle!

Additional Project Details

Registered

2025-06-30

Similar Business Software

DeepSeek-V2

DeepSeek-V2 is a state-of-the-art Mixture-of-Experts (MoE) language model introduced by DeepSeek-AI, characterized by its economical training and efficient inference capabilities. With a total of 236 billion parameters, of which only 21 billion are active per token, it supports a context length...

See Software
Kimi K2

Kimi K2 is a state-of-the-art open source large language model series built on a mixture-of-experts (MoE) architecture, featuring 1 trillion total parameters and 32 billion activated parameters for task-specific efficiency. Trained with the Muon optimizer on over 15.5 trillion tokens and...

See Software
DeepSeek-Coder-V2

DeepSeek-Coder-V2 is an open source code language model designed to excel in programming and mathematical reasoning tasks. It features a Mixture-of-Experts (MoE) architecture with 236 billion total parameters and 21 billion activated parameters per token, enabling efficient processing and high...

See Software