ERNIE-4.5-300B-A47B-2Bits-Paddle is a 2-bit quantized variant of Baidu's 300B-parameter Mixture-of-Experts (MoE) language model, designed for ultra-low-resource inference. Quantization compresses the weights rather than the architecture: the MoE router still activates roughly 47 billion parameters per token, and the model supports high-quality text generation in both English and Chinese. Built with PaddlePaddle, it combines 2-bit weight-only quantization (WINT2) with expert-parallel collaboration and load balancing, allowing the full model to be deployed on a single 141GB GPU while keeping quality effectively lossless relative to the full-precision model. The model supports a context length of up to 131,072 tokens and integrates with FastDeploy for quick service setup. Like other ERNIE 4.5 models, it benefits from large-scale pretraining followed by modality-specific post-training with SFT, DPO, and UPO. It is especially suited to applications that need high throughput and low latency on limited hardware; the recommended sampling settings are temperature 0.8 and top-p 0.8.
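As a concrete starting point, below is a minimal serving sketch. It assumes FastDeploy exposes an OpenAI-compatible API server module named `fastdeploy.entrypoints.openai.api_server`; the port and serving flags shown are illustrative placeholders and should be checked against the FastDeploy documentation for your version.

```python
# A deployment sketch, assuming FastDeploy's OpenAI-compatible API server
# module; the port and flags below are illustrative, not an authoritative
# configuration.
import subprocess

subprocess.run([
    "python", "-m", "fastdeploy.entrypoints.openai.api_server",
    "--model", "baidu/ERNIE-4.5-300B-A47B-2Bits-Paddle",
    "--port", "8180",             # HTTP port for the OpenAI-compatible API
    "--max-model-len", "131072",  # up to the advertised context window
    "--max-num-seqs", "32",       # concurrent sequences; tune to your GPU
])
```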
## Features
- 2-bit quantized weights for minimal memory usage
- 300B total parameters with 47B active per token
- Supports deployment on a single 141GB GPU
- Long context window of up to 131,072 tokens
- Expert parallelism and load balancing for scalable performance
- Multilingual text generation (English and Chinese)
- Integrated with FastDeploy for quick inference setup (see the client sketch after this list)
- Open-source under Apache 2.0 license
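Once the server is running, any OpenAI-compatible client can query it. The sketch below uses the `openai` Python package with the recommended sampling settings; the base URL and port are placeholders matching the serving sketch above.

```python
# A minimal client sketch against the OpenAI-compatible endpoint started
# above; the base URL, port, and API key are placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8180/v1",
    api_key="EMPTY",  # locally hosted servers typically ignore the key
)

response = client.chat.completions.create(
    model="baidu/ERNIE-4.5-300B-A47B-2Bits-Paddle",
    messages=[{"role": "user", "content": "用一句话介绍文心大模型。"}],
    temperature=0.8,  # recommended sampling settings for this model
    top_p=0.8,
)
print(response.choices[0].message.content)
```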