ERNIE-4.5-300B-A47B-Base-PT is a post-trained variant of Baidu’s large-scale text-only MoE model, with 300 billion total parameters and 47 billion activated per token. It builds on the pretrained ERNIE 4.5 foundation and is optimized for natural language understanding and generation. The model supports fine-tuning via SFT, LoRA, and DPO through the ERNIEKit training toolkit, and it works with both PaddlePaddle and the Hugging Face Transformers library, keeping deployment and customization flexible. The architecture maintains scalability and efficiency through heterogeneous expert routing, FP8 precision, and quantized inference down to 2 bits. With a context length of 131,072 tokens, it is designed for long-form generation and reasoning tasks. This post-trained version targets developers who need reliable LLM performance with high adaptability to real-world workloads.
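As a quick orientation, the snippet below loads the model through the Hugging Face Transformers library and runs a short generation. It is a minimal sketch, not an official quickstart: the repository id `baidu/ERNIE-4.5-300B-A47B-Base-PT` and the generation settings are assumptions, and a model of this size would in practice be sharded across many GPUs (`device_map="auto"` delegates placement to Accelerate).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hugging Face repository id; verify against the actual model hub entry.
model_name = "baidu/ERNIE-4.5-300B-A47B-Base-PT"

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",      # let Accelerate shard the 300B weights across devices
    trust_remote_code=True,
)

# Plain text completion; the 131,072-token context leaves ample room for long prompts.
inputs = tokenizer("Large language models are", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```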
## Features
- 300B parameters with 47B activated per token
- Post-trained for stronger language understanding and generation
- Supports LoRA, SFT, and DPO fine-tuning via ERNIEKit
- Long context support up to 131,072 tokens
- Ready for FP8 and low-bit quantized (4-bit/2-bit) inference
- Built for PaddlePaddle and compatible with Transformers
- Supports vLLM serving with multi-GPU setups (see the sketch after this list)
- Optimized for instruction-following and dialogue tasks
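To illustrate the multi-GPU serving path mentioned above, the sketch below uses vLLM’s offline Python API. It rests on assumptions: the same hypothetical repository id as before, and `tensor_parallel_size=8` presuming an eight-GPU node; a 300B-parameter MoE may in practice require more aggressive parallelism or quantization to fit.

```python
from vllm import LLM, SamplingParams

# Assumed repository id and an 8-way tensor-parallel layout across one node.
llm = LLM(
    model="baidu/ERNIE-4.5-300B-A47B-Base-PT",
    tensor_parallel_size=8,   # shard the weights across 8 GPUs
    trust_remote_code=True,
)

sampling = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=256)
outputs = llm.generate(["Summarize the benefits of mixture-of-experts models."], sampling)
print(outputs[0].outputs[0].text)
```

For production use, the same model can typically be exposed over vLLM’s OpenAI-compatible HTTP server instead of the offline API; the offline form is shown here only because it makes the parallelism setting explicit.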