ERNIE-4.5-0.3B-Base-PT is a compact, fully dense transformer with 360 million parameters, built for general-purpose text generation. Part of Baidu's ERNIE 4.5 series, it applies the family's pretraining recipe without a Mixture-of-Experts (MoE) structure. The model has 18 transformer layers, 16 attention heads, and a maximum context length of 131,072 tokens, giving strong language understanding for its size. It can be fine-tuned with ERNIEKit using SFT, LoRA, or DPO, works with the Hugging Face Transformers library for inference in Python, and can be served with FastDeploy. Because it is small, it runs on modest hardware, making it a practical choice for prototyping, educational use, and lightweight production workloads.
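A minimal inference sketch with Hugging Face Transformers is shown below. The repository id, prompt, and generation settings are illustrative and may need adjusting; `trust_remote_code=True` is assumed to be required for the custom ERNIE architecture.

```python
# Minimal text-generation sketch with Hugging Face Transformers.
# Assumptions: the checkpoint is published as "baidu/ERNIE-4.5-0.3B-Base-PT"
# and ships custom modeling code (hence trust_remote_code=True); the prompt
# and generation settings are placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "baidu/ERNIE-4.5-0.3B-Base-PT"

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

# Base (non-chat) model: feed a plain prompt, no chat template.
prompt = "Large language models are"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

output_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```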
Features
- 360M parameters with 18 transformer layers
- Dense architecture (non-MoE) for streamlined inference
- 131,072 token context window
- Optimized for English and Chinese text generation
- Fine-tuning supported via ERNIEKit (SFT, DPO, LoRA)
- Hugging Face Transformers and FastDeploy compatibility
- Python examples included: a Transformers inference sketch above and a FastDeploy client sketch below
- Apache 2.0 license with commercial-use permissions
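For serving, FastDeploy can expose an OpenAI-compatible HTTP endpoint; the sketch below queries such an endpoint with the official `openai` Python client. The base URL, port, and served model name are assumptions for illustration, and whether a base model is reached through the completions or chat endpoint depends on how the FastDeploy server is configured.

```python
# Querying a locally running FastDeploy server through its OpenAI-compatible API.
# Assumptions: the server is already started and listening on localhost:8180,
# and it registers the model under the name used below; adjust both to match
# your deployment.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8180/v1", api_key="not-needed")

response = client.completions.create(
    model="baidu/ERNIE-4.5-0.3B-Base-PT",
    prompt="Large language models are",
    max_tokens=64,
)
print(response.choices[0].text)
```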