Qwen2.5-1.5B-Instruct is an instruction-tuned variant of the Qwen2.5 language model with 1.54 billion parameters, built for text generation and conversational tasks. This release is intended for the Gensyn RL Swarm system, which coordinates decentralized reinforcement-learning fine-tuning over peer-to-peer networks; once fine-tuned, it also drops into standard inference and chat workflows.

Architecturally, the model uses rotary positional embeddings (RoPE), SwiGLU activation, RMSNorm, attention QKV bias, and tied word embeddings. It has 28 transformer layers and grouped-query attention (GQA) with 12 query heads and 2 key-value heads, supports a context window of up to 32,768 input tokens, and can generate up to 8,192 tokens per response. Weights are distributed in BF16 as Safetensors. The base model is Qwen2.5-1.5B; this version adds tuning for instruction following and dialogue.
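For local use, the model loads with the standard Hugging Face `transformers` text-generation workflow. The snippet below is a minimal sketch, assuming the checkpoint is published under the repo id `Gensyn/Qwen2.5-1.5B-Instruct`; substitute your own fine-tuned checkpoint path if it differs.

```python
# Minimal local-inference sketch using Hugging Face transformers.
# The repo id below is an assumption; point it at your own checkpoint if needed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Gensyn/Qwen2.5-1.5B-Instruct"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # the model ships as BF16 Safetensors
    device_map="auto",           # requires the `accelerate` package
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain grouped-query attention in one sentence."},
]

# Apply the model's built-in chat template, then generate a reply.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

On CPU-only machines, drop `device_map="auto"` and the BF16 dtype hint.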
## Features
- Instruction-tuned for chat and task-oriented dialogue
- 1.54B total parameters with 1.31B non-embedding parameters
- Uses rotary positional embeddings (RoPE) and SwiGLU activation
- Includes RMSNorm and attention QKV bias
- 28 transformer layers with grouped-query attention (GQA)
- 32K-token context length for input, 8K-token generation length
- Compatible with Gensyn RL Swarm for decentralized RL fine-tuning
- Ready for use with Featherless AI inference or local deployment (see the hosted-inference sketch after this list)
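For hosted inference, Featherless AI exposes an OpenAI-compatible API, so the official `openai` Python client can talk to it directly. The base URL, model id, and environment-variable name below are assumptions; verify them against the Featherless documentation before relying on them.

```python
# Hedged sketch of remote inference via Featherless AI's OpenAI-compatible API.
# Base URL, model id, and FEATHERLESS_API_KEY are assumptions; check the
# Featherless docs for the exact values.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",     # assumed endpoint
    api_key=os.environ["FEATHERLESS_API_KEY"],    # assumed env var name
)

response = client.chat.completions.create(
    model="Gensyn/Qwen2.5-1.5B-Instruct",         # assumed model id
    messages=[{"role": "user", "content": "Summarize what RL Swarm does."}],
    max_tokens=256,
)
print(response.choices[0].message.content)
```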