DeepSeed Files

Deep learning optimization library making distributed training easy

This is an exact mirror of the DeepSeed project, hosted at https://github.com/microsoft/DeepSpeed. SourceForge is not affiliated with DeepSeed. For more information, see the SourceForge Open Source Mirror Directory.

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
README.md	2025-10-23	1.5 kB	0
v0.18.1 Patch Release source code.tar.gz	2025-10-23	215.2 MB	0
v0.18.1 Patch Release source code.zip	2025-10-23	216.3 MB	8
Totals: 3 Items		431.5 MB	8

What's Changed

Add ZenFlow code for Stage 3 by @JoshWoo2003 in https://github.com/deepspeedai/DeepSpeed/pull/7516
[XPU][CI] recover xpu-max1100 workflow by @Liangliang-Ma in https://github.com/deepspeedai/DeepSpeed/pull/7630
Take **kwargs in init of DeepSpeedZeroOptimizer subclasses by @eternalNight in https://github.com/deepspeedai/DeepSpeed/pull/7634
add support for tensor learning rate (vs scalar) by @NirSonnenschein in https://github.com/deepspeedai/DeepSpeed/pull/7633
Fix illegal memory access with multi_tensor_apply size above INT_MAX by @wangyan-mms in https://github.com/deepspeedai/DeepSpeed/pull/7639
No Muon optimizer for embeding and lm_head layer by @delock in https://github.com/deepspeedai/DeepSpeed/pull/7641
z2: report param name and not zero id in assert by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7637
z2: don't pass dtype to report_ipg_memory_usage by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7636
Ulysses HF Accelerate integration by @stas00 in https://github.com/deepspeedai/DeepSpeed/pull/7638
Add DataStates-LLM: Asynchronous Checkpointing Engine Support by @mauryaavinash95 in https://github.com/deepspeedai/DeepSpeed/pull/7166

New Contributors

@JoshWoo2003 made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7516
@wangyan-mms made their first contribution in https://github.com/deepspeedai/DeepSpeed/pull/7639

Full Changelog: https://github.com/deepspeedai/DeepSpeed/compare/v0.18.0...v0.18.1

Source: README.md, updated 2025-10-23

Other Useful Business Software

Gen AI apps are built with MongoDB Atlas Icon

Gen AI apps are built with MongoDB Atlas

The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.

Start Free

Build Securely on Azure with Proven Frameworks Icon

Build Securely on Azure with Proven Frameworks

Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now

Recommended Projects

DeepSpeed
DeepSpeed is an easy-to-use deep learning optimization software suite that enables unprecedented scale and speed for Deep Learning Training and Inference. With DeepSpeed you can: 1. Train/Inference dense or sparse models with billions or trillions of parameters 2. Achieve excellent system...
Megatron
Megatron is a large, powerful transformer developed by the Applied Deep Learning Research team at NVIDIA. This repository is for ongoing research on training large transformer language models at scale. We developed efficient, model-parallel (tensor, sequence, and pipeline), and multi-node...
GPT-NeoX
This repository records EleutherAI's library for training large-scale language models on GPUs. Our current framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. We aim to make this repo a centralized and...
Ray
Modern workloads like deep learning and hyperparameter tuning are compute-intensive and require distributed or parallel execution. Ray makes it effortless to parallelize single machine code — go from a single CPU to multi-core, multi-GPU or multi-node with minimal code changes. Accelerate your...
SageMaker Python SDK
SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the SDK, you can train and deploy models using popular deep learning frameworks Apache MXNet and TensorFlow. You can also train and deploy models with Amazon algorithms,...