DeepSpeed

DeepSpeed

Microsoft
+
+

Related Products

  • Vertex AI
    783 Ratings
    Visit Website
  • RunPod
    205 Ratings
    Visit Website
  • Cloudflare
    1,903 Ratings
    Visit Website
  • Qloo
    23 Ratings
    Visit Website
  • Dragonfly
    16 Ratings
    Visit Website
  • Fraud.net
    56 Ratings
    Visit Website
  • OORT DataHub
    13 Ratings
    Visit Website
  • PackageX OCR Scanning
    46 Ratings
    Visit Website
  • Nexo
    16,471 Ratings
    Visit Website
  • Evertune
    1 Rating
    Visit Website

About

DeepSpeed is an open source deep learning optimization library for PyTorch. It's designed to reduce computing power and memory use, and to train large distributed models with better parallelism on existing computer hardware. DeepSpeed is optimized for low latency, high throughput training. DeepSpeed can train DL models with over a hundred billion parameters on the current generation of GPU clusters. It can also train up to 13 billion parameters in a single GPU. DeepSpeed is developed by Microsoft and aims to offer distributed training for large-scale models. It's built on top of PyTorch, which specializes in data parallelism.

About

GPUs bring data in and out quickly, but have little locality of reference because of their small caches. They are geared towards applying a lot of compute to little data, not little compute to a lot of data. The networks designed to run on them therefore execute full layer after full layer in order to saturate their computational pipeline (see Figure 1 below). In order to deal with large models, given their small memory size (tens of gigabytes), GPUs are grouped together and models are distributed across them, creating a complex and painful software stack, complicated by the need to deal with many levels of communication and synchronization among separate machines. CPUs, on the other hand, have large, much faster caches than GPUs, and have an abundance of memory (terabytes). A typical CPU server can have memory equivalent to tens or even hundreds of GPUs. CPUs are perfect for a brain-like ML world in which parts of an extremely large network are executed piecemeal, as needed.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Deep learning model developers

Audience

Companies doing AI and ML development

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Microsoft
Founded: 1975
United States
www.deepspeed.ai/

Company Information

Neural Magic
Founded: 2018
United States
neuralmagic.com

Alternatives

Alternatives

GPT-NeoX

GPT-NeoX

EleutherAI
Neural Designer

Neural Designer

Artelnics
AWS Neuron

AWS Neuron

Amazon Web Services

Categories

Categories

Integrations

Axolotl
Cake AI
Comet LLM
Nurix
PyTorch
Python
Ultralytics

Integrations

Axolotl
Cake AI
Comet LLM
Nurix
PyTorch
Python
Ultralytics
Claim DeepSpeed and update features and information
Claim DeepSpeed and update features and information
Claim Neural Magic and update features and information
Claim Neural Magic and update features and information