DeepSpeed
Original authorMicrosoft Research
DeveloperMicrosoft
Initial releaseMay 18, 2020; 6 years ago (2020-05-18)
Stable release
v0.18.9 / March 30, 2026; 2 months ago (2026-03-30)
Written inPython, CUDA, C++
TypeSoftware library
LicenseApache License 2.0
Websitedeepspeed.ai
Repositorygithub.com/microsoft/DeepSpeed

DeepSpeed is an open source deep learning optimization library for PyTorch.[1]

Library

edit

The library is designed to reduce computing power and memory use and to train large distributed models with better parallelism on existing computer hardware.[2][3] DeepSpeed is optimized for low latency, high throughput training. It includes the Zero Redundancy Optimizer (ZeRO) for training models with 1 trillion or more parameters.[4] Features include mixed precision training, single-GPU, multi-GPU, and multi-node training as well as custom model parallelism. The DeepSpeed source code is licensed under MIT License and available on GitHub.[5]

The team claimed to achieve up to a 6.2x throughput improvement, 2.8x faster convergence, and 4.6x less communication.[6]

See also

edit

References

edit
  1. ^ "Microsoft Updates Windows, Azure Tools with an Eye on The Future". PCMag UK. May 22, 2020.
  2. ^ Yegulalp, Serdar (February 10, 2020). "Microsoft speeds up PyTorch with DeepSpeed". InfoWorld.
  3. ^ "Microsoft unveils "fifth most powerful" supercomputer in the world". Neowin. 18 June 2023.
  4. ^ "Microsoft trains world's largest Transformer language model". February 10, 2020.
  5. ^ "microsoft/DeepSpeed". July 10, 2020 – via GitHub.
  6. ^ "DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression". Microsoft Research. 2021-05-24. Retrieved 2021-06-19.

Further reading

edit
  • Rajbhandari, Samyam; Rasley, Jeff; Ruwase, Olatunji; He, Yuxiong (2019). "ZeRO: Memory Optimization Towards Training A Trillion Parameter Models". arXiv:1910.02054 [cs.LG].
edit

📚 Artikel Terkait di Wikipedia

Python (programming language)

Soup Biopython Chainer CatBoost Cheetah Construct Cubes CuPy Dask DEAP DeepSpeed Enthought Genshi Gensim graph-tool Horovod Imaging Library IPython JAX

Machine learning

Apache SINGA Spark MLlib Apache SystemDS Caffe CatBoost Deeplearning4j DeepSpeed Dlib ELKI Flux.jl Gensim Google JAX H2O Infer.NET JASP Jubatus Keras Kubeflow

List of large language models

arXiv:2304.03208 [cs.LG]. Alvi, Ali; Kharya, Paresh (11 October 2021). "Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World's Largest and

Microsoft Research

Distributed–grid computing BitVault Confidential Consortium Framework DeepSpeed Orleans Internet, networking AjaxView Avalanche Conference XP Gazelle

Flask (web framework)

Soup Biopython Chainer CatBoost Cheetah Construct Cubes CuPy Dask DEAP DeepSpeed Enthought Genshi Gensim graph-tool Horovod Imaging Library IPython JAX

Anaconda (Python distribution)

Soup Biopython Chainer CatBoost Cheetah Construct Cubes CuPy Dask DEAP DeepSpeed Enthought Genshi Gensim graph-tool Horovod Imaging Library IPython JAX

PyTorch

portal Comparison of deep learning software Differentiable programming DeepSpeed Open-source artificial intelligence PyTorch Lightning Chintala, Soumith

List of Python software

Deeplearning4j — open-source deep learning library for the Java virtual machine. DeepSpeed — deep learning optimization library. Dlib — software library with machine