✨Talk: Pipeline Model Parallelism

February 12, 2021 · One min read

ML Engineer, Microsoft

Abstract. Models are getting increasingly large, to the point that they don't always fit on a single device! We discuss some techniques to partition models over multiple devices, from plain PyTorch to libraries like deepspeed.

😴 Lazy blog - just a link to the talk's 📋PDF.