Abstract. Models are getting increasingly large, to the point that they don't always fit on a single device! We discuss some techniques to partition models over multiple devices, from plain PyTorch to libraries like deepspeed.
😴 Lazy blog - just a link to the talk's 📋PDF.