Abstract. In this talk I give an overview of two important transformer models in NLP.
5 posts tagged with "nlp"
✨Talk: Introduction to PyTorch Lightning
Abstract. A hands-on guide to using PyTorch Lightning. Concretely, we grab the GPT-2 model from Hugging Face and build a LightningModule to train it (more or less from scratch).
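To give a flavour of the approach, here is a minimal sketch (not the talk's exact code) of wrapping Hugging Face's GPT-2 in a LightningModule so that Lightning handles the training loop, devices, and logging. Names like `GPT2Lightning` and the batch keys are illustrative assumptions:

```python
import pytorch_lightning as pl
import torch
from transformers import GPT2Config, GPT2LMHeadModel

class GPT2Lightning(pl.LightningModule):
    def __init__(self, lr: float = 5e-5):
        super().__init__()
        # A fresh config means training "more or less from scratch";
        # swap in GPT2LMHeadModel.from_pretrained("gpt2") to fine-tune instead.
        self.model = GPT2LMHeadModel(GPT2Config())
        self.lr = lr

    def training_step(self, batch, batch_idx):
        # Assumes batches are dicts of token tensors; the model computes
        # the language-modeling loss itself when `labels` are provided.
        out = self.model(input_ids=batch["input_ids"],
                         attention_mask=batch["attention_mask"],
                         labels=batch["input_ids"])
        self.log("train_loss", out.loss)
        return out.loss

    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters(), lr=self.lr)

# trainer = pl.Trainer(max_epochs=1)
# trainer.fit(GPT2Lightning(), train_dataloaders=dataloader)  # dataloader: your tokenized corpus
```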
✨Talk: Pipeline Model Parallelism
Abstract. Models are getting increasingly large, to the point that they don't always fit on a single device! We discuss techniques for partitioning models across multiple devices, from plain PyTorch to libraries like DeepSpeed.
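As a toy illustration of the "plain PyTorch" starting point (a hypothetical two-layer-stack model, and it assumes two CUDA devices): split the layers into stages on different GPUs and move activations between them. True pipeline parallelism additionally splits each batch into micro-batches so the stages work concurrently, which is what libraries such as DeepSpeed automate:

```python
import torch
import torch.nn as nn

class TwoStageModel(nn.Module):
    def __init__(self, d: int = 1024, layers_per_stage: int = 4):
        super().__init__()
        def stage():
            return nn.Sequential(
                *[nn.Sequential(nn.Linear(d, d), nn.ReLU())
                  for _ in range(layers_per_stage)]
            )
        self.stage0 = stage().to("cuda:0")  # first half of the layers
        self.stage1 = stage().to("cuda:1")  # second half

    def forward(self, x):
        x = self.stage0(x.to("cuda:0"))
        return self.stage1(x.to("cuda:1"))  # activations hop devices here

model = TwoStageModel()
out = model(torch.randn(8, 1024))  # output lives on cuda:1
```

Note the drawback this naive version has: while stage1 runs, stage0 sits idle (and vice versa); pipelining with micro-batches is precisely what recovers that lost utilization.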
✨Talk: GPT-1 and GPT-2 Review
Abstract. In this talk we'll review the GPT-1 paper, *Improving Language Understanding by Generative Pre-Training* (Radford et al.). By way of setting the stage, we'll give a brief review of the Transformer architecture.
Note: We didn't have time to cover GPT-2 in this talk, but some slides on the topic made it into the deck.
✨Talk: Introduction to 🤗Hugging Face
Abstract. In this talk we'll review the 🤗Hugging Face Transformers library. This is an open-source library whose stated goal is to "democratize NLP". We'll briefly review some background, explain which problems Hugging Face is trying to address, and cover some of the tools and techniques it provides. We will not assume any familiarity with transformers.
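For a quick taste of the library (assuming `pip install transformers`), the `pipeline` API hides tokenization, the model forward pass, and decoding behind a single call; the printed outputs below are illustrative, not exact:

```python
from transformers import pipeline

# Sentiment analysis with the task's default pretrained model.
classifier = pipeline("sentiment-analysis")
print(classifier("Hugging Face makes transformers easy to use."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]

# Text generation with GPT-2.
generator = pipeline("text-generation", model="gpt2")
print(generator("Transformers are", max_length=20)[0]["generated_text"])
```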