Skip to main content

5 posts tagged with "nlp"

View All Tags

· One min read
Amin Saied

Abstract. Models are getting increasingly large, to the point that they don't always fit on a single device! We discuss some techniques to partition models over multiple devices, from plain PyTorch to libraries like deepspeed.

· One min read
Amin Saied

Abstract. In this talk we'll review the GPT-1 paper [Improving Language Understanding by Generative Pre-Training - Radford et al][1]. By way of setting the stage we will give a brief review of the Transformer architecture.

Note: We didn't have time to cover GPT-2 in this talk, but some slides on the topic made it into the deck.

· One min read
Amin Saied

Abstract. In this talk we'll review the 🤗Hugging Face Transformers library. This is an open-source library with the stated goal to "democratize NLP". We'll briefly review some background, explain what problems Huggingface is trying to address, and cover some of the tools and techniques they provide. We will not assume any familiarity with transformers.