Open in app
Home
Notifications
Lists
Stories

Write
Zheng Zhang
Zheng Zhang

Home

Nov 30, 2020

Breaking down GPT-2 and Transformer

GPT-2 has shown an impressive capacity of getting around a wide range of NLP tasks. In this article, I will break down the inner workings of this versatile model, illustrating the architecture of GPT-2 and its essential component — transformer. This article distills the content of Jay Alammar’s inspirational blog…

Gpt 2

6 min read

Breaking down GPT-2 and Transformer
Breaking down GPT-2 and Transformer
Zheng Zhang

Zheng Zhang

Ph.D. student in Computer Science

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Knowable