[Paper] GPT-1 — Improving Language Understanding by Generative Pre-Training
This note walks through the architecture and training process described in the GPT-1 paper, pairing each mathematical definition with an intuitive interpretation.
