But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

Unpacking how large language models work under the hood Early view of the next chapter for patrons: Special thanks to these supporters: #thanks To contribute edits to the subtitles, visit Other recommended resources on the topic. Richard Turner’s introduction is one of the best starting places: Coding a GPT with Andrej Karpathy Introduction to self-attention by John Hewitt History of language models by Brit Cruise: Paper about examples like the “woman - man” one presented here: ------------------ Timestamps 0:00 - Predict, sample, repeat 3:03 - Inside a transformer 6:36 - Chapter layout 7:20
Back to Top