This is the last of a series of 3 videos where we demystify Transformer models and explain them with visuals and friendly examples.
Video 1: The attention mechanism in high level
Video 2: The attention mechanism with math
Video 3 (This one): Transformer models
If you like this material, check out LLM University from Cohere!
Get the Grokking Machine Learning book!
Discount code (40%): serranoyt
(Use the discount code on checkout)
00:00 Introduction
01:50 What is a transformer?
04:35 Generating one word at a time
08:59 Sentiment Analysis
13:05 Neural Networks
18:18 Tokenization
19:12 Embeddings
25:06 Positional encoding
27:54 Attention
32:29 Softmax
35:48 Architecture of a Transformer
39:00 Fine-tuning
42:20 Conclusion
1 view
2096
575
1 month ago 00:00:13 1
Да что ты черт побери такое несешь ? (Банды Нью-Йорка)
1 month ago 00:04:14 1
Paramore: Decode [OFFICIAL VIDEO]
1 month ago 00:00:32 1
…but the people are retarded
1 month ago 00:02:36 5
Jingle Bells | Christmas Song | Super Simple Songs
1 month ago 00:04:49 1
Play To Earn🔥This New Play to Earn Game is About to Make a Lot of People RICH
1 month ago 00:02:41 6
sinking in the deep || Viktor (Arcane)
1 month ago 00:37:34 1
Chorallas - Desert Lambs (1969) [Full Album]
1 month ago 00:02:50 1
We Are Number One but it contains spoilers from Madoka Magica Concept Movie (and Rebellion)
1 month ago 00:04:14 1
The Hunter - Bloodborne (4K UHD 2024)
1 month ago 00:08:38 1
Retired General on How Ukraine Is ‘Bleeding Out’ Against Russia | WSJ
1 month ago 00:19:20 1
Blue Ribbon: Story of the Build
1 month ago 00:03:28 1
Nemo - The Code (LIVE) | Switzerland🇨🇭| Grand Final | Eurovision 2024
2 months ago 01:04:51 18
Half in the Bag: Top 10 Horror Movies (2024) Part 2