17000 into 30 dimensions is quite a bit of space - eg, if each dimension is only 2 values, 30 dimensions gives you a billion unuique locations (2^30)
Andrej Karpathy
458K subscribersWe implement a multilayer perceptron (MLP) character-level language model. In this video we also introduce many basics of ...
This is amazing. Using just a little bit of what I was able...
43 Comments
TheAIGRID
22.1k views • 3 days ago
AI Explained
103.0k views • 1 month ago
AI Explained
117.6k views • 1 month ago
TheAIGRID
62.7k views • 1 month ago
Wes Roth
35.0k views • 1 month ago
AI For Humans
5.4k views • 2 months ago
Andrej Karpathy
34.7k views • 2 months ago
TheAIGRID
217 views • 3 months ago
TheAIGRID
48.5k views • 3 months ago
AI For Humans
5.1k views • 3 months ago
Shelf will be hidden for 30 daysUndo
Wes Roth
22.8k views • 3 months ago
AI For Humans
1.0k views • 3 months ago
Wes Roth
75.1k views • 3 months ago
Wes Roth
43.1k views • 3 months ago
Wes Roth
19.1k views • 3 months ago
Wes Roth
6.4k views • 3 months ago
Wes Roth
5.2k views • 3 months ago
Wes Roth
9.0k views • 4 months ago
Wes Roth
15.7k views • 4 months ago
Wes Roth
2.3k views • 4 months ago
Wes Roth
10.8k views • 5 months ago
AI Jason
56.9k views • 5 months ago
Wes Roth
82.8k views • 5 months ago
Wes Roth
17.6k views • 5 months ago
Wes Roth
81.6k views • 5 months ago
Wes Roth
70.8k views • 5 months ago
Wes Roth
183.9k views • 5 months ago
Wes Roth
35.0k views • 5 months ago
Wes Roth
53.5k views • 6 months ago
Wes Roth
56.7k views • 6 months ago
TheAIGRID
15.6k views • 6 months ago
Wes Roth
59.7k views • 6 months ago
AI For Humans
626 views • 6 months ago
Wes Roth
10.6k views • 6 months ago
Wes Roth
19.1k views • 7 months ago
Wes Roth
53.5k views • 7 months ago
Wes Roth
70.3k views • 7 months ago
Wes Roth
49.0k views • 7 months ago
Wes Roth
21.3k views • 7 months ago
Wes Roth
40.2k views • 7 months ago
Wes Roth
59.4k views • 7 months ago
Wes Roth
55.4k views • 7 months ago
AI Jason
56.1k views • 7 months ago
Wes Roth
25.4k views • 7 months ago
Wes Roth
40.3k views • 7 months ago
Wes Roth
100.8k views • 7 months ago
Wes Roth
90.4k views • 7 months ago
AI Jason
13.0k views • 7 months ago
Wes Roth
67.1k views • 7 months ago
Wes Roth
34.6k views • 7 months ago
Wes Roth
84.7k views • 7 months ago
Wes Roth
42.6k views • 8 months ago
Wes Roth
31.7k views • 8 months ago
Wes Roth
48.3k views • 8 months ago
Wes Roth
43.4k views • 8 months ago
Wes Roth
25.9k views • 8 months ago
Wes Roth
69.4k views • 8 months ago
Wes Roth
46.3k views • 8 months ago
Wes Roth
34.1k views • 8 months ago
Wes Roth
75.3k views • 8 months ago
AI Jason
15.0k views • 8 months ago
AI For Humans
1.8k views • 8 months ago
Andrej Karpathy
686.3k views • 9 months ago
AI Jason
17.6k views • 9 months ago
TheAIGRID
21.0k views • 9 months ago
TheAIGRID
29.2k views • 9 months ago
TheAIGRID
36.3k views • 9 months ago
TheAIGRID
10.5k views • 9 months ago
TheAIGRID
61.1k views • 9 months ago
AI For Humans
5.6k views • 9 months ago
TheAIGRID
14.1k views • 9 months ago
AI Explained
151.7k views • 9 months ago
TheAIGRID
4.9k views • 9 months ago
TheAIGRID
95.1k views • 9 months ago
TheAIGRID
16.8k views • 9 months ago
TheAIGRID
54.5k views • 9 months ago
TheAIGRID
43.5k views • 9 months ago
TheAIGRID
18.7k views • 9 months ago
TheAIGRID
30.1k views • 9 months ago
TheAIGRID
39.1k views • 9 months ago
AI Jason
75.1k views • 9 months ago
TheAIGRID
176.2k views • 9 months ago
TheAIGRID
37.7k views • 9 months ago
TheAIGRID
17.5k views • 9 months ago
TheAIGRID
35.1k views • 9 months ago
AI Explained
388.7k views • 9 months ago
TheAIGRID
71.3k views • 9 months ago
TheAIGRID
55.3k views • 9 months ago
TheAIGRID
6.2k views • 10 months ago
TheAIGRID
27.9k views • 10 months ago
TheAIGRID
14.6k views • 10 months ago
AI For Humans
948 views • 10 months ago
TheAIGRID
20.8k views • 10 months ago
TheAIGRID
25.3k views • 10 months ago
TheAIGRID
36.6k views • 10 months ago
AI Explained
129.2k views • 10 months ago
AI Explained
97.7k views • 10 months ago
AI For Humans
5.7k views • 10 months ago
AI Jason
354.8k views • 10 months ago
AI For Humans
667 views • 10 months ago
AI For Humans
3.5k views • 10 months ago
Morningside AI
13.5k views • 10 months ago
AI For Humans
781 views • 10 months ago
AI Explained
129.9k views • 10 months ago
AI For Humans
1.5k views • 10 months ago
AI Jason
49.4k views • 10 months ago
AI Explained
118.4k views • 11 months ago
AI For Humans
3.0k views • 11 months ago
AI Jason
113.7k views • 11 months ago
AI For Humans
387 views • 11 months ago
AI For Humans
3.6k views • 11 months ago
AI Explained
118.3k views • 11 months ago
AI For Humans
2.3k views • 11 months ago
AI For Humans
1.7k views • 11 months ago
AI Jason
30.7k views • 11 months ago
AI For Humans
339 views • 11 months ago
AI For Humans
2.6k views • 11 months ago
AI Explained
106.4k views • 11 months ago
AI Explained
131.0k views • 11 months ago
AI For Humans
1.5k views • 11 months ago
AI Jason
218.6k views • 1 year ago
AI For Humans
1.4k views • 1 year ago
AI For Humans
1.8k views • 1 year ago
AI Explained
181.1k views • 1 year ago
AI Jason
35.1k views • 1 year ago
AI Explained
151.1k views • 1 year ago
Andrej Karpathy
482.7k views • 1 year ago
AI Explained
241.8k views • 1 year ago
AI Jason
63.7k views • 1 year ago
AI Explained
187.7k views • 1 year ago
AI Explained
161.6k views • 1 year ago
AI Jason
91.0k views • 1 year ago
AI Explained
272.8k views • 1 year ago
AI Jason
61.4k views • 1 year ago
AI Explained
96.8k views • 1 year ago
AI Jason
7.2k views • 1 year ago
AI Explained
145.9k views • 1 year ago
AI Explained
133.4k views • 1 year ago
AI Explained
79.5k views • 1 year ago
AI Jason
16.8k views • 1 year ago
AI Explained
84.1k views • 1 year ago
AI Explained
74.6k views • 1 year ago
AI Explained
144.9k views • 1 year ago
AI Jason
75.2k views • 1 year ago
Morningside AI
4.1k views • 1 year ago
AI Explained
83.7k views • 1 year ago
AI Jason
140.3k views • 1 year ago
AI Jason
33.7k views • 1 year ago
Morningside AI
9.8k views • 1 year ago
AI Explained
229.6k views • 1 year ago
Andrej Karpathy
1.9M views • 1 year ago
AI Explained
112.8k views • 1 year ago
Morningside AI
26.1k views • 1 year ago
AI Jason
16.3k views • 1 year ago
AI Jason
71.9k views • 1 year ago
AI Jason
53.8k views • 1 year ago
AI Jason
20.4k views • 1 year ago
AI Jason
53.4k views • 1 year ago
AI Jason
28.9k views • 1 year ago
Andrej Karpathy
4.3M views • 2 years ago
Andrej Karpathy
157.3k views • 2 years ago
Andrej Karpathy
172.8k views • 2 years ago
Andrej Karpathy
247.8k views • 2 years ago
Andrej Karpathy
278.3k views • 2 years ago
43 Comments
17000 into 30 dimensions is quite a bit of space - eg, if each dimension is only 2 values, 30 dimensions gives you a billion unuique locations (2^30)
Andrey thank you for this great series of lectures. you are a great Educator! 100% GOLD Material to Learn
I'm confused at 56:17 why care must be taking with how many times you can use the test dataset as the model will lear     See More
at 21:24 I think it's supposed to be first letter not first word. It's first word in the paper but first letter i     See More
Love all the tips and explanations on pytorch, training efficiency, and educational purposed errors. I was writing both code and notes and rewatching and enjoyed it and felt having a fruitfu     See More
Can't thank you enough. It's such a satisfying feeling to understand the logic under the ML models clearly. Thank you!
This is amazing. Using just a little bit of what I was able to learn from part 3, namely the Kaiming init, and turning back on the learning rate decay, I was able to achieve 2.03 and 2.04 in     See More
17000 into 30 dimensions is quite a bit of space - eg, if each dimension is only 2 values, 30 dimensions gives you a billion unuique locations (2^30)     See Less
Andrey thank you for this great series of lectures. you are a great Educator! 100% GOLD Material to Learn     See Less
I'm confused at 56:17 why care must be taking with how many times you can use the test dataset as the model will lear     See More Is this because there is no equivalent of 'torch.no_grad()' for LLMs - will the LLM always update the weights when given data?    See Less
i love u     See Less
at 21:24 I think it's supposed to be first letter not first word. It's first word in the paper but first letter i     See More ple    See Less
Love all the tips and explanations on pytorch, training efficiency, and educational purposed errors. I was writing both code and notes and rewatching and enjoyed it and felt having a fruitfu     See More r finished. It's like I was learning with a kind and insightful mentor sitting next to me. Thanks so much Andrej.    See Less
It is an absolute honor to learn from the very best. Thanks Andrej.     See Less
thank you andrej     See Less
Can't thank you enough. It's such a satisfying feeling to understand the logic under the ML models clearly. Thank you!     See Less
This is amazing. Using just a little bit of what I was able to learn from part 3, namely the Kaiming init, and turning back on the learning rate decay, I was able to achieve 2.03 and 2.04 in     See More nd validation with a 1.89 in my training loss with just 300k iterations and 23k parameters. I set my block size to 4 and my embeddings to 12 and increased my hidden layer to 300 while decaying my learning rate exponent from -1 to -3 linear space over the 300k steps. All that without even using batch normalization yet. After applying batch norm, was able to get these down to 1.99 and 1.98 with training loss in the 1.7s after a little more tweaking. Really good content in this lecture, it really has me feeling like a chef in the kitchen almost, cooking up a model with a few turns of the knobs...This sounds like a game or a problem that can be solved with an AI trained on turning knobs.    See Less