Andrej Karpathy
1.1M subscribers
We take the 2-layer MLP from previous video and make it deeper with a tree-like structure, arriving at a convolutional neural ...
65 Comments
Andrej, please train an LLM with dilated convs, please show people we were doing it wrong this whole time, I know you can do it.
Just wanna say thank you for sharing your experience -- love this from-scratch series starting from first principles!
Awesome series!
Hi @AndrejKarpathy, thanks for recording this for us. I was following the whole way through, and funnily enough, I also wasn't able to beat the 1.993 that you got from this fancy hierarchical network. I actually went back and tuned the single hidden layer network you mentioned above and was able to get that one to perform even better than 1.993. To be exact, I got
train 1.7930818796157837
val 1.9838893413543701
test 1.9920368194580078
just from making the network bigger and running it for longer. But don't worry, it's not that embarrassing, as I'm sure there is a setting in this hierarchical network that will produce better results. It seems like there should be.
I have a challenging question (for me :) ). I made a very simple network which takes x as an input and produces y as an output. The network looks like y = sin(ax + b), where a and b are variables. The training data is built from sin(3x + 4312). The loss function is quadratic mean. Using the usual approaches, I couldn't make it work! What do you think the problem is?
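For anyone who wants to poke at the setup described in this comment, here is a minimal sketch in plain Python. The input range, the sample count, and the helper names (`make_data`, `mse`, `grad`) are assumptions for illustration and are not taken from the comment. The sketch only evaluates the loss and its analytic gradient; it does not claim to explain why the commenter's training failed. The one fact it does illustrate is that sin is 2π-periodic, so an offset of 4312 describes the same function as 4312 mod 2π.

```python
import math

# Target function from the comment: sin(3x + 4312).
TRUE_A, TRUE_B = 3.0, 4312.0

def make_data(n=200, lo=-3.0, hi=3.0):
    """Evenly spaced inputs and their targets (n, lo, hi are assumed values)."""
    xs = [lo + (hi - lo) * i / (n - 1) for i in range(n)]
    ys = [math.sin(TRUE_A * x + TRUE_B) for x in xs]
    return xs, ys

def mse(a, b, xs, ys):
    """Mean squared error of the model sin(a*x + b) on the data."""
    return sum((math.sin(a * x + b) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def grad(a, b, xs, ys):
    """Analytic gradient of the MSE with respect to a and b."""
    ga = gb = 0.0
    for x, y in zip(xs, ys):
        r = math.sin(a * x + b) - y   # residual
        c = math.cos(a * x + b)       # derivative of sin at the same argument
        ga += 2 * r * c * x
        gb += 2 * r * c
    n = len(xs)
    return ga / n, gb / n

xs, ys = make_data()
b_wrapped = TRUE_B % (2 * math.pi)      # same function, much smaller offset

print(mse(TRUE_A, TRUE_B, xs, ys))      # loss at the generating parameters
print(mse(TRUE_A, b_wrapped, xs, ys))   # loss with the wrapped offset
print(mse(1.0, 0.0, xs, ys))            # loss at an arbitrary starting point
print(grad(1.0, 0.0, xs, ys))           # gradient at that starting point
```

Printing the loss over a grid of `a` and `b` values with these helpers is one way to see what an optimizer faces from a given starting point.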
Um, can I find Part 6 somewhere? (RNN, LSTM, GRU...) I was under the impression that the next video in the playlist is about building GPT from scratch.
That was a very great playlist, easy to understand and very helpful, thank you very much!!
So far THE BEST lecture series I came across on YouTube. Alongside learning the neural networks in this series, I have learned PyTorch more than learning it by watching a PyTorch video s... 6 hrs from a YouTuber.
Thank you so much for creating this video lecture series. Your passion for this topic comes through so vividly in your lectures. I learned so much from every lecture and especially appreciated how the lectures started from the foundational concepts and built up to the state-of-the-art techniques. Thank you!
Please show how to implement conv2D layers as matrix multiplication and cover the math