🖤 🔥
AI Explained
353K subscribersCritical analysis of the two most powerful new models behind ChatGPT, o3 and o4-mini. Not just the system cards, benchmarks, ...
🖤 🔥
73 Comments
TheAIGRID
99.5k views • 1 day ago
Wes Roth
91.1k views • 1 day ago
Wes Roth
11.6k views • 1 day ago
AI For Humans
2.1k views • 2 days ago
Wes Roth
79.9k views • 2 days ago
TheAIGRID
21.2k views • 2 days ago
Wes Roth
10.0k views • 2 days ago
AI For Humans
25.0k views • 3 days ago
TheAIGRID
13.6k views • 3 days ago
Wes Roth
80.3k views • 3 days ago
Shelf will be hidden for 30 daysUndo
Wes Roth
13.8k views • 3 days ago
Wes Roth
24.6k views • 4 days ago
TheAIGRID
8.1k views • 4 days ago
AI For Humans
8.9k views • 5 days ago
Wes Roth
29.3k views • 5 days ago
TheAIGRID
17.6k views • 5 days ago
TheAIGRID
49.3k views • 1 week ago
AI Jason
31.5k views • 1 week ago
Wes Roth
19.4k views • 1 week ago
Wes Roth
15.0k views • 1 week ago
Wes Roth
201.6k views • 1 week ago
TheAIGRID
32.6k views • 1 week ago
Wes Roth
148.3k views • 1 week ago
TheAIGRID
3.3k views • 1 week ago
TheAIGRID
23.5k views • 1 week ago
Wes Roth
45.1k views • 1 week ago
AI Explained
90.5k views • 1 week ago
Wes Roth
17.4k views • 1 week ago
AI For Humans
18.8k views • 1 week ago
Wes Roth
4.9k views • 1 week ago
AI Jason
16.1k views • 1 week ago
Wes Roth
51.5k views • 1 week ago
TheAIGRID
439.3k views • 1 week ago
AI Explained
95.2k views • 1 week ago
Wes Roth
103.7k views • 1 week ago
AI For Humans
25.6k views • 1 week ago
Wes Roth
53.8k views • 1 week ago
AI Jason
7.6k views • 1 week ago
TheAIGRID
31.1k views • 1 week ago
AI For Humans
1.8k views • 1 week ago
AI Explained
74.6k views • 1 week ago
Wes Roth
62.3k views • 1 week ago
TheAIGRID
37.3k views • 2 weeks ago
Wes Roth
9.5k views • 2 weeks ago
Wes Roth
3.0k views • 2 weeks ago
Wes Roth
5.9k views • 2 weeks ago
Wes Roth
13.9k views • 2 weeks ago
TheAIGRID
39.8k views • 2 weeks ago
AI For Humans
13.8k views • 2 weeks ago
Wes Roth
99.4k views • 2 weeks ago
Wes Roth
58.1k views • 2 weeks ago
AI For Humans
1.5k views • 2 weeks ago
TheAIGRID
9.6k views • 2 weeks ago
TheAIGRID
35.8k views • 3 weeks ago
AI For Humans
21.2k views • 3 weeks ago
AI For Humans
1.3k views • 3 weeks ago
TheAIGRID
29.3k views • 3 weeks ago
AI Jason
24.4k views • 3 weeks ago
AI For Humans
1.5k views • 3 weeks ago
AI For Humans
1.7k views • 3 weeks ago
TheAIGRID
11.6k views • 4 weeks ago
TheAIGRID
24.4k views • 1 month ago
AI For Humans
12.7k views • 1 month ago
TheAIGRID
13.6k views • 1 month ago
TheAIGRID
22.0k views • 1 month ago
TheAIGRID
7.6k views • 1 month ago
TheAIGRID
39.1k views • 1 month ago
AI Explained
98.3k views • 1 month ago
TheAIGRID
27.7k views • 1 month ago
AI Explained
60.0k views • 1 month ago
AI For Humans
17.2k views • 1 month ago
TheAIGRID
5.9k views • 1 month ago
TheAIGRID
50.8k views • 1 month ago
AI Jason
45.3k views • 1 month ago
AI For Humans
16.5k views • 1 month ago
AI Explained
92.6k views • 1 month ago
AI Explained
56.7k views • 1 month ago
AI For Humans
10.3k views • 1 month ago
AI Jason
221.8k views • 1 month ago
AI Explained
72.1k views • 1 month ago
AI For Humans
11.3k views • 1 month ago
AI For Humans
3.4k views • 2 months ago
AI Explained
108.0k views • 2 months ago
AI For Humans
14.8k views • 2 months ago
AI Jason
97.6k views • 2 months ago
AI Explained
135.6k views • 2 months ago
AI Explained
93.2k views • 2 months ago
AI For Humans
7.6k views • 2 months ago
AI Jason
211.8k views • 2 months ago
AI For Humans
949 views • 2 months ago
AI Jason
16.2k views • 2 months ago
AI For Humans
987 views • 2 months ago
AI For Humans
1.1k views • 2 months ago
AI Explained
117.1k views • 2 months ago
AI For Humans
11.2k views • 2 months ago
AI Jason
82.4k views • 2 months ago
AI For Humans
1.0k views • 2 months ago
AI Explained
109.7k views • 3 months ago
Andrej Karpathy
1.4M views • 3 months ago
AI Explained
135.2k views • 3 months ago
AI Jason
189.7k views • 3 months ago
AI Explained
111.2k views • 3 months ago
Andrej Karpathy
2.6M views • 3 months ago
AI Jason
14.7k views • 3 months ago
AI Explained
122.8k views • 3 months ago
AI Jason
18.1k views • 3 months ago
AI Explained
107.6k views • 4 months ago
AI Explained
182.7k views • 4 months ago
AI Jason
51.4k views • 4 months ago
AI Explained
106.0k views • 4 months ago
AI Jason
46.6k views • 4 months ago
AI Jason
43.4k views • 4 months ago
AI Explained
108.3k views • 4 months ago
AI Jason
68.1k views • 4 months ago
AI Explained
287.4k views • 5 months ago
Andrej Karpathy
34.7k views • 5 months ago
AI Jason
96.7k views • 5 months ago
AI Explained
87.3k views • 5 months ago
AI Explained
74.9k views • 5 months ago
AI Explained
153.6k views • 5 months ago
AI Explained
116.9k views • 5 months ago
AI Jason
62.6k views • 6 months ago
AI Jason
325.6k views • 7 months ago
AI Jason
151.4k views • 8 months ago
AI Jason
201.3k views • 8 months ago
AI Jason
31.3k views • 8 months ago
AI Jason
19.7k views • 9 months ago
AI Jason
124.6k views • 10 months ago
Andrej Karpathy
811.3k views • 11 months ago
Morningside AI
13.8k views • 1 year ago
Andrej Karpathy
813.6k views • 1 year ago
Morningside AI
4.2k views • 1 year ago
Morningside AI
10.1k views • 1 year ago
Andrej Karpathy
2.8M views • 1 year ago
Morningside AI
26.7k views • 1 year ago
Andrej Karpathy
5.7M views • 2 years ago
Andrej Karpathy
223.2k views • 2 years ago
Andrej Karpathy
268.3k views • 2 years ago
Andrej Karpathy
384.0k views • 2 years ago
Andrej Karpathy
423.4k views • 2 years ago
73 Comments
something i think would be great to add to simple bench is to add the date as to when a given model was run and tested on the benchmark, its a lot to try and keep up with, so to see a date w     See More
The o3 issues you uncovered all came back to logic for me and its lack thereof This has been a trend since I started using GPT3.5 and if there has been logic progress, it has been much mor     See More
No. AGI is when it can generalized out of its training domain. Tell it to DM a D&D campaign and let's see it try to do anything impressive.
AGI for me is when it is continually learning. But that's not the standard path at the moment.
Thank you, this kind of more level headed analysis is desperately needed!
Since it was released to the plus tier (nice change by OpenAI!) I did a fair amount of testing myself. Overal     See More
It's somewhat reassuring that OpenAI are saying they won't release a model if it can help people create bioweapons but I'm also not convinced they won't reneg on this obligat     See More
You will need to spin up a few digital clones to keep up with the advancing rates of model hype.
Open AI models are not most famous for their physical reality proximation descriptions. 🙂 When they catch up, I guess eg Sora might improve.
🖤 🔥     See Less
something i think would be great to add to simple bench is to add the date as to when a given model was run and tested on the benchmark, its a lot to try and keep up with, so to see a date w     See More nk at least be great! maybe just add a date column after "Organization"    See Less
The o3 issues you uncovered all came back to logic for me and its lack thereof This has been a trend since I started using GPT3.5 and if there has been logic progress, it has been much mor     See More than all other metrics. In fact, I am certain that if all other metrics were the same, BUT logic significantly advanced, we would have AGI now. Without logic, it will never beat humans and all the things we would love AI to do and for sure it will NOT replace a developer and so many other intelligent careers as logic really is a key to them all.
A lack of logic = hallucinations
I am not sure why OpenAI said it was hallucination free when it clearly is not. Perhaps for investors to hear? Seems foolish unless those investing really have no clue about the reality of the model which may very well be.    See Less
No. AGI is when it can generalized out of its training domain. Tell it to DM a D&D campaign and let's see it try to do anything impressive.     See Less
AGI for me is when it is continually learning. But that's not the standard path at the moment.     See Less
Thank you, this kind of more level headed analysis is desperately needed!
Since it was released to the plus tier (nice change by OpenAI!) I did a fair amount of testing myself. Overal     See More ry, very impressed. The image reasoning is a step change compared to previous OpenAI models. It's also super fun to watch it reasoning in images. It is sort of clumsy, in the same way robots are clumsy when they walk. It's kind of cute when it keeps zooming in on various parts of the image talking to itself 😊.
It did hallucinate for me too. For example it kept insisting that clockwise is 9 -> 8 -> 7 -> 6, as part of its solution to a more complex problem. Finally it admitted that time moves forward 😅.
I feel like this video gave a slightly too negative impression, but as a counterbalance it was great!    See Less
For the algorithm, I hope you had a wonderful flight.     See Less
It's somewhat reassuring that OpenAI are saying they won't release a model if it can help people create bioweapons but I'm also not convinced they won't reneg on this obligat     See More future. The money they can make from it will likely change their minds. They've backtracked on most of the promises they've made about AI safety, I doubt this will be any different. Hope to be wrong.    See Less
You will need to spin up a few digital clones to keep up with the advancing rates of model hype.     See Less
Open AI models are not most famous for their physical reality proximation descriptions. 🙂 When they catch up, I guess eg Sora might improve.     See Less