GPT 5.2: OpenAI Strikes Back

AI Explained

390K subscribers

50.8k views  •  1 day ago

Full GPT-5.2 breakdown - did OpenAI reclaim the crown? A story of tokens, time and cost, plus 9 details you wouldn't get just from ...

Comments320

User Image

Where is the video on Opus 4.5, the best model of the year a...

19 Comments

@SirQuantization  20 hours ago

SimpleBench coming in clutch as usual. I absolutely love that you made your own benchmark for testing.

I find it fascinating that GPT 5 outperformed 5.1 and 5.1 outperformed 5.2. Why     See More ny theories? Seems like a potential topic for an entire video? Perhaps I'm missing something obvious in regards to what types of questions are on SimpleBench.    See Less

@stephenknox2346  20 hours ago

The problem is the assumption that counting the sheep achieves the goal rather than simply clearing another arbitrary hurdle as you realize the scope of the land outside the paddock.     See Less

@jonathanlivingston7358  20 hours ago

17:15 yes “eventually it would count all the sheep” but it still wouldn’t necessarily be able to relate each sheep     See More er.

AI has a strong atomistic intelligence—it’s a savant—yet is still an “idiot savant,” it has a weak holistic intelligence… and I’m not sure LLM’s will ever get a good holistic intelligence as I see little improvement on that level. A different type of breakthrough might be needed to get a positronic matrix. Until then we are stuck with Lore.    See Less

@randomuser5237  22 hours ago

Where is the video on Opus 4.5, the best model of the year and best coding model ever? This is becoming heavily biased towards Google and OpenAI.     See Less

@iambored9872  21 hours ago

Did your march for Palestine strike out? Does British law exists or is it Sharia now?     See Less

@Shaunmcdonogh-shaunsurfing  21 hours ago

Great work with Simple Bench     See Less

@seeker_of_knowlage3568  22 hours ago

For me, brute forcing the problem by more computing power that a lot of people wouldn't be able to access (cheaply at least) is not impressive.     See Less

@strayedlight  22 hours ago

I have yet to find a better workflow and experience than I had with 4o/4.1 combo... these benchmarks don't mean shite.     See Less

@Kairulol  22 hours ago

for programming, i've tried to daily a range of models but always come back to claude. all these recent benchmark results feel extremely disconnected with my actual day to day use of the     See More it used to just be claude (in a good way, overperforming).    See Less

@AlexAlex-wm7xt  23 hours ago

It’s score on benchmark makes me question the relevance arc agi.     See Less

13:03

Ex Google AI Veteran Claims Worlds First AGI Capable System - And Nobodys Talking About it...

TheAIGRID

4.1k views   •   1 day ago

34:18

GPT 5.2 is the first HUMAN LABOR replacement

Wes Roth

42.7k views   •   1 day ago

10:39

OpenAI Released GPT-5.2 Is Not What You Think - You Should Be Concerned

TheAIGRID

13.4k views   •   1 day ago

19:32

Pentagon "Four Months to Prepare for AGI"

Wes Roth

50.6k views   •   3 days ago

41:56

The Latest AI Breakthroughs You Need to See (Google, OpenAI, Deepseek and More)

TheAIGRID

23.7k views   •   5 days ago

17:31

"Code Red" Over, OpenAI is about to blow...

Wes Roth

52.5k views   •   5 days ago

42:48

The Latest Humanoid Robotics Breakthroughs You Need to See

TheAIGRID

17.9k views   •   6 days ago

33:44

AI News : Deepseek Returns, Amazons Secret AI Models, Googles Breakthrough , Veo 3 Beaten and More

TheAIGRID

9.1k views   •   1 week ago

118:46

Elon Reveals GROK 4.20... and it's getting scary good

Wes Roth

20.0k views   •   1 week ago

10:12

Google’s New Breakthrough Brings AGI Even Closer - Titans and Miras

TheAIGRID

19.2k views   •   1 week ago

20:16

You Are Being Told Contradictory Things About AI

AI Explained

65.9k views   •   1 week ago

11:29

ChatGPT Privacy CRACKS:The Court Now Has Your ChatGPT History

TheAIGRID

5.7k views   •   1 week ago

08:52

BREAKING: Grok 4.20 might be *too* good...

Wes Roth

70.6k views   •   1 week ago

10:33

Humanoid Robots Are Moving in Ways We’ve NEVER Seen Before.

TheAIGRID

24.9k views   •   1 week ago

27:54

this experiment could END the AI hype

Wes Roth

20.2k views   •   1 week ago

13:46

AI Is About to Change Coding Forever in 2026 - "Software Engineering Is Done"

TheAIGRID

22.6k views   •   1 week ago

62:38

China Just Popped America's AI Bubble: Cyrus Janssen Reveals What Happens Next!

Wes Roth

27.4k views   •   1 week ago

10:43

Grok Thinks Elon Musk Is a God… This Is Where It Gets Dangerous

TheAIGRID

6.8k views   •   1 week ago

81:11

You have TWO YEARS LEFT to prepare - Dr. Roman Yampolskiy

Wes Roth

87.3k views   •   2 weeks ago

13:43

Claude Opus 4.5 Just Crossed Into Human Territory

TheAIGRID

26.7k views   •   2 weeks ago

18:47

Claude turns chaotic evil

Wes Roth

158.9k views   •   2 weeks ago

00:26

Do NOT do this with Nano Banana Pro #ai #aiart #google

AI For Humans

2.7k views   •   2 weeks ago

20:07

Claude just beat Gemini 3... how?!

Wes Roth

45.4k views   •   2 weeks ago

02:17

How to Tell If an Image Is AI-Generated (Beginner Friendly)

TheAIGRID

4.7k views   •   2 weeks ago

12:33

"okay, but I want Gemini3 to perform 10x for my specific use case" - Here is how

AI Jason

27.4k views   •   2 weeks ago

00:42

Nano Banana Pro: Take a Selfie With Every Version of You

AI For Humans

1.9k views   •   3 weeks ago

44:40

Google's Nano Banana Pro & Gemini 3 Just Changed Everything!

AI For Humans

10.1k views   •   3 weeks ago

19:38

Google's UNREAL New Nano Banana Pro...

Wes Roth

34.8k views   •   3 weeks ago

14:56

Nano Banana Pro: But Did You Catch These 10 Details?

AI Explained

58.0k views   •   3 weeks ago

01:41

Google's Nano Banana Pro is INSANE

AI For Humans

4.3k views   •   3 weeks ago

12:08

the world wasn't ready for Gemini 3

Wes Roth

48.9k views   •   3 weeks ago

21:43

Gemini 3 Pro: Breakdown

AI Explained

115.4k views   •   3 weeks ago

23:40

Gemini 3 Shows a Level of Intelligence We Haven’t Seen Before. (Gemini 3 Explained)

TheAIGRID

70.9k views   •   3 weeks ago

14:08

Gemini 3 just got *scary* good

Wes Roth

54.8k views   •   3 weeks ago

26:37

xAI's new model is insane...

Wes Roth

51.3k views   •   3 weeks ago

13:33

This Chip Could Give OpenAI an Unfair Advantage.

TheAIGRID

8.7k views   •   3 weeks ago

14:40

Researchers Just Broke AI’s Most Important Assumption. (We Were Wrong About LLMs)

TheAIGRID

26.8k views   •   3 weeks ago

15:37

If This Works… AGI Arrives Early. (Thermodynamic Computing)

TheAIGRID

100.0k views   •   4 weeks ago

15:07

Google’s SIMA 2: The Most Advanced AI Agent Ever Built

TheAIGRID

16.3k views   •   4 weeks ago

29:19

SIMA 2 is a "significant step towards AGI" says Google

Wes Roth

36.5k views   •   4 weeks ago

18:27

Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that

AI Explained

60.7k views   •   4 weeks ago

45:12

OpenAI Surprise Drops GPT-5.1 But Google Is Lurking

AI For Humans

11.0k views   •   4 weeks ago

08:14

GPT 5.1 - The AI Update Nobody Expected...

TheAIGRID

13.6k views   •   4 weeks ago

19:45

Meta’s AI Genius Just Quit — Even Zuckerberg Seems Surprised.

TheAIGRID

123.1k views   •   1 month ago

19:57

the BIG SHORT against the AI BUBBLE (Nov 25th is the day)

Wes Roth

25.3k views   •   1 month ago

13:13

Leaked Letter Reveals OpenAI’s Real Plan... And people Aren't Happy About It.

TheAIGRID

48.3k views   •   1 month ago

48:20

ok WTF is going on? we need to discuss this...

Wes Roth

37.2k views   •   1 month ago

12:54

Bubble or No Bubble, AI Keeps Progressing (ft. Relentless Learning + Introspection)

AI Explained

59.5k views   •   1 month ago

16:14

OpenAIs New Agent is One Step Closer To Superintelligence. (AI 2027 Is Happening...)

TheAIGRID

21.3k views   •   1 month ago

11:02

KIMI K2 just broke the AI Industry... here's it's "secret"

Wes Roth

30.1k views   •   1 month ago

18:35

Nvidia CEO SHOCKS Everyone: “China Will WIN The AI Race!”

TheAIGRID

13.1k views   •   1 month ago

79:17

"No One is Prepared" the next 1,000 days are CRUCIAL | Emad Mostaque

Wes Roth

104.1k views   •   1 month ago

55:23

AI Job Losses Are Real. Don’t Panic (Yet).

AI For Humans

11.6k views   •   1 month ago

18:03

Project Suncatcher - Google's Plan to Put AI Data Centers in Space

Wes Roth

54.3k views   •   1 month ago

26:42

Claude just developed self awareness

Wes Roth

72.0k views   •   1 month ago

24:58

BREAKING: Ilya Sutskever DEPOSED, Sam Altman firing was planned a year in advance and more...

Wes Roth

54.0k views   •   1 month ago

37:44

LLMs can't reason

Wes Roth

31.0k views   •   1 month ago

36:05

Big AI News : Gemini 3, AI Music Ban, New Humanoid Robot, Groks AGI, UBI Starts and OpenAI Changes!

TheAIGRID

22.4k views   •   1 month ago

13:16

GAME OVER! AI Music Is Now BANNED!

TheAIGRID

28.2k views   •   1 month ago

62:27

OpenAI Unveils 2028 AGI Plan But First... Sora 2 Is Now For Pets?!

AI For Humans

9.4k views   •   1 month ago

08:33

The Design Mode for Claude Code...

AI Jason

35.3k views   •   1 month ago

01:25

How I Turned My Pet Into a Sora 2 Cameo Character

AI For Humans

4.7k views   •   1 month ago

07:58

OpenAI just said it

Wes Roth

43.6k views   •   1 month ago

14:58

The NEO Humanoid Robot Just Stunned The AI Industry (1x Tech NEO Humanoid Robot)

TheAIGRID

86.1k views   •   1 month ago

25:09

Self Improving AI is getting wild

Wes Roth

50.8k views   •   1 month ago

01:25

This AI Makes Videos As You Type!

AI For Humans

2.0k views   •   1 month ago

01:19

Sora 2 Prompt Allows You To See "Real" AI Movies

AI For Humans

3.2k views   •   1 month ago

08:55

Microsoft’s New AI Copilot Update Just Changed The Way You Will Use Computers Forever

TheAIGRID

18.5k views   •   1 month ago

52:48

Will OpenAI's ChatGPT Atlas Roll Over Google in 2025?

AI For Humans

13.3k views   •   1 month ago

13:52

Google's New Quantum Computing Breakthrough Just SHOCKED THE WORLD! (Quantum Echoes)

TheAIGRID

100.6k views   •   1 month ago

14:14

Did you miss these 2 AI stories? A *Real* LLM-crafted Breakthrough + Continual Learning Blocked?

AI Explained

57.7k views   •   1 month ago

05:14

Claude Skills - the SOP for your agent that is bigger than MCP

AI Jason

31.5k views   •   1 month ago

48:09

OpenAI’s Curvy Road to AGI Includes Sora 2 and… Erotica??

AI For Humans

11.4k views   •   1 month ago

53:01

OpenAI Nerfs Sora 2. Chaos Still Reigns. Is It Over??

AI For Humans

14.5k views   •   2 months ago

11:47

.agent folder is making claude code 10x better...

AI Jason

56.4k views   •   2 months ago

57:36

OpenAI’s Sora 2: Future of Media or AI SLOPOCALYPSE??

AI For Humans

14.4k views   •   2 months ago

01:43

Introducing AndThen. Play the Conversation.

AI For Humans

2.7k views   •   2 months ago

15:44

Sora 2 - It will only get more realistic from here

AI Explained

58.3k views   •   2 months ago

02:06

You Won't Believe Sora 2's New Features!

AI For Humans

8.1k views   •   2 months ago

14:07

OpenAI Tests if GPT-5 Can Automate Your Job - 4 Unexpected Findings

AI Explained

66.8k views   •   2 months ago

38:23

OpenAI Raises Billions While AI Creates New Drugs. What's Next?

AI For Humans

11.1k views   •   2 months ago

49:55

Meta’s $800 AI Glasses Show The Future… Sometimes Breaks

AI For Humans

10.9k views   •   2 months ago

02:12

How Did He Make This With AI?

AI For Humans

2.8k views   •   2 months ago

11:32

ChatGPT Can Now Call the Cops, but 'Wait till 2100 for Full Job Impact' - Altman

AI Explained

48.5k views   •   2 months ago

11:32

ChatGPT Can Now Call the Cops, but 'Wait till 2100 for Full Job Impact' - Altman

AI Explained

20.2k views   •   2 months ago

50:33

OpenAI Is Spending A Fortune To Get To AGI. Will They Make It?

AI For Humans

13.3k views   •   3 months ago

06:41

Vibe Design is much better than I thought...

AI Jason

16.3k views   •   3 months ago

44:47

AI Is Taking Jobs. It Doesn't Have To Take Yours.

AI For Humans

9.4k views   •   3 months ago

52:12

We Tried Google’s Nano Banana AI Model. It’s... Ridiculous.

AI For Humans

19.1k views   •   3 months ago

18:55

An ‘AI Bubble’? What Altman Actually said, the Facts and Nano Banana

AI Explained

57.5k views   •   3 months ago

53:56

Move Over OpenAI… Google Looks Ready To Take The AI Lead

AI For Humans

16.4k views   •   3 months ago

44:52

OpenAI's GPT-5 Struggles To Be AI For Everything & Everybody

AI For Humans

11.4k views   •   3 months ago

16:02

I was using sub-agents wrong... Here is my way after 20+ hrs test

AI Jason

100.2k views   •   3 months ago

53:25

OpenAI’s GPT-5 Is Very Good... But AGI Might Be Delayed.

AI For Humans

18.3k views   •   4 months ago

15:02

GPT-5 has Arrived

AI Explained

163.1k views   •   4 months ago

11:55

Genie 3: The World Becomes Playable (DeepMind)

AI Explained

195.2k views   •   4 months ago

40:18

OpenAI’s GPT-5 Leaks Show Us The Future (Of Next Week??)

AI For Humans

35.8k views   •   4 months ago

64:05

OpenAI Teases GPT-5 as America Goes Full 'AI Action' Mode

AI For Humans

20.0k views   •   4 months ago

18:44

I was using Claude Code wrong... The Ultimate Workflow

AI Jason

135.1k views   •   4 months ago

17:20

How Not to Read a Headline on AI (ft. new Olympiad Gold, GPT-5 …)

AI Explained

84.5k views   •   4 months ago

51:06

OpenAI’s New ChatGPT Agent Might've Just Stolen Your Job

AI For Humans

18.8k views   •   4 months ago

07:02

Claude Killer? My review on Kimi K2 after hrs of testing...

AI Jason

80.8k views   •   4 months ago

02:12

Is Grok 4 the smartest AI model in the world?

AI For Humans

12.9k views   •   5 months ago

11:44

Grok 4 - 10 New Things to Know

AI Explained

177.8k views   •   5 months ago

09:29

Tired of AI-ish UI? Here is how to make it better...

AI Jason

52.0k views   •   5 months ago

55:27

OpenAI & Google Are Using AI To Take Over. What About Us?

AI For Humans

22.3k views   •   5 months ago

16:39

Claude Designer is insane...Ultimate vibe coding UI workflow

AI Jason

183.4k views   •   5 months ago

26:20

When Will AI Models Blackmail You, and Why?

AI Explained

109.5k views   •   5 months ago

51:33

OpenAI's GPT-5 Is Coming But Sam Altman Won't Stop Throwing Shade

AI For Humans

19.7k views   •   5 months ago

05:56

Vibe Versioning - Iterate UI in Cursor 10x faster

AI Jason

22.8k views   •   5 months ago

01:22

Would You Let This Robot In Your House?

AI For Humans

4.0k views   •   5 months ago

14:01

Apple’s ‘AI Can’t Reason’ Claim Seen By 13M+, What You Need to Know

AI Explained

101.4k views   •   6 months ago

45:51

OpenAI Preps To Blow Past AGI Straight to Super Intelligence

AI For Humans

19.1k views   •   6 months ago

22:02

Build the next Billion $ Agent 🚀

AI Jason

17.8k views   •   6 months ago

16:50

AI Accelerates: New Gemini Model + AI Unemployment Stories Analysed

AI Explained

96.3k views   •   6 months ago

02:47

Is GPT-5 Coming Next Month? Find Out What Might Happen!

AI For Humans

15.8k views   •   6 months ago

00:49

Will AI take your job?!

AI For Humans

3.2k views   •   6 months ago

56:57

Anthropic's CEO Says AI Will Take 50% of Jobs. Now What?

AI For Humans

31.6k views   •   6 months ago

13:09

Is VEO 3 really the death of human creativity?

AI For Humans

9.5k views   •   6 months ago

03:35

10x better UI design for vibe coders - Use v0 directly in Cursor

AI Jason

51.8k views   •   6 months ago

19:05

Claude 4: Full 120 Page Breakdown … Is it the Best New Model?

AI Explained

98.8k views   •   6 months ago

56:15

Google Went AI Crazy and VEO 3 Is Just the Start

AI For Humans

20.4k views   •   6 months ago

04:25

How to make accurate UI Tweak in Cursor with Stagewise

AI Jason

24.4k views   •   6 months ago

17:08

Google Takes No Prisoners Amid Torrent of AI Announcements

AI Explained

99.6k views   •   6 months ago

02:10

VEO 3 is actually insane. Best AI video + audio AI tool yet.

AI For Humans

31.0k views   •   6 months ago

14:02

Build MCP business for vibe coder

AI Jason

10.1k views   •   6 months ago

01:59

Will this be the biggest AI News week to date?!?

AI For Humans

2.0k views   •   6 months ago

17:42

AI Improves at Self-improving

AI Explained

82.9k views   •   6 months ago

48:59

Google's New AI Agent Improves Itself. But Can It Stop AI Babies?

AI For Humans

15.4k views   •   6 months ago

01:19

Can Google Gemini Make Coding Easy for Everyone?

AI For Humans

1.5k views   •   7 months ago

11:44

Cursor + Browser control = Self improving coding agent

AI Jason

33.5k views   •   7 months ago

34:24

"OpenAI is Not God” - The DeepSeek Documentary on Liang Wenfeng, R1 and What's Next

AI Explained

105.4k views   •   7 months ago

14:34

o3 breaks (some) records, but AI becomes pay-to-win

AI Explained

60.8k views   •   7 months ago

19:04

How I reduced 90% errors for my Cursor (Part 2)

AI Jason

54.7k views   •   7 months ago

14:25

o3 and o4-mini - they’re great, but easy to over-hype

AI Explained

94.2k views   •   7 months ago

20:10

‘Speaking Dolphin’ to AI Data Dominance, 4.1 + Kling 2.0: 7 Updates Critically Analysed

AI Explained

60.3k views   •   7 months ago

15:30

How I reduced 90% errors for my Cursor (+ any other AI IDE)

AI Jason

285.2k views   •   8 months ago

23:52

AI CEO: ‘Stock Crash Could Stop AI Progress’, Llama 4 Anti-climax + ‘Superintelligence in 2027’ ...

AI Explained

72.7k views   •   8 months ago

13:19

Don't do RAG - This method is way faster & accurate...

AI Jason

168.9k views   •   8 months ago

64:53

NVIDIA Dominates The Race To AGI at GTC 2025

AI For Humans

7.7k views   •   8 months ago

09:14

Claude Designer is insane...Ultimate vibe coding UI workflow

AI Jason

223.5k views   •   8 months ago

10:09

Gemini 2.0 blew me away - The future of Multimodal Model

AI Jason

16.5k views   •   8 months ago

01:22

Jurassic Park AI Video Fail 😭🤖😳#ai #aivideo #funny

AI For Humans

1.1k views   •   8 months ago

02:20

AI will write 100% of code. What happens next?! 😳 #ai #technology #chatgpt

AI For Humans

1.2k views   •   8 months ago

13:07

MCP = Next Big Opportunity? EASIST way to build your own MCP business

AI Jason

86.2k views   •   9 months ago

131:12

How I use LLMs

Andrej Karpathy

2.1M views   •   9 months ago

13:17

Those MCP totally 10x my Cursor workflow…

AI Jason

224.8k views   •   9 months ago

55:52

Who Will Control The Future of AI?

AI For Humans

7.0k views   •   10 months ago

04:01

Sam Altman Confirms GPT-5 & It Will Be FREE For Everyone

AI For Humans

6.4k views   •   10 months ago

211:24

Deep Dive into LLMs like ChatGPT

Andrej Karpathy

4.2M views   •   10 months ago

20:35

The ONLY way to run your own Deepseek on mobile...

AI Jason

16.3k views   •   10 months ago

08:40

Yep, o3-mini is WORTH the money - Build your own reasoning agent

AI Jason

18.5k views   •   10 months ago

01:08

China’s Robotics advances are INSANE 🤖🤯👀 #ai #robotics #technology

AI For Humans

2.3k views   •   10 months ago

52:17

OpenAI Starts Prepping For Super Intelligence (ASI) & More AI News

AI For Humans

9.7k views   •   10 months ago

81:55

Founding fathers on today's America

Andrej Karpathy

34.7k views   •   11 months ago

51:56

The Biggest Week in AI Yet (For Real This Time)

AI For Humans

8.0k views   •   1 year ago

52:16

The Future of AI: OpenAI's 12 Days of Surprises

AI For Humans

6.5k views   •   1 year ago

46:52

Why OpenAI's o1 Model Might Be The Future of AI Scaling

AI For Humans

7.6k views   •   1 year ago

07:16

How AI Video Is Changing Hollywood

AI For Humans

4.2k views   •   1 year ago

01:00

OpenAI’s Orion Coming in November?!? 👀🤖🤯 #ai #tech #openai

AI For Humans

2.4k views   •   1 year ago

00:52

Nobel Prize Winner Disses Sam Altman 😭🤯👀 #ai #news #openai

AI For Humans

3.1k views   •   1 year ago

00:50

Voice Memo to Musical with Suno Covers 🔊🤖 #ai #aimusic #technology

AI For Humans

9.6k views   •   1 year ago

241:26

Let's reproduce GPT-2 (124M)

Andrej Karpathy

951.6k views   •   1 year ago

30:38

Expert AI Developer Explains NEW OpenAI Assistants API v2 Release

Morningside AI

13.8k views   •   1 year ago

133:35

Let's build the GPT Tokenizer

Andrej Karpathy

972.3k views   •   1 year ago

26:56

Expert AI Developer Explains What OpenAI's Q* Means for Businesses

Morningside AI

4.2k views   •   2 years ago

45:54

Voiceflow CEO Talks GPTs, Future of AI Agencies and Chatbot Builders (Full Interview)

Morningside AI

10.1k views   •   2 years ago

59:48

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

3.2M views   •   2 years ago

39:00

Expert AI Developer Explains What OpenAI 'GPTs' Mean For Businesses

Morningside AI

26.7k views   •   2 years ago

116:20

Let's build GPT: from scratch, in code, spelled out.

Andrej Karpathy

6.6M views   •   2 years ago

56:22

Building makemore Part 5: Building a WaveNet

Andrej Karpathy

252.0k views   •   3 years ago

115:24

Building makemore Part 4: Becoming a Backprop Ninja

Andrej Karpathy

309.9k views   •   3 years ago

115:58

Building makemore Part 3: Activations & Gradients, BatchNorm

Andrej Karpathy

451.9k views   •   3 years ago

75:40

Building makemore Part 2: MLP

Andrej Karpathy

482.9k views   •   3 years ago

19 Comments

@SirQuantizatio...  20 hours ago

SimpleBench coming in clutch as usual. I absolutely love that you made your own benchmark for testing.

I find it fascinating that GPT 5 outperformed 5.1 and 5.1 outperformed 5.2. Why     See More ny theories? Seems like a potential topic for an entire video? Perhaps I'm missing something obvious in regards to what types of questions are on SimpleBench.    See Less

@stephenknox234...  20 hours ago

The problem is the assumption that counting the sheep achieves the goal rather than simply clearing another arbitrary hurdle as you realize the scope of the land outside the paddock.     See Less

@jonathanliving...  20 hours ago

17:15 yes “eventually it would count all the sheep” but it still wouldn’t necessarily be able to relate each sheep     See More er.

AI has a strong atomistic intelligence—it’s a savant—yet is still an “idiot savant,” it has a weak holistic intelligence… and I’m not sure LLM’s will ever get a good holistic intelligence as I see little improvement on that level. A different type of breakthrough might be needed to get a positronic matrix. Until then we are stuck with Lore.    See Less

@randomuser5237  22 hours ago

Where is the video on Opus 4.5, the best model of the year and best coding model ever? This is becoming heavily biased towards Google and OpenAI.     See Less

@iambored9872  21 hours ago

Did your march for Palestine strike out? Does British law exists or is it Sharia now?     See Less

@Shaunmcdonogh-...  21 hours ago

Great work with Simple Bench     See Less

@seeker_of_know...  22 hours ago

For me, brute forcing the problem by more computing power that a lot of people wouldn't be able to access (cheaply at least) is not impressive.     See Less

@strayedlight  22 hours ago

I have yet to find a better workflow and experience than I had with 4o/4.1 combo... these benchmarks don't mean shite.     See Less

@Kairulol  22 hours ago

for programming, i've tried to daily a range of models but always come back to claude. all these recent benchmark results feel extremely disconnected with my actual day to day use of the     See More it used to just be claude (in a good way, overperforming).    See Less

@AlexAlex-wm7xt  23 hours ago

It’s score on benchmark makes me question the relevance arc agi.     See Less