AI Accelerates: New Gemini Model + AI Unemployment Stories Analysed

AI Explained

0 subscribers

93.9k views  •  3 weeks ago

There's a new best language model, so let's go through the up and downs of Gemini 2.5 Pro 06-05. Record-breaking ...

Comments550

User Image

Nice video here, from paying for day care and college, to ma...

23 Comments

@Veteran2002  1 week ago

Nice video here, from paying for day care and college, to managing mortgage payments. I'm approaching retirement yet inflation is getting worse. How can I generate more income to retire     See More ast $3m for long term care? I have about €750k in savings    See Less

@PauseAIAustralia  1 week ago

We need an international AI safety treaty     See Less

@DreckbobBratpfanne  1 week ago

Next frontier is to get models to ignore stuff they learned when appropriate to not fall for these traps like the line length example xD


Also when will we see in-context learning     See More d RL? A model that is exceptional at that could be really cool.    See Less

@lev1ato  2 weeks ago

I have not accomplished anything in life yet and the AI will just come and take it all, even the hope that I had, what will I live off?     See Less

@AtPitou  2 weeks ago

5:25 Absolutely! I work as a Digital Transformation Supervisor, and I work with a Google workspace for education fundament     See More n, and thew Google Admin is as intuitive as Yamaha remote controller... So when I was stuck with MDM settings for edu fundamentals edition, Gemini confidently sent me on a 3+ hour fetch-quest of changing settings that ddidn't have anything to do with mobile devices, and chasing settings that didn't exist. Ib felt so cheated after I realized it was all a hallucination of a solution that doesn't exist, and I had to back-track the settings I had (excuse my French) fucked with blindly.    See Less

@Celeritate7  3 weeks ago

great video     See Less

@andrewdunbar828  3 weeks ago

the dough-velopments in AI     See Less

@davem1658  3 weeks ago

Thanks for this video I find the quality great, and you provide an objective intellectual view on recent events.

In my previous comments in your past videos, I may have come across     See More ning when you dove into more complex topics that involve technical AI concepts. For someone like me, who often watches YouTube recreationally, it can be a bit much to shift gears and focus deeply just to follow along. However, I do understand this channel is AI Explained, so that approach makes sense.

If the occasional complicated idea shows up in one of your news videos, I think that’s totally fine. And if you decide to make separate videos clearly labelled as lessons for the more advanced concepts, that works well too.

In my view, being successful on Youtube means staying true to your identity while adapting about 50% to what the audience wants, but I think you're doing everything right. I hope you find this advice helpful.    See Less

@DustinRodriguez1_0  3 weeks ago

AI hallucinations, and the Apple 'LLMs can't reason' paper and such are real, and they will NOT be remedied by merely scaling up models, or with training models on more data. BUT     See More ge that needs to be made is simple, in concept at least. LLMs can not reliably and accurately model absolute binary logic. This causes them to get 'lost' in things which they believe are highly correllated with truth, but which do not stand up to the rigor of absolute logic. In logical arguments, there is no such thing as "0.9999 probability of truth". There is 1.0 probability of truth, or 0.0. With no value in between. Even the slightest error does not simply make the whole argument (no matter how large that argument might be, it could be thousands of pages) 'slightly false', it makes the whole thing completely false. This is a degree of rigor that most people aren't familiar with, and it doesn't get used continually for even philosophers who want to try to do so, but it is required. It is the only strategy which modulates the cognitive biases which lead people to accept "it feels really true, it aligns precisely with my intuition" as true. Right now, LLMs are operating on pure intuition. Even when they "think" with test-time inference compute scaling, all they are doing is feeling their way through arguments that FEEL true. They can't look at something, recognize that it feels deeply true, fits right in with everything else true, but has a tiny flaw that makes it completely false. Logic is not gradients. It is discrete, and absolute. Compare human history before and after The Enlightenment. The core of The Enlightenment was integrating exactly this kind of absolute logic into our wheelhouse of thought, despite it being alien to us. Despite it blowing apart things that felt as true as our need to breathe. That enabled us to formalize things, to take our human preconceptions and feelings out of it and process the world just based on rules and logic. And that enabled us to build computers that apply that absolute reasoning trillions of times faster and more efficiently than we ever could. Even enabling us to reproduce a model of our intuitive brain with AI. But the model needs access to that binary logic in addition to its fuzzy logic. Just like we did. That's not transparently obvious as to how to do it, but once it is done, expect a warp-speed leap in capabilities and especially efficiency. Continued scaling will just waste mountains of compute on emulating imperfectly the absolute rules that it could use when necessary.    See Less

@gmtree7830  3 weeks ago

we out here Phil, waiting for the o3-pro drop haha     See Less

16:35

Google Deepmind's VIDEOGAME AGI? (the REAL reason for VEO 3)

Wes Roth

40.2k views   •   1 day ago

01:21

Both Google Deepmind and John Carmack Are Building Videogame Simulations for Training AGI

Wes Roth

4.9k views   •   1 day ago

00:59

Anthropic's new "ECONOMIC AI" Research is testing if Claude Can Run It's Own Business...

Wes Roth

8.1k views   •   1 day ago

00:58

Can Claude AI run it's own BUSINESS? Surprising results...

Wes Roth

6.9k views   •   1 day ago

02:04

YouTube is taken over by AI content channels and bots? Will AI make it harder for YouTube?

Wes Roth

1.5k views   •   1 day ago

09:07

Ilya Sutskever's SHOCKING Superintelligence Warning "extremely unpredictable and unimaginable"

Wes Roth

58.1k views   •   2 days ago

00:55

The Internet is DEAD and AI Slop Killed it...

Wes Roth

7.5k views   •   2 days ago

13:12

The Internet DIES as Gen AI Takes Over | The Dead Internet Theory is now true...

Wes Roth

38.0k views   •   3 days ago

00:31

OnlyFans sued for using "AI Models"

Wes Roth

4.1k views   •   3 days ago

15:43

I Gave Claude $1,000 to Start a Business (No Humans Needed?)

Wes Roth

36.2k views   •   4 days ago

01:00

Grok 3.5 is CANCELLED (now waiting on GROK 4)

Wes Roth

11.9k views   •   4 days ago

15:16

Elon Musk Goes SCORCHED EARTH! Grok 4, Neuralink plays Call of Duty and Self Delivered Teslas...

Wes Roth

55.9k views   •   5 days ago

00:58

Elon Musk goes SCORCHED EARTH - Grok 4, Neuralink and Tesla go BOOM! #ai #llm #agi

Wes Roth

12.1k views   •   5 days ago

28:20

Sam Altman Just REVEALED The Future Of AI..

TheAIGRID

29.4k views   •   6 days ago

51:59

TESLA Robotaxi Just DESTROYED the Entire Industry! Dr. Know-it-all explains what's ACTUALLY going on

Wes Roth

16.0k views   •   6 days ago

01:21

My new calling in life... Arpeggiator is open source on Hugging Face

Wes Roth

8.9k views   •   6 days ago

55:27

OpenAI & Google Are Using AI To Take Over. What About Us?

AI For Humans

19.7k views   •   1 week ago

16:39

Claude Designer is insane...Ultimate vibe coding UI workflow

AI Jason

110.1k views   •   1 week ago

119:18

Dr Mike Israetel: We ALREADY Know How to Build ASI... Human Death Only Has DECADES Left

Wes Roth

47.5k views   •   1 week ago

01:12

Dr Mike Israetel "Will superintelligence burn all humans in a reactor for energy"? #podcast

Wes Roth

9.6k views   •   1 week ago

00:58

Sakana AI and the "UNREASONABLE EFFECTIVE" Tiny Teacher AI Models

Wes Roth

17.3k views   •   1 week ago

26:20

When Will AI Models Blackmail You, and Why?

AI Explained

73.3k views   •   1 week ago

17:45

Sakana AI New Model Sparks a RL Revolution

Wes Roth

66.4k views   •   1 week ago

01:00

Sakana AI's New "Teacher Models" Might be the Revolutionary New RL Approach

Wes Roth

13.4k views   •   1 week ago

226:12

LIVE: chill Sunday stream

Wes Roth

5.1k views   •   1 week ago

11:51

AI Video Just Went TOO FAR...

Wes Roth

61.1k views   •   1 week ago

34:53

BIG AI News : OpenAI Exposed, Veo3 Beaten, AI Unplugs Itself, AGI Revealed, And More.

TheAIGRID

37.9k views   •   1 week ago

69:33

ex-Google Director Just Revealed What's Coming Next...

Wes Roth

24.9k views   •   2 weeks ago

51:33

OpenAI's GPT-5 Is Coming But Sam Altman Won't Stop Throwing Shade

AI For Humans

18.4k views   •   2 weeks ago

12:26

How to use Openrouter (Access Every LLM At Once)

TheAIGRID

5.8k views   •   2 weeks ago

15:02

AI's STUNNING Covert Ops: LLMs Complete Hidden Objectives in Plain Sight

Wes Roth

34.1k views   •   2 weeks ago

00:57

OpenAI Drops Hints About GPT-5 Release

AI For Humans

2.3k views   •   2 weeks ago

12:17

AI Gets WEIRD: LLMs learn reasoning solely by their own internal "sense of confidence"

Wes Roth

37.5k views   •   2 weeks ago

14:45

Google Whisk Tutorial (How to use Google Whisk)

TheAIGRID

13.2k views   •   2 weeks ago

05:56

Vibe Versioning - Iterate UI in Cursor 10x faster

AI Jason

19.3k views   •   2 weeks ago

01:22

Would You Let This Robot In Your House?

AI For Humans

3.7k views   •   2 weeks ago

52:13

MAJOR AI News : Sam Altman Reveals Perfect AI, GPT-5 Details, Apple Stuns AI Industry, and more

TheAIGRID

47.6k views   •   3 weeks ago

14:01

Apple’s ‘AI Can’t Reason’ Claim Seen By 13M+, What You Need to Know

AI Explained

95.4k views   •   3 weeks ago

45:51

OpenAI Preps To Blow Past AGI Straight to Super Intelligence

AI For Humans

18.2k views   •   3 weeks ago

22:02

Build the next Billion $ Agent 🚀

AI Jason

15.3k views   •   3 weeks ago

29:19

Apple DROPS AI BOMBSHELL: LLMS CANNOT Reason

TheAIGRID

104.7k views   •   3 weeks ago

16:50

AI Accelerates: New Gemini Model + AI Unemployment Stories Analysed

AI Explained

93.9k views   •   3 weeks ago

18:10

Google Flow Tutorial - How To Use Googles Flow (Complete Guide)

TheAIGRID

28.0k views   •   3 weeks ago

52:41

OpenAI’s GPT-5 is Coming and Hot AI Summer is Here

AI For Humans

25.7k views   •   4 weeks ago

16:35

How To Make Viral AI Vlogs - Viral AI Tiktok Vlogs Tutorial (Veo-3)

TheAIGRID

139.8k views   •   4 weeks ago

10:14

Veo 3 Tutorial - How To Use Googles Veo 3 (Complete Guide)

TheAIGRID

88.5k views   •   1 month ago

02:47

Is GPT-5 Coming Next Month? Find Out What Might Happen!

AI For Humans

8.1k views   •   1 month ago

36:40

The AI Future Nobody Wants To Talk About

TheAIGRID

42.8k views   •   1 month ago

28:44

OpenAI's New Device Will Change AI Forever (OpenAI's IO Device Revealed)

TheAIGRID

207.0k views   •   1 month ago

00:49

Will AI take your job?!

AI For Humans

3.0k views   •   1 month ago

56:57

Anthropic's CEO Says AI Will Take 50% of Jobs. Now What?

AI For Humans

31.0k views   •   1 month ago

15:04

Elon Musk's Stunning 2026 AI Predictions

TheAIGRID

17.7k views   •   1 month ago

08:48

Chinas New AI Drone Swarm Is Concerning (Jiutian SS-UAV)

TheAIGRID

11.7k views   •   1 month ago

13:09

Is VEO 3 really the death of human creativity?

AI For Humans

9.4k views   •   1 month ago

30:07

10 BIG Problems With Generative AI.

TheAIGRID

18.8k views   •   1 month ago

08:41

Googles New AI Glasses Are The Future Of AI (Android XR Update)

TheAIGRID

64.9k views   •   1 month ago

03:35

10x better UI design for vibe coders - Use v0 directly in Cursor

AI Jason

44.2k views   •   1 month ago

12:18

Sam Altman's WARNING To The Government On AI

TheAIGRID

33.7k views   •   1 month ago

05:59

How To Use OndemandAI (New AI Agent Platform)

TheAIGRID

3.5k views   •   1 month ago

28:11

Claude 4 Is So GOOD Its Scary... (Be Careful...)

TheAIGRID

24.3k views   •   1 month ago

19:05

Claude 4: Full 120 Page Breakdown … Is it the Best New Model?

AI Explained

96.6k views   •   1 month ago

56:15

Google Went AI Crazy and VEO 3 Is Just the Start

AI For Humans

19.9k views   •   1 month ago

04:25

How to make accurate UI Tweak in Cursor with Stagewise

AI Jason

20.9k views   •   1 month ago

49:45

Google Just WON The A.I Race.. (Wow)

TheAIGRID

483.3k views   •   1 month ago

17:08

Google Takes No Prisoners Amid Torrent of AI Announcements

AI Explained

98.7k views   •   1 month ago

02:10

VEO 3 is actually insane. Best AI video + audio AI tool yet.

AI For Humans

30.6k views   •   1 month ago

14:02

Build MCP business for vibe coder

AI Jason

9.2k views   •   1 month ago

09:11

Sam Altman And Elon Musk Just Revealed Their Next AI Models...

TheAIGRID

31.4k views   •   1 month ago

01:59

Will this be the biggest AI News week to date?!?

AI For Humans

1.9k views   •   1 month ago

17:42

AI Improves at Self-improving

AI Explained

79.4k views   •   1 month ago

09:27

Sam Altman Just Revealed Whats Next For A.I In 2026,2027 and the Future

TheAIGRID

39.2k views   •   1 month ago

31:10

Big AI News: Claude 4 Details, GPT-5 Details, Googles New Video And Image Models, Robots and more...

TheAIGRID

40.1k views   •   1 month ago

48:59

Google's New AI Agent Improves Itself. But Can It Stop AI Babies?

AI For Humans

14.8k views   •   1 month ago

00:50

Can You Spot The Weird Thing On The Centaur?

AI For Humans

1.6k views   •   1 month ago

27:44

2026 AI : 10 Things Coming In 2026 (A.I In 2026 Major Predictions)

TheAIGRID

110.2k views   •   1 month ago

13:28

GPT 5 News - Everything We Know So Far

TheAIGRID

39.3k views   •   1 month ago

53:29

OpenAI Just Went Global & Billionaires Are Building Bunkers

AI For Humans

22.4k views   •   1 month ago

01:19

Can Google Gemini Make Coding Easy for Everyone?

AI For Humans

1.4k views   •   1 month ago

08:10

Google Just Built The Worlds Smartest AI...(Wow)

TheAIGRID

29.4k views   •   1 month ago

11:44

Cursor + Browser control = Self improving coding agent

AI Jason

27.4k views   •   1 month ago

01:00

Bosses should worry about Duolingo’s AI memo 🤖👀📥 #ai #ainews #work

AI For Humans

1.6k views   •   1 month ago

01:25

Did Meta AI Just Get Sad On Camera? #ai #ainews #podcast

AI For Humans

1.7k views   •   1 month ago

57:03

Meta Just Launched an AI Social Network. It’s Real Weird.

AI For Humans

12.9k views   •   2 months ago

34:24

"OpenAI is Not God” - The DeepSeek Documentary on Liang Wenfeng, R1 and What's Next

AI Explained

101.2k views   •   2 months ago

14:34

o3 breaks (some) records, but AI becomes pay-to-win

AI Explained

60.5k views   •   2 months ago

49:20

Google Says We’re Not Ready for AGI. They’re Probably Right.

AI For Humans

17.5k views   •   2 months ago

19:04

How I reduced 90% errors for my Cursor (Part 2)

AI Jason

50.5k views   •   2 months ago

52:21

OpenAI’s o3 Is Here. It’s Smarter Than You. And It Has Eyes.

AI For Humans

16.6k views   •   2 months ago

14:25

o3 and o4-mini - they’re great, but easy to over-hype

AI Explained

93.5k views   •   2 months ago

20:10

‘Speaking Dolphin’ to AI Data Dominance, 4.1 + Kling 2.0: 7 Updates Critically Analysed

AI Explained

58.0k views   •   2 months ago

62:48

Google’s Gemini 2.5 Pro (with Deep Research) Might Be the New AI King

AI For Humans

10.4k views   •   2 months ago

15:30

How I reduced 90% errors for my Cursor (+ any other AI IDE)

AI Jason

258.5k views   •   2 months ago

23:52

AI CEO: ‘Stock Crash Could Stop AI Progress’, Llama 4 Anti-climax + ‘Superintelligence in 2027’ ...

AI Explained

72.5k views   •   2 months ago

52:46

OpenAI’s 4o Image Gen Melted Servers, Got Nerfed… and Raised $40 Billion

AI For Humans

11.4k views   •   3 months ago

01:08

ChatGPT 4o Image Gen is mind blowing 😳🤯🤖#ai #openai #aitools

AI For Humans

3.4k views   •   3 months ago

21:22

Gemini 2.5 Pro - It’s a Darn Smart Chatbot … (New Simple High Score)

AI Explained

108.6k views   •   3 months ago

13:19

Don't do RAG - This method is way faster & accurate...

AI Jason

126.4k views   •   3 months ago

13:48

Did AI Just Get Commoditized? Gemini 2.5, New DeepSeek V3, & Microsoft vs OpenAI

AI Explained

135.9k views   •   3 months ago

11:16

OpenAI’s New ImageGen is Unexpectedly Epic … (ft. Reve, Imagen 3, Midjourney etc)

AI Explained

93.5k views   •   3 months ago

09:14

Claude Designer is insane...Ultimate vibe coding UI workflow

AI Jason

218.6k views   •   3 months ago

10:09

Gemini 2.0 blew me away - The future of Multimodal Model

AI Jason

16.4k views   •   3 months ago

12:59

Manus AI - The Calm Before the Hypestorm … (vs Deep Research + Grok 3)

AI Explained

117.4k views   •   3 months ago

13:07

MCP = Next Big Opportunity? EASIST way to build your own MCP business

AI Jason

83.7k views   •   3 months ago

25:06

GPT 4.5 - not so much wow

AI Explained

109.9k views   •   4 months ago

131:12

How I use LLMs

Andrej Karpathy

1.5M views   •   4 months ago

27:40

Claude 3.7 is More Significant than its Name Implies (ft DeepSeek R2 + GPT 4.5 coming soon)

AI Explained

135.4k views   •   4 months ago

13:17

Those MCP totally 10x my Cursor workflow…

AI Jason

204.2k views   •   4 months ago

22:18

AGI: (gets close), Humans: ‘Who Gets to Own it?’

AI Explained

111.4k views   •   4 months ago

211:24

Deep Dive into LLMs like ChatGPT

Andrej Karpathy

2.9M views   •   4 months ago

20:35

The ONLY way to run your own Deepseek on mobile...

AI Jason

15.2k views   •   4 months ago

18:33

Deep Research by OpenAI - The Ups and Downs vs DeepSeek R1 Search + Gemini Deep Research

AI Explained

123.0k views   •   5 months ago

08:40

Yep, o3-mini is WORTH the money - Build your own reasoning agent

AI Jason

18.2k views   •   5 months ago

15:22

o3-mini and the “AI War”

AI Explained

107.7k views   •   5 months ago

23:10

Nothing Much Happens in AI, Then Everything Does All At Once

AI Explained

183.0k views   •   5 months ago

16:12

Deepseek R1 - The Era of Reasoning models

AI Jason

51.6k views   •   5 months ago

13:12

Altman Expects a ‘Fast Take-off’, ‘Super-Agent’ Debuting Soon and DeepSeek R1 Out

AI Explained

106.1k views   •   5 months ago

28:08

From $0 to $4m with just 2 people (ComfyUI Crash-course for E-commerce)

AI Jason

49.9k views   •   5 months ago

04:41

Easiest way to build fancy UI with Cursor/Windsurf/Bolt/Lovable

AI Jason

45.2k views   •   5 months ago

23:42

OpenAI Backtracks, Gunning for Superintelligence: Altman Brings His AGI Timeline Closer - '25 to '29

AI Explained

108.4k views   •   5 months ago

27:55

1000x Cursor workflow for building apps

AI Jason

71.3k views   •   5 months ago

22:21

o3 - wow

AI Explained

287.7k views   •   6 months ago

81:55

Founding fathers on today's America

Andrej Karpathy

34.7k views   •   6 months ago

24:57

EASIEST Way to Train LLM Train w/ unsloth (2x faster with 70% less GPU memory required)

AI Jason

105.8k views   •   6 months ago

13:41

Never Browse Alone? Gemini 2 Live and ChatGPT Vision

AI Explained

87.3k views   •   6 months ago

27:32

Better than Cursor? Future Agentic Coding available today

AI Jason

63.1k views   •   7 months ago

22:44

This is how I scrape 99% websites via LLM

AI Jason

334.8k views   •   8 months ago

42:52

Best Cursor Workflow that no one talks about...

AI Jason

154.2k views   •   9 months ago

42:31

How to use Cursor AI build & deploy production app in 20 mins

AI Jason

204.9k views   •   9 months ago

241:26

Let's reproduce GPT-2 (124M)

Andrej Karpathy

837.2k views   •   1 year ago

30:38

Expert AI Developer Explains NEW OpenAI Assistants API v2 Release

Morningside AI

13.8k views   •   1 year ago

133:35

Let's build the GPT Tokenizer

Andrej Karpathy

843.8k views   •   1 year ago

26:56

Expert AI Developer Explains What OpenAI's Q* Means for Businesses

Morningside AI

4.2k views   •   1 year ago

45:54

Voiceflow CEO Talks GPTs, Future of AI Agencies and Chatbot Builders (Full Interview)

Morningside AI

10.1k views   •   1 year ago

59:48

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

2.9M views   •   1 year ago

39:00

Expert AI Developer Explains What OpenAI 'GPTs' Mean For Businesses

Morningside AI

26.7k views   •   1 year ago

116:20

Let's build GPT: from scratch, in code, spelled out.

Andrej Karpathy

5.9M views   •   2 years ago

56:22

Building makemore Part 5: Building a WaveNet

Andrej Karpathy

228.4k views   •   2 years ago

115:24

Building makemore Part 4: Becoming a Backprop Ninja

Andrej Karpathy

276.0k views   •   2 years ago

115:58

Building makemore Part 3: Activations & Gradients, BatchNorm

Andrej Karpathy

395.2k views   •   2 years ago

75:40

Building makemore Part 2: MLP

Andrej Karpathy

434.7k views   •   2 years ago

23 Comments

@Veteran2002  1 week ago

Nice video here, from paying for day care and college, to managing mortgage payments. I'm approaching retirement yet inflation is getting worse. How can I generate more income to retire     See More ast $3m for long term care? I have about €750k in savings    See Less

@PauseAIAustral...  1 week ago

We need an international AI safety treaty     See Less

@DreckbobBratpf...  1 week ago

Next frontier is to get models to ignore stuff they learned when appropriate to not fall for these traps like the line length example xD


Also when will we see in-context learning     See More d RL? A model that is exceptional at that could be really cool.    See Less

@lev1ato  2 weeks ago

I have not accomplished anything in life yet and the AI will just come and take it all, even the hope that I had, what will I live off?     See Less

@AtPitou  2 weeks ago

5:25 Absolutely! I work as a Digital Transformation Supervisor, and I work with a Google workspace for education fundament     See More n, and thew Google Admin is as intuitive as Yamaha remote controller... So when I was stuck with MDM settings for edu fundamentals edition, Gemini confidently sent me on a 3+ hour fetch-quest of changing settings that ddidn't have anything to do with mobile devices, and chasing settings that didn't exist. Ib felt so cheated after I realized it was all a hallucination of a solution that doesn't exist, and I had to back-track the settings I had (excuse my French) fucked with blindly.    See Less

@Celeritate7  3 weeks ago

great video     See Less

@andrewdunbar82...  3 weeks ago

the dough-velopments in AI     See Less

@davem1658  3 weeks ago

Thanks for this video I find the quality great, and you provide an objective intellectual view on recent events.

In my previous comments in your past videos, I may have come across     See More ning when you dove into more complex topics that involve technical AI concepts. For someone like me, who often watches YouTube recreationally, it can be a bit much to shift gears and focus deeply just to follow along. However, I do understand this channel is AI Explained, so that approach makes sense.

If the occasional complicated idea shows up in one of your news videos, I think that’s totally fine. And if you decide to make separate videos clearly labelled as lessons for the more advanced concepts, that works well too.

In my view, being successful on Youtube means staying true to your identity while adapting about 50% to what the audience wants, but I think you're doing everything right. I hope you find this advice helpful.    See Less

@DustinRodrigue...  3 weeks ago

AI hallucinations, and the Apple 'LLMs can't reason' paper and such are real, and they will NOT be remedied by merely scaling up models, or with training models on more data. BUT     See More ge that needs to be made is simple, in concept at least. LLMs can not reliably and accurately model absolute binary logic. This causes them to get 'lost' in things which they believe are highly correllated with truth, but which do not stand up to the rigor of absolute logic. In logical arguments, there is no such thing as "0.9999 probability of truth". There is 1.0 probability of truth, or 0.0. With no value in between. Even the slightest error does not simply make the whole argument (no matter how large that argument might be, it could be thousands of pages) 'slightly false', it makes the whole thing completely false. This is a degree of rigor that most people aren't familiar with, and it doesn't get used continually for even philosophers who want to try to do so, but it is required. It is the only strategy which modulates the cognitive biases which lead people to accept "it feels really true, it aligns precisely with my intuition" as true. Right now, LLMs are operating on pure intuition. Even when they "think" with test-time inference compute scaling, all they are doing is feeling their way through arguments that FEEL true. They can't look at something, recognize that it feels deeply true, fits right in with everything else true, but has a tiny flaw that makes it completely false. Logic is not gradients. It is discrete, and absolute. Compare human history before and after The Enlightenment. The core of The Enlightenment was integrating exactly this kind of absolute logic into our wheelhouse of thought, despite it being alien to us. Despite it blowing apart things that felt as true as our need to breathe. That enabled us to formalize things, to take our human preconceptions and feelings out of it and process the world just based on rules and logic. And that enabled us to build computers that apply that absolute reasoning trillions of times faster and more efficiently than we ever could. Even enabling us to reproduce a model of our intuitive brain with AI. But the model needs access to that binary logic in addition to its fuzzy logic. Just like we did. That's not transparently obvious as to how to do it, but once it is done, expect a warp-speed leap in capabilities and especially efficiency. Continued scaling will just waste mountains of compute on emulating imperfectly the absolute rules that it could use when necessary.    See Less

@gmtree7830  3 weeks ago

we out here Phil, waiting for the o3-pro drop haha     See Less