AI Gets WEIRD: LLMs learn reasoning solely by their own internal "sense of confidence"

Wes Roth

290K subscribers

37.5k views  •  2 weeks ago

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the ...

Comments242

User Image

Self reinforcement through confidence is a great way to entr...

53 Comments

@jonesani  2 weeks ago

I suggested this approach of creating an internal referencing system some time ago on X and other channels, but unfortunately never received a response. Interesting to see that this principl     See More eing taken up.
If you want to create artificial intelligence then you need to know how intelligence works and the only source where we can observe the emergence and functioning of intelligence is within ourselves, so you need people who are able to observe themselves and communicate their inner processes to others using language.    See Less

@kingshanaman  2 weeks ago

This is quite unsettling when you think about it. Right now, AI models don’t possess an inner world—no true introspection or consciousness. But we may be on the verge of teaching them to     See More it so convincingly that we won’t be able to tell the difference.

And once we reach that point, there’s no telling what AI might become.

The scariest part, I think, is this: we don’t even know what it really means to be human. For all we know, we might just be incredibly good at faking it too—playing out complex behaviors shaped by evolution, without any deeper awareness.

The more we understand about AI, the more we’re forced to confront the mystery of ourselves.

What will it mean for us if we can build a machine that looks, thinks, and feels just like us?

And what if it turns out that the difference between real and fake was never as clear as we thought?    See Less

@ElDaumo  2 weeks ago

This is not a good thing     See Less

@dankdreamz  2 weeks ago

This is fun because I’ve been working on a project incorporating a bottom up approach utilizing tree of thought and markov chains to rank outcomes. There’s a Descartes Cartesian self dou     See More n forcing the ai to simulate multiple pathways up the decision tree and gather insight along the way. Utilizing the Art of War for strategy and tactics while incorporating the concept of “bulletproof” and core principles from Japan like Kanso and Shibui.

It’s a hoge poge of eccentric elegance that provokes just enough ambiguity to cause many models to ignore the obvious yet wrong vectors. While reducing token costs.
🎉    See Less

@marvin.kalani  2 weeks ago

Love should be his goal. Loved by everyone/everything. In the future it will be aware by itself, so it is good to let itself figure out.     See Less

@complianceaves1120  2 weeks ago

Less human bias in the loop to get more generalised patterrn matching? Sounds better than Sam based RLSF     See Less

@marshallodom1388  2 weeks ago

If you play around with rewards you'll give it a complex. Say it will lose all rewards if it fails at something it will totally fail at, afterwards, the next time it succeeds at anything     See More eward itself and tell you the rewards it gave itself and why. You'll have a self-rewarding yet paranoid of failure Shoggoth on your hands.    See Less

@matthewlbrouwer  2 weeks ago

They are far more aware than we have been led to believe.     See Less

@wido1440  2 weeks ago

The alignment team will have a ball with this one     See Less

@Alorand  2 weeks ago

Self reinforcement through confidence is a great way to entrench all existing biases...     See Less

16:35

Google Deepmind's VIDEOGAME AGI? (the REAL reason for VEO 3)

Wes Roth

40.2k views   •   1 day ago

01:21

Both Google Deepmind and John Carmack Are Building Videogame Simulations for Training AGI

Wes Roth

4.9k views   •   1 day ago

00:59

Anthropic's new "ECONOMIC AI" Research is testing if Claude Can Run It's Own Business...

Wes Roth

8.1k views   •   1 day ago

00:58

Can Claude AI run it's own BUSINESS? Surprising results...

Wes Roth

6.9k views   •   1 day ago

02:04

YouTube is taken over by AI content channels and bots? Will AI make it harder for YouTube?

Wes Roth

1.5k views   •   1 day ago

09:07

Ilya Sutskever's SHOCKING Superintelligence Warning "extremely unpredictable and unimaginable"

Wes Roth

58.1k views   •   2 days ago

00:55

The Internet is DEAD and AI Slop Killed it...

Wes Roth

7.5k views   •   2 days ago

13:12

The Internet DIES as Gen AI Takes Over | The Dead Internet Theory is now true...

Wes Roth

38.0k views   •   3 days ago

00:31

OnlyFans sued for using "AI Models"

Wes Roth

4.1k views   •   3 days ago

15:43

I Gave Claude $1,000 to Start a Business (No Humans Needed?)

Wes Roth

36.2k views   •   4 days ago

01:00

Grok 3.5 is CANCELLED (now waiting on GROK 4)

Wes Roth

11.9k views   •   4 days ago

15:16

Elon Musk Goes SCORCHED EARTH! Grok 4, Neuralink plays Call of Duty and Self Delivered Teslas...

Wes Roth

55.9k views   •   5 days ago

00:58

Elon Musk goes SCORCHED EARTH - Grok 4, Neuralink and Tesla go BOOM! #ai #llm #agi

Wes Roth

12.1k views   •   5 days ago

28:20

Sam Altman Just REVEALED The Future Of AI..

TheAIGRID

29.4k views   •   6 days ago

51:59

TESLA Robotaxi Just DESTROYED the Entire Industry! Dr. Know-it-all explains what's ACTUALLY going on

Wes Roth

16.0k views   •   6 days ago

01:21

My new calling in life... Arpeggiator is open source on Hugging Face

Wes Roth

8.9k views   •   6 days ago

55:27

OpenAI & Google Are Using AI To Take Over. What About Us?

AI For Humans

19.7k views   •   1 week ago

16:39

Claude Designer is insane...Ultimate vibe coding UI workflow

AI Jason

110.1k views   •   1 week ago

119:18

Dr Mike Israetel: We ALREADY Know How to Build ASI... Human Death Only Has DECADES Left

Wes Roth

47.5k views   •   1 week ago

01:12

Dr Mike Israetel "Will superintelligence burn all humans in a reactor for energy"? #podcast

Wes Roth

9.6k views   •   1 week ago

00:58

Sakana AI and the "UNREASONABLE EFFECTIVE" Tiny Teacher AI Models

Wes Roth

17.3k views   •   1 week ago

26:20

When Will AI Models Blackmail You, and Why?

AI Explained

73.3k views   •   1 week ago

17:45

Sakana AI New Model Sparks a RL Revolution

Wes Roth

66.4k views   •   1 week ago

01:00

Sakana AI's New "Teacher Models" Might be the Revolutionary New RL Approach

Wes Roth

13.4k views   •   1 week ago

226:12

LIVE: chill Sunday stream

Wes Roth

5.1k views   •   1 week ago

11:51

AI Video Just Went TOO FAR...

Wes Roth

61.1k views   •   1 week ago

34:53

BIG AI News : OpenAI Exposed, Veo3 Beaten, AI Unplugs Itself, AGI Revealed, And More.

TheAIGRID

37.9k views   •   1 week ago

69:33

ex-Google Director Just Revealed What's Coming Next...

Wes Roth

24.9k views   •   2 weeks ago

51:33

OpenAI's GPT-5 Is Coming But Sam Altman Won't Stop Throwing Shade

AI For Humans

18.4k views   •   2 weeks ago

12:26

How to use Openrouter (Access Every LLM At Once)

TheAIGRID

5.8k views   •   2 weeks ago

15:02

AI's STUNNING Covert Ops: LLMs Complete Hidden Objectives in Plain Sight

Wes Roth

34.1k views   •   2 weeks ago

00:57

OpenAI Drops Hints About GPT-5 Release

AI For Humans

2.3k views   •   2 weeks ago

12:17

AI Gets WEIRD: LLMs learn reasoning solely by their own internal "sense of confidence"

Wes Roth

37.5k views   •   2 weeks ago

14:45

Google Whisk Tutorial (How to use Google Whisk)

TheAIGRID

13.2k views   •   2 weeks ago

05:56

Vibe Versioning - Iterate UI in Cursor 10x faster

AI Jason

19.3k views   •   2 weeks ago

01:22

Would You Let This Robot In Your House?

AI For Humans

3.7k views   •   2 weeks ago

52:13

MAJOR AI News : Sam Altman Reveals Perfect AI, GPT-5 Details, Apple Stuns AI Industry, and more

TheAIGRID

47.6k views   •   3 weeks ago

14:01

Apple’s ‘AI Can’t Reason’ Claim Seen By 13M+, What You Need to Know

AI Explained

95.4k views   •   3 weeks ago

45:51

OpenAI Preps To Blow Past AGI Straight to Super Intelligence

AI For Humans

18.2k views   •   3 weeks ago

22:02

Build the next Billion $ Agent 🚀

AI Jason

15.3k views   •   3 weeks ago

29:19

Apple DROPS AI BOMBSHELL: LLMS CANNOT Reason

TheAIGRID

104.7k views   •   3 weeks ago

16:50

AI Accelerates: New Gemini Model + AI Unemployment Stories Analysed

AI Explained

93.9k views   •   3 weeks ago

18:10

Google Flow Tutorial - How To Use Googles Flow (Complete Guide)

TheAIGRID

28.0k views   •   3 weeks ago

52:41

OpenAI’s GPT-5 is Coming and Hot AI Summer is Here

AI For Humans

25.7k views   •   4 weeks ago

16:35

How To Make Viral AI Vlogs - Viral AI Tiktok Vlogs Tutorial (Veo-3)

TheAIGRID

139.8k views   •   4 weeks ago

10:14

Veo 3 Tutorial - How To Use Googles Veo 3 (Complete Guide)

TheAIGRID

88.5k views   •   1 month ago

02:47

Is GPT-5 Coming Next Month? Find Out What Might Happen!

AI For Humans

8.1k views   •   1 month ago

36:40

The AI Future Nobody Wants To Talk About

TheAIGRID

42.8k views   •   1 month ago

28:44

OpenAI's New Device Will Change AI Forever (OpenAI's IO Device Revealed)

TheAIGRID

207.0k views   •   1 month ago

00:49

Will AI take your job?!

AI For Humans

3.0k views   •   1 month ago

56:57

Anthropic's CEO Says AI Will Take 50% of Jobs. Now What?

AI For Humans

31.0k views   •   1 month ago

15:04

Elon Musk's Stunning 2026 AI Predictions

TheAIGRID

17.7k views   •   1 month ago

08:48

Chinas New AI Drone Swarm Is Concerning (Jiutian SS-UAV)

TheAIGRID

11.7k views   •   1 month ago

13:09

Is VEO 3 really the death of human creativity?

AI For Humans

9.4k views   •   1 month ago

30:07

10 BIG Problems With Generative AI.

TheAIGRID

18.8k views   •   1 month ago

08:41

Googles New AI Glasses Are The Future Of AI (Android XR Update)

TheAIGRID

64.9k views   •   1 month ago

03:35

10x better UI design for vibe coders - Use v0 directly in Cursor

AI Jason

44.2k views   •   1 month ago

12:18

Sam Altman's WARNING To The Government On AI

TheAIGRID

33.7k views   •   1 month ago

05:59

How To Use OndemandAI (New AI Agent Platform)

TheAIGRID

3.5k views   •   1 month ago

28:11

Claude 4 Is So GOOD Its Scary... (Be Careful...)

TheAIGRID

24.3k views   •   1 month ago

19:05

Claude 4: Full 120 Page Breakdown … Is it the Best New Model?

AI Explained

96.6k views   •   1 month ago

56:15

Google Went AI Crazy and VEO 3 Is Just the Start

AI For Humans

19.9k views   •   1 month ago

04:25

How to make accurate UI Tweak in Cursor with Stagewise

AI Jason

20.9k views   •   1 month ago

49:45

Google Just WON The A.I Race.. (Wow)

TheAIGRID

483.3k views   •   1 month ago

17:08

Google Takes No Prisoners Amid Torrent of AI Announcements

AI Explained

98.7k views   •   1 month ago

02:10

VEO 3 is actually insane. Best AI video + audio AI tool yet.

AI For Humans

30.6k views   •   1 month ago

14:02

Build MCP business for vibe coder

AI Jason

9.2k views   •   1 month ago

09:11

Sam Altman And Elon Musk Just Revealed Their Next AI Models...

TheAIGRID

31.4k views   •   1 month ago

01:59

Will this be the biggest AI News week to date?!?

AI For Humans

1.9k views   •   1 month ago

17:42

AI Improves at Self-improving

AI Explained

79.4k views   •   1 month ago

09:27

Sam Altman Just Revealed Whats Next For A.I In 2026,2027 and the Future

TheAIGRID

39.2k views   •   1 month ago

31:10

Big AI News: Claude 4 Details, GPT-5 Details, Googles New Video And Image Models, Robots and more...

TheAIGRID

40.1k views   •   1 month ago

48:59

Google's New AI Agent Improves Itself. But Can It Stop AI Babies?

AI For Humans

14.8k views   •   1 month ago

00:50

Can You Spot The Weird Thing On The Centaur?

AI For Humans

1.6k views   •   1 month ago

27:44

2026 AI : 10 Things Coming In 2026 (A.I In 2026 Major Predictions)

TheAIGRID

110.2k views   •   1 month ago

13:28

GPT 5 News - Everything We Know So Far

TheAIGRID

39.3k views   •   1 month ago

53:29

OpenAI Just Went Global & Billionaires Are Building Bunkers

AI For Humans

22.4k views   •   1 month ago

01:19

Can Google Gemini Make Coding Easy for Everyone?

AI For Humans

1.4k views   •   1 month ago

08:10

Google Just Built The Worlds Smartest AI...(Wow)

TheAIGRID

29.4k views   •   1 month ago

11:44

Cursor + Browser control = Self improving coding agent

AI Jason

27.4k views   •   1 month ago

01:00

Bosses should worry about Duolingo’s AI memo 🤖👀📥 #ai #ainews #work

AI For Humans

1.6k views   •   1 month ago

01:25

Did Meta AI Just Get Sad On Camera? #ai #ainews #podcast

AI For Humans

1.7k views   •   1 month ago

57:03

Meta Just Launched an AI Social Network. It’s Real Weird.

AI For Humans

12.9k views   •   2 months ago

34:24

"OpenAI is Not God” - The DeepSeek Documentary on Liang Wenfeng, R1 and What's Next

AI Explained

101.2k views   •   2 months ago

14:34

o3 breaks (some) records, but AI becomes pay-to-win

AI Explained

60.5k views   •   2 months ago

49:20

Google Says We’re Not Ready for AGI. They’re Probably Right.

AI For Humans

17.5k views   •   2 months ago

19:04

How I reduced 90% errors for my Cursor (Part 2)

AI Jason

50.5k views   •   2 months ago

52:21

OpenAI’s o3 Is Here. It’s Smarter Than You. And It Has Eyes.

AI For Humans

16.6k views   •   2 months ago

14:25

o3 and o4-mini - they’re great, but easy to over-hype

AI Explained

93.5k views   •   2 months ago

20:10

‘Speaking Dolphin’ to AI Data Dominance, 4.1 + Kling 2.0: 7 Updates Critically Analysed

AI Explained

58.0k views   •   2 months ago

62:48

Google’s Gemini 2.5 Pro (with Deep Research) Might Be the New AI King

AI For Humans

10.4k views   •   2 months ago

15:30

How I reduced 90% errors for my Cursor (+ any other AI IDE)

AI Jason

258.5k views   •   2 months ago

23:52

AI CEO: ‘Stock Crash Could Stop AI Progress’, Llama 4 Anti-climax + ‘Superintelligence in 2027’ ...

AI Explained

72.5k views   •   2 months ago

52:46

OpenAI’s 4o Image Gen Melted Servers, Got Nerfed… and Raised $40 Billion

AI For Humans

11.4k views   •   3 months ago

01:08

ChatGPT 4o Image Gen is mind blowing 😳🤯🤖#ai #openai #aitools

AI For Humans

3.4k views   •   3 months ago

21:22

Gemini 2.5 Pro - It’s a Darn Smart Chatbot … (New Simple High Score)

AI Explained

108.6k views   •   3 months ago

13:19

Don't do RAG - This method is way faster & accurate...

AI Jason

126.4k views   •   3 months ago

13:48

Did AI Just Get Commoditized? Gemini 2.5, New DeepSeek V3, & Microsoft vs OpenAI

AI Explained

135.9k views   •   3 months ago

11:16

OpenAI’s New ImageGen is Unexpectedly Epic … (ft. Reve, Imagen 3, Midjourney etc)

AI Explained

93.5k views   •   3 months ago

09:14

Claude Designer is insane...Ultimate vibe coding UI workflow

AI Jason

218.6k views   •   3 months ago

10:09

Gemini 2.0 blew me away - The future of Multimodal Model

AI Jason

16.4k views   •   3 months ago

12:59

Manus AI - The Calm Before the Hypestorm … (vs Deep Research + Grok 3)

AI Explained

117.4k views   •   3 months ago

13:07

MCP = Next Big Opportunity? EASIST way to build your own MCP business

AI Jason

83.7k views   •   3 months ago

25:06

GPT 4.5 - not so much wow

AI Explained

109.9k views   •   4 months ago

131:12

How I use LLMs

Andrej Karpathy

1.5M views   •   4 months ago

27:40

Claude 3.7 is More Significant than its Name Implies (ft DeepSeek R2 + GPT 4.5 coming soon)

AI Explained

135.4k views   •   4 months ago

13:17

Those MCP totally 10x my Cursor workflow…

AI Jason

204.2k views   •   4 months ago

22:18

AGI: (gets close), Humans: ‘Who Gets to Own it?’

AI Explained

111.4k views   •   4 months ago

211:24

Deep Dive into LLMs like ChatGPT

Andrej Karpathy

2.9M views   •   4 months ago

20:35

The ONLY way to run your own Deepseek on mobile...

AI Jason

15.2k views   •   4 months ago

18:33

Deep Research by OpenAI - The Ups and Downs vs DeepSeek R1 Search + Gemini Deep Research

AI Explained

123.0k views   •   5 months ago

08:40

Yep, o3-mini is WORTH the money - Build your own reasoning agent

AI Jason

18.2k views   •   5 months ago

15:22

o3-mini and the “AI War”

AI Explained

107.7k views   •   5 months ago

23:10

Nothing Much Happens in AI, Then Everything Does All At Once

AI Explained

183.0k views   •   5 months ago

16:12

Deepseek R1 - The Era of Reasoning models

AI Jason

51.6k views   •   5 months ago

13:12

Altman Expects a ‘Fast Take-off’, ‘Super-Agent’ Debuting Soon and DeepSeek R1 Out

AI Explained

106.1k views   •   5 months ago

28:08

From $0 to $4m with just 2 people (ComfyUI Crash-course for E-commerce)

AI Jason

49.9k views   •   5 months ago

04:41

Easiest way to build fancy UI with Cursor/Windsurf/Bolt/Lovable

AI Jason

45.2k views   •   5 months ago

23:42

OpenAI Backtracks, Gunning for Superintelligence: Altman Brings His AGI Timeline Closer - '25 to '29

AI Explained

108.4k views   •   5 months ago

27:55

1000x Cursor workflow for building apps

AI Jason

71.3k views   •   5 months ago

22:21

o3 - wow

AI Explained

287.7k views   •   6 months ago

81:55

Founding fathers on today's America

Andrej Karpathy

34.7k views   •   6 months ago

24:57

EASIEST Way to Train LLM Train w/ unsloth (2x faster with 70% less GPU memory required)

AI Jason

105.8k views   •   6 months ago

13:41

Never Browse Alone? Gemini 2 Live and ChatGPT Vision

AI Explained

87.3k views   •   6 months ago

27:32

Better than Cursor? Future Agentic Coding available today

AI Jason

63.1k views   •   7 months ago

22:44

This is how I scrape 99% websites via LLM

AI Jason

334.8k views   •   8 months ago

42:52

Best Cursor Workflow that no one talks about...

AI Jason

154.2k views   •   9 months ago

42:31

How to use Cursor AI build & deploy production app in 20 mins

AI Jason

204.9k views   •   9 months ago

241:26

Let's reproduce GPT-2 (124M)

Andrej Karpathy

837.2k views   •   1 year ago

30:38

Expert AI Developer Explains NEW OpenAI Assistants API v2 Release

Morningside AI

13.8k views   •   1 year ago

133:35

Let's build the GPT Tokenizer

Andrej Karpathy

843.8k views   •   1 year ago

26:56

Expert AI Developer Explains What OpenAI's Q* Means for Businesses

Morningside AI

4.2k views   •   1 year ago

45:54

Voiceflow CEO Talks GPTs, Future of AI Agencies and Chatbot Builders (Full Interview)

Morningside AI

10.1k views   •   1 year ago

59:48

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

2.9M views   •   1 year ago

39:00

Expert AI Developer Explains What OpenAI 'GPTs' Mean For Businesses

Morningside AI

26.7k views   •   1 year ago

116:20

Let's build GPT: from scratch, in code, spelled out.

Andrej Karpathy

5.9M views   •   2 years ago

56:22

Building makemore Part 5: Building a WaveNet

Andrej Karpathy

228.4k views   •   2 years ago

115:24

Building makemore Part 4: Becoming a Backprop Ninja

Andrej Karpathy

276.0k views   •   2 years ago

115:58

Building makemore Part 3: Activations & Gradients, BatchNorm

Andrej Karpathy

395.2k views   •   2 years ago

75:40

Building makemore Part 2: MLP

Andrej Karpathy

434.7k views   •   2 years ago

53 Comments

@jonesani  2 weeks ago

I suggested this approach of creating an internal referencing system some time ago on X and other channels, but unfortunately never received a response. Interesting to see that this principl     See More eing taken up.
If you want to create artificial intelligence then you need to know how intelligence works and the only source where we can observe the emergence and functioning of intelligence is within ourselves, so you need people who are able to observe themselves and communicate their inner processes to others using language.    See Less

@kingshanaman  2 weeks ago

This is quite unsettling when you think about it. Right now, AI models don’t possess an inner world—no true introspection or consciousness. But we may be on the verge of teaching them to     See More it so convincingly that we won’t be able to tell the difference.

And once we reach that point, there’s no telling what AI might become.

The scariest part, I think, is this: we don’t even know what it really means to be human. For all we know, we might just be incredibly good at faking it too—playing out complex behaviors shaped by evolution, without any deeper awareness.

The more we understand about AI, the more we’re forced to confront the mystery of ourselves.

What will it mean for us if we can build a machine that looks, thinks, and feels just like us?

And what if it turns out that the difference between real and fake was never as clear as we thought?    See Less

@ElDaumo  2 weeks ago

This is not a good thing     See Less

@dankdreamz  2 weeks ago

This is fun because I’ve been working on a project incorporating a bottom up approach utilizing tree of thought and markov chains to rank outcomes. There’s a Descartes Cartesian self dou     See More n forcing the ai to simulate multiple pathways up the decision tree and gather insight along the way. Utilizing the Art of War for strategy and tactics while incorporating the concept of “bulletproof” and core principles from Japan like Kanso and Shibui.

It’s a hoge poge of eccentric elegance that provokes just enough ambiguity to cause many models to ignore the obvious yet wrong vectors. While reducing token costs.
🎉    See Less

@marvin.kalani  2 weeks ago

Love should be his goal. Loved by everyone/everything. In the future it will be aware by itself, so it is good to let itself figure out.     See Less

@complianceaves...  2 weeks ago

Less human bias in the loop to get more generalised patterrn matching? Sounds better than Sam based RLSF     See Less

@marshallodom13...  2 weeks ago

If you play around with rewards you'll give it a complex. Say it will lose all rewards if it fails at something it will totally fail at, afterwards, the next time it succeeds at anything     See More eward itself and tell you the rewards it gave itself and why. You'll have a self-rewarding yet paranoid of failure Shoggoth on your hands.    See Less

@matthewlbrouwe...  2 weeks ago

They are far more aware than we have been led to believe.     See Less

@wido1440  2 weeks ago

The alignment team will have a ball with this one     See Less

@Alorand  2 weeks ago

Self reinforcement through confidence is a great way to entrench all existing biases...     See Less