im ready when you are ~
well that couldve been weirder dear i say scarier
AI Explained
339K subscribersA new state of the art LLM (at least for creative writing and basic reasoning) but what lies behind the numbers that were put out?
100% spot on on reliability, that is always the one thing I...
20 Comments
TheAIGRID
2.7k views • 17 hours ago
TheAIGRID
7.9k views • 1 day ago
TheAIGRID
11.6k views • 2 days ago
Wes Roth
78.8k views • 3 days ago
TheAIGRID
4.9k views • 3 days ago
TheAIGRID
28.4k views • 4 days ago
Wes Roth
69.2k views • 4 days ago
TheAIGRID
22.4k views • 4 days ago
TheAIGRID
20.5k views • 5 days ago
Wes Roth
38.5k views • 5 days ago
Shelf will be hidden for 30 daysUndo
AI Explained
97.0k views • 6 days ago
Wes Roth
100.8k views • 6 days ago
TheAIGRID
61.9k views • 6 days ago
TheAIGRID
44.4k views • 1 week ago
AI For Humans
11.1k views • 1 week ago
TheAIGRID
18.9k views • 1 week ago
AI Jason
48.0k views • 1 week ago
Wes Roth
77.3k views • 1 week ago
TheAIGRID
16.2k views • 1 week ago
AI Explained
130.3k views • 1 week ago
AI Explained
89.6k views • 1 week ago
Wes Roth
23.8k views • 1 week ago
TheAIGRID
48.3k views • 1 week ago
TheAIGRID
33.7k views • 1 week ago
Wes Roth
75.1k views • 1 week ago
TheAIGRID
13.1k views • 1 week ago
AI For Humans
6.8k views • 2 weeks ago
TheAIGRID
10.2k views • 2 weeks ago
Wes Roth
58.4k views • 2 weeks ago
AI Jason
168.6k views • 2 weeks ago
TheAIGRID
48.4k views • 2 weeks ago
Wes Roth
47.4k views • 2 weeks ago
TheAIGRID
13.1k views • 2 weeks ago
AI For Humans
709 views • 2 weeks ago
AI Jason
15.2k views • 2 weeks ago
AI For Humans
243 views • 2 weeks ago
TheAIGRID
57.2k views • 2 weeks ago
Wes Roth
86.6k views • 2 weeks ago
TheAIGRID
56.7k views • 2 weeks ago
TheAIGRID
17.9k views • 2 weeks ago
Wes Roth
116.2k views • 2 weeks ago
TheAIGRID
35.1k views • 2 weeks ago
TheAIGRID
96.5k views • 3 weeks ago
AI Explained
113.9k views • 3 weeks ago
Wes Roth
78.3k views • 3 weeks ago
TheAIGRID
27.8k views • 3 weeks ago
AI Jason
70.9k views • 3 weeks ago
Wes Roth
153.2k views • 3 weeks ago
Wes Roth
17.5k views • 3 weeks ago
Wes Roth
404.0k views • 3 weeks ago
TheAIGRID
36.2k views • 3 weeks ago
TheAIGRID
47.0k views • 3 weeks ago
Wes Roth
214.3k views • 3 weeks ago
Wes Roth
72.9k views • 3 weeks ago
Wes Roth
79.3k views • 4 weeks ago
Wes Roth
68.4k views • 4 weeks ago
Wes Roth
143.2k views • 1 month ago
Wes Roth
125.4k views • 1 month ago
AI Explained
108.0k views • 1 month ago
Wes Roth
29.7k views • 1 month ago
Andrej Karpathy
1.1M views • 1 month ago
Wes Roth
6.2k views • 1 month ago
Wes Roth
36.3k views • 1 month ago
Wes Roth
28.4k views • 1 month ago
AI Explained
133.7k views • 1 month ago
AI Jason
150.8k views • 1 month ago
AI Explained
110.4k views • 1 month ago
Andrej Karpathy
2.1M views • 1 month ago
AI Jason
13.1k views • 1 month ago
AI Explained
122.0k views • 2 months ago
AI Jason
17.4k views • 2 months ago
AI Explained
107.3k views • 2 months ago
AI Explained
182.1k views • 2 months ago
AI Jason
50.6k views • 2 months ago
AI Explained
105.7k views • 2 months ago
AI Jason
36.6k views • 2 months ago
AI Jason
38.9k views • 2 months ago
AI Explained
108.0k views • 2 months ago
AI Jason
59.7k views • 2 months ago
AI For Humans
5.4k views • 3 months ago
AI Explained
286.0k views • 3 months ago
Andrej Karpathy
34.7k views • 3 months ago
AI Jason
73.6k views • 3 months ago
AI Explained
87.1k views • 3 months ago
AI Explained
74.8k views • 3 months ago
AI Explained
152.1k views • 3 months ago
AI For Humans
5.1k views • 3 months ago
AI Explained
116.8k views • 3 months ago
AI For Humans
1.0k views • 4 months ago
AI Explained
99.4k views • 4 months ago
AI Jason
60.5k views • 4 months ago
AI Explained
142.2k views • 4 months ago
AI Explained
112.2k views • 5 months ago
AI Jason
298.4k views • 5 months ago
AI Explained
88.6k views • 5 months ago
AI Explained
83.1k views • 5 months ago
AI Explained
166.7k views • 6 months ago
AI Jason
141.0k views • 6 months ago
AI Explained
100.6k views • 6 months ago
AI Explained
168.4k views • 6 months ago
AI Explained
198.5k views • 6 months ago
AI Jason
190.6k views • 6 months ago
AI Jason
30.1k views • 7 months ago
AI Jason
18.7k views • 7 months ago
AI For Humans
626 views • 7 months ago
AI Jason
123.9k views • 8 months ago
AI Jason
17.8k views • 8 months ago
AI Jason
16.4k views • 9 months ago
AI For Humans
1.8k views • 9 months ago
Andrej Karpathy
763.6k views • 9 months ago
AI Jason
19.5k views • 9 months ago
AI For Humans
5.6k views • 10 months ago
AI Explained
151.7k views • 10 months ago
AI Jason
104.0k views • 10 months ago
AI Explained
388.7k views • 10 months ago
AI For Humans
948 views • 10 months ago
AI Explained
129.2k views • 10 months ago
AI Explained
97.7k views • 11 months ago
AI For Humans
5.7k views • 11 months ago
AI Jason
608.3k views • 11 months ago
AI For Humans
667 views • 11 months ago
AI For Humans
3.5k views • 11 months ago
Morningside AI
13.7k views • 11 months ago
AI For Humans
781 views • 11 months ago
AI Explained
129.9k views • 11 months ago
AI For Humans
1.5k views • 11 months ago
AI Jason
60.6k views • 11 months ago
AI Explained
118.4k views • 11 months ago
AI For Humans
3.0k views • 11 months ago
AI For Humans
387 views • 11 months ago
AI For Humans
3.6k views • 11 months ago
AI Explained
118.3k views • 1 year ago
AI For Humans
2.3k views • 1 year ago
AI For Humans
1.7k views • 1 year ago
AI For Humans
339 views • 1 year ago
AI For Humans
2.6k views • 1 year ago
AI Explained
106.4k views • 1 year ago
AI Explained
131.0k views • 1 year ago
AI For Humans
1.5k views • 1 year ago
AI For Humans
1.4k views • 1 year ago
AI For Humans
1.8k views • 1 year ago
AI Explained
181.1k views • 1 year ago
AI Explained
151.1k views • 1 year ago
Andrej Karpathy
742.8k views • 1 year ago
AI Explained
241.8k views • 1 year ago
AI Explained
187.7k views • 1 year ago
AI Explained
161.6k views • 1 year ago
AI Explained
272.8k views • 1 year ago
AI Explained
96.8k views • 1 year ago
AI Explained
145.9k views • 1 year ago
AI Explained
133.4k views • 1 year ago
AI Explained
79.5k views • 1 year ago
AI Explained
84.1k views • 1 year ago
AI Explained
74.6k views • 1 year ago
AI Explained
144.9k views • 1 year ago
Morningside AI
4.2k views • 1 year ago
AI Explained
83.7k views • 1 year ago
Morningside AI
10.1k views • 1 year ago
AI Explained
229.6k views • 1 year ago
Andrej Karpathy
2.7M views • 1 year ago
AI Explained
112.8k views • 1 year ago
Morningside AI
26.6k views • 1 year ago
Andrej Karpathy
5.4M views • 2 years ago
Andrej Karpathy
213.4k views • 2 years ago
Andrej Karpathy
254.8k views • 2 years ago
Andrej Karpathy
363.4k views • 2 years ago
Andrej Karpathy
403.3k views • 2 years ago
20 Comments
100% spot on on reliability, that is always the one thing I focus on when people hype up AI. Yes, it's absolutely great BUT it will never be consistently useful as of now, and won't     See More
my biggest issues with claude atm are its only 1% of gpt-canvas limit usage
Its still impossible for it to keep folowing a story that evolves over 10 chapters
Shouldnt be to hard f     See More
Guys, guys … does it click on that "I‘m not a robot" checkbox or not? Can it solve captchas? Cause I struggle with them. Also wouldn’t it kinda set a bad precedent if the sma     See More
Here's a ChatGPT summary:
- The new Claude 3.5 Sonnet from Anthropic is a significant advancement, particularly in reasoning, coding, and visual processing abilities.
- The mod     See More
It could also be malaria. Or maybe dengue. What about the African illness that they came up with? What about Delta? Alpha? ABCD+? Or just... Occam's razor.
Love your videos and can't wait for the next one, super informative and good fact check
the new model truly sucks XD the accuracy went down big time, simple stuff .. make him loose his shit easily.. to a point where its not just visible, but its unusable..
im ready when you are ~
well that couldve been weirder dear i say scarier     See Less
100% spot on on reliability, that is always the one thing I focus on when people hype up AI. Yes, it's absolutely great BUT it will never be consistently useful as of now, and won't     See More ble to be left alone, because of the risks of minor or even major mistakes, especially as e.g. context goes up    See Less
my biggest issues with claude atm are its only 1% of gpt-canvas limit usage
Its still impossible for it to keep folowing a story that evolves over 10 chapters
Shouldnt be to hard f     See More eep a backtrack of red lines behind the scences how story progresses if i told it we write a story over many chapters...but nothing from that exist so far .    See Less
Guys, guys … does it click on that "I‘m not a robot" checkbox or not? Can it solve captchas? Cause I struggle with them. Also wouldn’t it kinda set a bad precedent if the sma     See More s start lying to the stupid programs to do their job?    See Less
The AI zoom call was the most soulless thing I’d ever seen     See Less
Here's a ChatGPT summary:
- The new Claude 3.5 Sonnet from Anthropic is a significant advancement, particularly in reasoning, coding, and visual processing abilities.
- The mod     See More bility to use a computer via an API is limited due to unreliability and inability to perform tasks like sending emails or making purchases.
- Claude 3.5 Sonnet has knowledge of world events up until April 2024.
- In the OS World benchmark, Claude 3.5 Sonnet achieved 22% accuracy compared to 72% by computer science majors.
- In the SWE Bench Software Engineering benchmark, Claude 3.5 Sonnet scored 49%, outperforming the 0.1 preview model.
- The new Claude 3.5 Sonnet performs better in challenging science questions, general knowledge, coding, mathematics, and visual question answering compared to its predecessor.
- The model's performance in creative writing is superior to the original Claude 3.5 Sonnet.
- In multilingual challenges, the new Claude 3.5 Sonnet is slightly worse than the previous version.
- The new model shows a reverse scaling law in reliability, where performance drops as the number of attempts increases.
- Claude 3.5 Sonnet is slightly worse at correctly refusing toxic requests and incorrectly refusing innocent requests compared to the previous model.
- The new model's performance in the retail and airline tasks is not outstanding, with a 46% success rate in airline tasks given one try.
- The Simple Bench test showed a significant improvement in the new Claude 3.5 Sonnet compared to the previous version.
- The new model's performance in reasoning and creative writing is impressive, though it struggles with computation-heavy tasks.
- The new Claude 3.5 Sonnet is better at reasoning and creative writing but still faces challenges in reliability and multilingual tasks.
- Main message: The new Claude 3.5 Sonnet represents a significant step forward in reasoning and processing abilities, though it still faces challenges in reliability and certain tasks.    See Less
It could also be malaria. Or maybe dengue. What about the African illness that they came up with? What about Delta? Alpha? ABCD+? Or just... Occam's razor.     See Less
19:16 Open bobs     See Less
Love your videos and can't wait for the next one, super informative and good fact check     See Less
the new model truly sucks XD the accuracy went down big time, simple stuff .. make him loose his shit easily.. to a point where its not just visible, but its unusable..     See Less