Setup helicone to monitor your LLM app cost now: https://www.helicone.ai/?utm_source=ai-jason
Join AI builder club to access     See More
AI Jason
207K subscribersCAG intro + Build a MCP server that read API docs Setup helicone to monitor your LLM app cost now: ...
Setup helicone to monitor your LLM app cost now:
81 Comments
00:42
AI For Humans
1.9k views • 1 day ago
44:40
AI For Humans
10.1k views • 2 days ago
19:38
Wes Roth
31.0k views • 2 days ago
14:56
AI Explained
48.0k views • 3 days ago
01:41
AI For Humans
4.3k views • 3 days ago
21:31
TheAIGRID
12.7k views • 3 days ago
21:31
TheAIGRID
7.6k views • 3 days ago
12:08
Wes Roth
46.5k views • 3 days ago
21:43
AI Explained
102.9k views • 4 days ago
23:40
TheAIGRID
56.2k views • 4 days ago
Shelf will be hidden for 30 daysUndo
14:08
Wes Roth
53.4k views • 5 days ago
26:37
Wes Roth
49.0k views • 5 days ago
13:33
TheAIGRID
8.5k views • 6 days ago
14:40
TheAIGRID
25.8k views • 6 days ago
15:37
TheAIGRID
81.1k views • 1 week ago
15:07
TheAIGRID
14.7k views • 1 week ago
29:19
Wes Roth
35.2k views • 1 week ago
18:27
AI Explained
58.7k views • 1 week ago
45:12
AI For Humans
11.0k views • 1 week ago
08:14
TheAIGRID
13.3k views • 1 week ago
19:45
TheAIGRID
117.2k views • 1 week ago
19:57
Wes Roth
24.8k views • 1 week ago
13:13
TheAIGRID
47.1k views • 1 week ago
48:20
Wes Roth
35.4k views • 1 week ago
12:54
AI Explained
57.9k views • 1 week ago
16:14
TheAIGRID
20.9k views • 1 week ago
11:02
Wes Roth
29.6k views • 1 week ago
18:35
TheAIGRID
12.8k views • 1 week ago
79:17
Wes Roth
85.9k views • 2 weeks ago
55:23
AI For Humans
11.6k views • 2 weeks ago
18:03
Wes Roth
53.8k views • 2 weeks ago
26:42
Wes Roth
71.0k views • 2 weeks ago
24:58
Wes Roth
53.2k views • 2 weeks ago
37:44
Wes Roth
30.0k views • 3 weeks ago
36:05
TheAIGRID
22.3k views • 3 weeks ago
13:16
TheAIGRID
27.6k views • 3 weeks ago
62:27
AI For Humans
9.4k views • 3 weeks ago
08:33
AI Jason
32.5k views • 3 weeks ago
01:25
AI For Humans
4.7k views • 3 weeks ago
07:58
Wes Roth
43.5k views • 3 weeks ago
25:09
Wes Roth
50.1k views • 3 weeks ago
01:25
AI For Humans
2.0k views • 3 weeks ago
01:19
AI For Humans
3.2k views • 4 weeks ago
08:55
TheAIGRID
18.2k views • 4 weeks ago
15:17
Wes Roth
14.4k views • 4 weeks ago
52:48
AI For Humans
13.3k views • 4 weeks ago
13:52
TheAIGRID
98.9k views • 1 month ago
14:14
AI Explained
57.3k views • 1 month ago
10:52
TheAIGRID
19.6k views • 1 month ago
23:43
Wes Roth
28.4k views • 1 month ago
14:23
TheAIGRID
28.0k views • 1 month ago
15:02
TheAIGRID
14.7k views • 1 month ago
15:51
TheAIGRID
18.6k views • 1 month ago
14:44
TheAIGRID
27.4k views • 1 month ago
05:14
AI Jason
30.5k views • 1 month ago
24:40
Wes Roth
168.0k views • 1 month ago
14:17
TheAIGRID
26.6k views • 1 month ago
48:09
AI For Humans
11.4k views • 1 month ago
28:09
Wes Roth
60.8k views • 1 month ago
35:42
Wes Roth
39.4k views • 1 month ago
37:59
TheAIGRID
14.6k views • 1 month ago
24:15
Wes Roth
137.1k views • 1 month ago
06:07
Wes Roth
21.1k views • 1 month ago
53:01
AI For Humans
14.5k views • 1 month ago
23:55
Wes Roth
32.4k views • 1 month ago
14:50
Wes Roth
43.0k views • 1 month ago
112:36
Wes Roth
12.9k views • 1 month ago
11:47
AI Jason
53.8k views • 1 month ago
57:36
AI For Humans
14.4k views • 1 month ago
01:43
AI For Humans
2.7k views • 1 month ago
14:22
TheAIGRID
9.0k views • 1 month ago
15:44
AI Explained
58.0k views • 1 month ago
24:25
TheAIGRID
17.5k views • 1 month ago
02:06
AI For Humans
8.1k views • 1 month ago
14:07
AI Explained
66.2k views • 1 month ago
38:23
AI For Humans
11.1k views • 1 month ago
18:08
TheAIGRID
19.5k views • 1 month ago
49:55
AI For Humans
10.9k views • 2 months ago
02:12
AI For Humans
2.8k views • 2 months ago
11:32
AI Explained
20.2k views • 2 months ago
11:32
AI Explained
48.3k views • 2 months ago
50:33
AI For Humans
13.3k views • 2 months ago
06:41
AI Jason
15.8k views • 2 months ago
44:47
AI For Humans
9.4k views • 2 months ago
52:12
AI For Humans
19.1k views • 2 months ago
18:55
AI Explained
57.4k views • 2 months ago
53:56
AI For Humans
16.4k views • 3 months ago
44:52
AI For Humans
11.4k views • 3 months ago
16:02
AI Jason
94.2k views • 3 months ago
53:25
AI For Humans
18.3k views • 3 months ago
15:02
AI Explained
163.0k views • 3 months ago
11:55
AI Explained
194.4k views • 3 months ago
40:18
AI For Humans
35.8k views • 3 months ago
64:05
AI For Humans
20.0k views • 3 months ago
18:44
AI Jason
133.7k views • 3 months ago
17:20
AI Explained
84.4k views • 4 months ago
51:06
AI For Humans
18.8k views • 4 months ago
07:02
AI Jason
80.4k views • 4 months ago
02:12
AI For Humans
12.9k views • 4 months ago
11:44
AI Explained
177.5k views • 4 months ago
09:29
AI Jason
51.6k views • 4 months ago
55:27
AI For Humans
22.3k views • 4 months ago
16:39
AI Jason
181.8k views • 4 months ago
26:20
AI Explained
109.2k views • 4 months ago
51:33
AI For Humans
19.7k views • 5 months ago
05:56
AI Jason
22.7k views • 5 months ago
01:22
AI For Humans
4.0k views • 5 months ago
14:01
AI Explained
101.3k views • 5 months ago
45:51
AI For Humans
19.1k views • 5 months ago
22:02
AI Jason
17.7k views • 5 months ago
16:50
AI Explained
96.3k views • 5 months ago
02:47
AI For Humans
15.8k views • 5 months ago
00:49
AI For Humans
3.2k views • 5 months ago
56:57
AI For Humans
31.6k views • 5 months ago
13:09
AI For Humans
9.5k views • 5 months ago
03:35
AI Jason
51.5k views • 5 months ago
19:05
AI Explained
98.8k views • 6 months ago
56:15
AI For Humans
20.4k views • 6 months ago
04:25
AI Jason
24.3k views • 6 months ago
17:08
AI Explained
99.6k views • 6 months ago
02:10
AI For Humans
31.0k views • 6 months ago
14:02
AI Jason
10.1k views • 6 months ago
01:59
AI For Humans
2.0k views • 6 months ago
17:42
AI Explained
82.8k views • 6 months ago
48:59
AI For Humans
15.4k views • 6 months ago
01:19
AI For Humans
1.5k views • 6 months ago
11:44
AI Jason
33.0k views • 6 months ago
34:24
AI Explained
105.2k views • 6 months ago
14:34
AI Explained
60.8k views • 6 months ago
19:04
AI Jason
54.5k views • 7 months ago
14:25
AI Explained
94.2k views • 7 months ago
20:10
AI Explained
60.3k views • 7 months ago
15:30
AI Jason
284.2k views • 7 months ago
23:52
AI Explained
72.7k views • 7 months ago
21:22
AI Explained
110.0k views • 7 months ago
13:19
AI Jason
165.9k views • 7 months ago
64:53
AI For Humans
7.7k views • 8 months ago
09:14
AI Jason
223.4k views • 8 months ago
10:09
AI Jason
16.5k views • 8 months ago
01:22
AI For Humans
1.1k views • 8 months ago
02:20
AI For Humans
1.2k views • 8 months ago
13:07
AI Jason
86.0k views • 8 months ago
131:12
Andrej Karpathy
2.1M views • 8 months ago
13:17
AI Jason
223.1k views • 9 months ago
55:52
AI For Humans
7.0k views • 9 months ago
04:01
AI For Humans
6.4k views • 9 months ago
211:24
Andrej Karpathy
4.0M views • 9 months ago
20:35
AI Jason
16.2k views • 9 months ago
08:40
AI Jason
18.5k views • 9 months ago
01:08
AI For Humans
2.3k views • 9 months ago
16:12
AI Jason
52.0k views • 10 months ago
52:17
AI For Humans
9.7k views • 10 months ago
81:55
Andrej Karpathy
34.7k views • 11 months ago
51:56
AI For Humans
8.0k views • 11 months ago
52:16
AI For Humans
6.5k views • 11 months ago
46:52
AI For Humans
7.6k views • 1 year ago
07:16
AI For Humans
4.2k views • 1 year ago
01:00
AI For Humans
2.4k views • 1 year ago
00:52
AI For Humans
3.1k views • 1 year ago
00:50
AI For Humans
9.6k views • 1 year ago
241:26
Andrej Karpathy
943.4k views • 1 year ago
30:38
Morningside AI
13.8k views • 1 year ago
133:35
Andrej Karpathy
962.9k views • 1 year ago
26:56
Morningside AI
4.2k views • 1 year ago
45:54
Morningside AI
10.1k views • 1 year ago
59:48
Andrej Karpathy
3.2M views • 2 years ago
39:00
Morningside AI
26.7k views • 2 years ago
116:20
Andrej Karpathy
6.6M views • 2 years ago
56:22
Andrej Karpathy
250.4k views • 3 years ago
115:24
Andrej Karpathy
307.4k views • 3 years ago
115:58
Andrej Karpathy
448.2k views • 3 years ago
75:40
Andrej Karpathy
481.7k views • 3 years ago
81 Comments
Setup helicone to monitor your LLM app cost now: https://www.helicone.ai/?utm_source=ai-jason
Join AI builder club to access     See More
More context = less attention, more latency, less repeatability in answer generation, more tokens, less concurrency, more cost
Its true but CAG has a limit in TKM, tokens per minute, and tokens per request rate limit, with this limits you need play with LLMs models for implement CAG systems, so if you have a enourmo     See More
I don't think this is CAG, this is.. just a BFP (big f!cking prompt).. I know there was a research paper about CAG and Im pretty sure it requires manipulating the internals of the model.
Helicone is honestly awful, would recommend using LangFuse instead. We used to be paying users of helicone, but their software is so slow and sluggish
strongly agree with this approach that I am experimenting as well : with RAG frameworks even with multi hop query etc. retrieval was really complicated. With CAG it destroy every query I do.     See More
Thank you very much for the information.. this is definitely better than RAG
How could we implement this in N8N?
I don't feel like you described actual CAG.....you left out the mechanism of 'C' (cache). CAG actually caches the KV computed values of your static knowledge base in the first l     See More
Setup helicone to monitor your LLM app cost now: https://www.helicone.ai/?utm_source=ai-jason
Join AI builder club to access     See More ple & Doc MCP: http://aibuilderclub.com/    See Less
More context = less attention, more latency, less repeatability in answer generation, more tokens, less concurrency, more cost     See Less
Its true but CAG has a limit in TKM, tokens per minute, and tokens per request rate limit, with this limits you need play with LLMs models for implement CAG systems, so if you have a enourmo     See More of context CAG is not the way    See Less
This is just a worse version of RAG, why are we going backwards?     See Less
I don't think this is CAG, this is.. just a BFP (big f!cking prompt).. I know there was a research paper about CAG and Im pretty sure it requires manipulating the internals of the model.     See Less
Helicone is honestly awful, would recommend using LangFuse instead. We used to be paying users of helicone, but their software is so slow and sluggish     See Less
strongly agree with this approach that I am experimenting as well : with RAG frameworks even with multi hop query etc. retrieval was really complicated. With CAG it destroy every query I do.     See More ven more clever I think for large dataset is like a simplified GraphRAG by labelling the docs with tags, put every document that have relevant tags into the cache, and still perform a RAG query on every documents for precise and local request (for example, query like « who is xxx » where RAG works well on proper names) to know in which document the info is and load in the cache all the document with the same tags that the one found with the RAG and boom, knowledge issue is done    See Less
Thank you very much for the information.. this is definitely better than RAG
How could we implement this in N8N?     See Less
Preloading whole database into context? 🤣     See Less
I don't feel like you described actual CAG.....you left out the mechanism of 'C' (cache). CAG actually caches the KV computed values of your static knowledge base in the first l     See More he model. Then, any incoming prompts are added as tokens AFTER that precomputed data. The model has to do far fewer computations to begin outputting the first tokens. This maximizes speed to first token, which is a huge part of building production chat/agents. However, this approach will require some model interface/API changes and is currently only supported in Gemini's latest offering as far as I know. There is another video you can search on "rag vs cag solving knowledge gaps" by ibm which goes into the caching mechanism.    See Less