Setup helicone to monitor your LLM app cost now: https://www.helicone.ai/?utm_source=ai-jason
Join AI builder club to access     See More
AI Jason
211K subscribersCAG intro + Build a MCP server that read API docs Setup helicone to monitor your LLM app cost now: ...
Setup helicone to monitor your LLM app cost now:
84 Comments
27:23
Wes Roth
23.3k views • 23 hours ago
08:52
TheAIGRID
3.1k views • 1 day ago
19:30
Wes Roth
27.7k views • 1 day ago
30:11
TheAIGRID
21.1k views • 2 days ago
111:10
Wes Roth
11.7k views • 2 days ago
13:07
TheAIGRID
37.5k views • 4 days ago
89:57
Wes Roth
18.0k views • 4 days ago
15:52
TheAIGRID
36.1k views • 6 days ago
21:05
Wes Roth
46.3k views • 6 days ago
14:24
TheAIGRID
3.9k views • 1 week ago
Shelf will be hidden for 30 daysUndo
11:31
TheAIGRID
19.9k views • 1 week ago
10:45
Wes Roth
31.6k views • 1 week ago
27:21
TheAIGRID
38.5k views • 1 week ago
18:55
Wes Roth
54.2k views • 1 week ago
10:25
TheAIGRID
416.1k views • 1 week ago
88:45
Wes Roth
13.4k views • 1 week ago
18:54
AI Jason
63.4k views • 2 weeks ago
33:27
AI Explained
112.1k views • 2 weeks ago
09:34
TheAIGRID
18.2k views • 2 weeks ago
62:07
Wes Roth
11.7k views • 2 weeks ago
20:00
AI Explained
86.7k views • 3 weeks ago
45:20
AI For Humans
8.5k views • 3 weeks ago
38:57
Wes Roth
31.1k views • 3 weeks ago
25:37
TheAIGRID
41.8k views • 3 weeks ago
23:58
Wes Roth
64.4k views • 3 weeks ago
35:11
TheAIGRID
22.8k views • 3 weeks ago
10:39
TheAIGRID
55.4k views • 3 weeks ago
27:21
Wes Roth
82.6k views • 3 weeks ago
13:03
TheAIGRID
15.0k views • 3 weeks ago
17:42
AI Explained
87.3k views • 4 weeks ago
55:07
AI For Humans
10.6k views • 4 weeks ago
11:29
AI Jason
73.9k views • 4 weeks ago
34:18
Wes Roth
68.4k views • 4 weeks ago
19:32
Wes Roth
57.4k views • 4 weeks ago
41:56
TheAIGRID
26.2k views • 1 month ago
17:31
Wes Roth
53.2k views • 1 month ago
42:48
TheAIGRID
23.8k views • 1 month ago
33:44
TheAIGRID
9.3k views • 1 month ago
118:46
Wes Roth
20.7k views • 1 month ago
10:12
TheAIGRID
20.1k views • 1 month ago
20:16
AI Explained
73.4k views • 1 month ago
49:17
AI For Humans
9.8k views • 1 month ago
11:29
TheAIGRID
6.1k views • 1 month ago
08:52
Wes Roth
74.2k views • 1 month ago
27:54
Wes Roth
20.7k views • 1 month ago
13:46
TheAIGRID
24.4k views • 1 month ago
62:38
Wes Roth
29.0k views • 1 month ago
10:43
TheAIGRID
7.0k views • 1 month ago
81:11
Wes Roth
109.0k views • 1 month ago
18:47
Wes Roth
212.0k views • 1 month ago
00:26
AI For Humans
3.0k views • 1 month ago
20:07
Wes Roth
45.6k views • 1 month ago
02:17
TheAIGRID
5.5k views • 1 month ago
12:33
AI Jason
30.0k views • 1 month ago
00:42
AI For Humans
4.8k views • 1 month ago
44:40
AI For Humans
15.3k views • 1 month ago
19:38
Wes Roth
34.9k views • 1 month ago
14:56
AI Explained
59.6k views • 1 month ago
01:41
AI For Humans
5.6k views • 1 month ago
12:08
Wes Roth
49.0k views • 1 month ago
21:43
AI Explained
117.4k views • 1 month ago
23:40
TheAIGRID
71.3k views • 1 month ago
14:08
Wes Roth
54.9k views • 1 month ago
13:33
TheAIGRID
8.8k views • 1 month ago
14:40
TheAIGRID
27.0k views • 1 month ago
15:37
TheAIGRID
111.2k views • 1 month ago
15:07
TheAIGRID
17.0k views • 1 month ago
18:27
AI Explained
61.5k views • 1 month ago
45:12
AI For Humans
11.8k views • 1 month ago
19:45
TheAIGRID
124.0k views • 1 month ago
13:13
TheAIGRID
48.4k views • 1 month ago
12:54
AI Explained
60.3k views • 1 month ago
55:23
AI For Humans
11.6k views • 2 months ago
62:27
AI For Humans
9.4k views • 2 months ago
08:33
AI Jason
38.0k views • 2 months ago
01:25
AI For Humans
4.7k views • 2 months ago
01:25
AI For Humans
2.0k views • 2 months ago
01:19
AI For Humans
3.2k views • 2 months ago
52:48
AI For Humans
13.3k views • 2 months ago
14:14
AI Explained
58.1k views • 2 months ago
05:14
AI Jason
32.5k views • 2 months ago
48:09
AI For Humans
11.4k views • 2 months ago
53:01
AI For Humans
14.5k views • 2 months ago
11:47
AI Jason
58.4k views • 3 months ago
57:36
AI For Humans
14.4k views • 3 months ago
01:43
AI For Humans
2.7k views • 3 months ago
15:44
AI Explained
58.5k views • 3 months ago
02:06
AI For Humans
8.1k views • 3 months ago
14:07
AI Explained
67.2k views • 3 months ago
38:23
AI For Humans
11.1k views • 3 months ago
49:55
AI For Humans
10.9k views • 3 months ago
02:12
AI For Humans
2.8k views • 3 months ago
11:32
AI Explained
20.2k views • 3 months ago
11:32
AI Explained
48.6k views • 3 months ago
50:33
AI For Humans
13.3k views • 3 months ago
06:41
AI Jason
17.0k views • 4 months ago
44:47
AI For Humans
9.4k views • 4 months ago
52:12
AI For Humans
19.1k views • 4 months ago
18:55
AI Explained
57.7k views • 4 months ago
53:56
AI For Humans
16.4k views • 4 months ago
44:52
AI For Humans
11.4k views • 4 months ago
16:02
AI Jason
106.1k views • 4 months ago
53:25
AI For Humans
18.3k views • 5 months ago
15:02
AI Explained
163.3k views • 5 months ago
11:55
AI Explained
196.3k views • 5 months ago
40:18
AI For Humans
35.8k views • 5 months ago
64:05
AI For Humans
20.0k views • 5 months ago
18:44
AI Jason
136.6k views • 5 months ago
17:20
AI Explained
84.6k views • 5 months ago
51:06
AI For Humans
18.8k views • 5 months ago
07:02
AI Jason
81.1k views • 5 months ago
02:12
AI For Humans
12.9k views • 5 months ago
11:44
AI Explained
178.4k views • 5 months ago
09:29
AI Jason
52.5k views • 6 months ago
55:27
AI For Humans
22.3k views • 6 months ago
16:39
AI Jason
184.9k views • 6 months ago
26:20
AI Explained
110.0k views • 6 months ago
51:33
AI For Humans
19.7k views • 6 months ago
05:56
AI Jason
22.9k views • 6 months ago
01:22
AI For Humans
4.0k views • 6 months ago
14:01
AI Explained
101.6k views • 6 months ago
45:51
AI For Humans
19.1k views • 6 months ago
22:02
AI Jason
17.9k views • 6 months ago
16:50
AI Explained
96.4k views • 7 months ago
02:47
AI For Humans
15.8k views • 7 months ago
00:49
AI For Humans
3.2k views • 7 months ago
56:57
AI For Humans
31.6k views • 7 months ago
13:09
AI For Humans
9.5k views • 7 months ago
03:35
AI Jason
52.2k views • 7 months ago
19:05
AI Explained
98.9k views • 7 months ago
56:15
AI For Humans
20.4k views • 7 months ago
04:25
AI Jason
24.5k views • 7 months ago
17:08
AI Explained
99.6k views • 7 months ago
02:10
AI For Humans
31.0k views • 7 months ago
14:02
AI Jason
10.2k views • 7 months ago
01:59
AI For Humans
2.0k views • 7 months ago
17:42
AI Explained
83.1k views • 7 months ago
48:59
AI For Humans
15.4k views • 7 months ago
01:19
AI For Humans
1.5k views • 8 months ago
11:44
AI Jason
34.1k views • 8 months ago
34:24
AI Explained
105.7k views • 8 months ago
14:34
AI Explained
60.8k views • 8 months ago
19:04
AI Jason
54.9k views • 8 months ago
15:30
AI Jason
286.3k views • 9 months ago
13:19
AI Jason
172.4k views • 9 months ago
64:53
AI For Humans
7.7k views • 9 months ago
09:14
AI Jason
223.7k views • 9 months ago
10:09
AI Jason
16.5k views • 9 months ago
01:22
AI For Humans
1.1k views • 9 months ago
02:20
AI For Humans
1.2k views • 9 months ago
13:07
AI Jason
86.4k views • 9 months ago
131:12
Andrej Karpathy
2.2M views • 10 months ago
13:17
AI Jason
226.1k views • 10 months ago
55:52
AI For Humans
7.0k views • 10 months ago
04:01
AI For Humans
6.4k views • 10 months ago
211:24
Andrej Karpathy
4.4M views • 11 months ago
01:08
AI For Humans
2.3k views • 11 months ago
52:17
AI For Humans
9.7k views • 11 months ago
81:55
Andrej Karpathy
34.7k views • 1 year ago
51:56
AI For Humans
8.0k views • 1 year ago
52:16
AI For Humans
6.5k views • 1 year ago
46:52
AI For Humans
7.6k views • 1 year ago
07:16
AI For Humans
4.2k views • 1 year ago
01:00
AI For Humans
2.4k views • 1 year ago
00:52
AI For Humans
3.1k views • 1 year ago
00:50
AI For Humans
9.6k views • 1 year ago
241:26
Andrej Karpathy
963.2k views • 1 year ago
30:38
Morningside AI
13.8k views • 1 year ago
133:35
Andrej Karpathy
989.2k views • 1 year ago
26:56
Morningside AI
4.2k views • 2 years ago
45:54
Morningside AI
10.1k views • 2 years ago
59:48
Andrej Karpathy
3.3M views • 2 years ago
39:00
Morningside AI
26.7k views • 2 years ago
116:20
Andrej Karpathy
6.7M views • 2 years ago
56:22
Andrej Karpathy
254.4k views • 3 years ago
115:24
Andrej Karpathy
313.6k views • 3 years ago
115:58
Andrej Karpathy
457.4k views • 3 years ago
75:40
Andrej Karpathy
490.9k views • 3 years ago
84 Comments
Setup helicone to monitor your LLM app cost now: https://www.helicone.ai/?utm_source=ai-jason
Join AI builder club to access     See More
More context = less attention, more latency, less repeatability in answer generation, more tokens, less concurrency, more cost
Its true but CAG has a limit in TKM, tokens per minute, and tokens per request rate limit, with this limits you need play with LLMs models for implement CAG systems, so if you have a enourmo     See More
I don't think this is CAG, this is.. just a BFP (big f!cking prompt).. I know there was a research paper about CAG and Im pretty sure it requires manipulating the internals of the model.
Helicone is honestly awful, would recommend using LangFuse instead. We used to be paying users of helicone, but their software is so slow and sluggish
strongly agree with this approach that I am experimenting as well : with RAG frameworks even with multi hop query etc. retrieval was really complicated. With CAG it destroy every query I do.     See More
Thank you very much for the information.. this is definitely better than RAG
How could we implement this in N8N?
I don't feel like you described actual CAG.....you left out the mechanism of 'C' (cache). CAG actually caches the KV computed values of your static knowledge base in the first l     See More
Setup helicone to monitor your LLM app cost now: https://www.helicone.ai/?utm_source=ai-jason
Join AI builder club to access     See More ple & Doc MCP: http://aibuilderclub.com/    See Less
More context = less attention, more latency, less repeatability in answer generation, more tokens, less concurrency, more cost     See Less
Its true but CAG has a limit in TKM, tokens per minute, and tokens per request rate limit, with this limits you need play with LLMs models for implement CAG systems, so if you have a enourmo     See More of context CAG is not the way    See Less
This is just a worse version of RAG, why are we going backwards?     See Less
I don't think this is CAG, this is.. just a BFP (big f!cking prompt).. I know there was a research paper about CAG and Im pretty sure it requires manipulating the internals of the model.     See Less
Helicone is honestly awful, would recommend using LangFuse instead. We used to be paying users of helicone, but their software is so slow and sluggish     See Less
strongly agree with this approach that I am experimenting as well : with RAG frameworks even with multi hop query etc. retrieval was really complicated. With CAG it destroy every query I do.     See More ven more clever I think for large dataset is like a simplified GraphRAG by labelling the docs with tags, put every document that have relevant tags into the cache, and still perform a RAG query on every documents for precise and local request (for example, query like « who is xxx » where RAG works well on proper names) to know in which document the info is and load in the cache all the document with the same tags that the one found with the RAG and boom, knowledge issue is done    See Less
Thank you very much for the information.. this is definitely better than RAG
How could we implement this in N8N?     See Less
Preloading whole database into context? 🤣     See Less
I don't feel like you described actual CAG.....you left out the mechanism of 'C' (cache). CAG actually caches the KV computed values of your static knowledge base in the first l     See More he model. Then, any incoming prompts are added as tokens AFTER that precomputed data. The model has to do far fewer computations to begin outputting the first tokens. This maximizes speed to first token, which is a huge part of building production chat/agents. However, this approach will require some model interface/API changes and is currently only supported in Gemini's latest offering as far as I know. There is another video you can search on "rag vs cag solving knowledge gaps" by ibm which goes into the caching mechanism.    See Less