漫游 RAG – 让模型找到答案 – 搞英语 → 看世界

John Berryman 的全新 RAG 技术（名字很活泼）：

Roaming RAG 的主要想法是制作一个简单的 LLM 应用程序，以便 LLM 助理能够阅读文档的层次结构大纲，然后翻阅文档（通过打开各个部分），直到找到并回答手头的问题。由于漫游 RAG 直接导航文档文本，因此无需设置检索基础设施，而且移动部件更少意味着您可能搞砸的事情更少！

John 提供了一个示例，该示例将 Markdown 文档折叠为仅标题，每个标题都带有一条指令注释 。

然后提供expand_section()工具以及以下工具描述：

Expand a section of the markdown document to reveal its contents.

- Expand the most specific (lowest-level) relevant section first
- Multiple sections can be expanded in parallel
- You can expand any section regardless of parent section state (eg parent sections do not need to be expanded to view subsection content)

我过去探索过矢量搜索和全文搜索 RAG，但这是我见过的第一个听起来令人信服的技术，它完全跳过搜索，而是倾向于允许模型通过标题直接导航大型文档。

通过@jnbrymn.bsky.social

标签：提示工程,生成人工智能,人工智能, rag , llms

原文： https://simonwillison.net/2024/Dec/6/roaming-rag/#atom-everything