Advanced Technologies: Beyond Prompting -- Retrieval Augmented Generation(RAG)

RAG: giving it additional knowledge beyond what it may have learned from data on the Internet or other open sources

Retrieval Augmented Generation (RAG) example

  • General Chatbot
  • Chatbot with RAG
    • Give the LLM additional information
  1. Given a question, search relevant documents for answer (Company documents, most relevant to this question)
  2. Incorporate retrieved text into an updated prompt (take the relevant text from documentation and put that into my prompt;)
    1. limitations to the prompt length or the input length for a large language model
    2. Pull out just the part of the document that's most relevant to the question
  3. Generate answer from the new prompt with additional context
    1. generate an answer to this, but we're going to augment how we generate text by retrieving the relevant context or the relevant information and augmenting the prompt with that additional text
    2. also add a link to the original source document that led to this answer being generated
    3. can go back and read the original source document and double-check the answer for themseleves

Example of RAG applications

  • Chat with PDF files
    • upload pdf and then ask questions
    • use RAG to generate answers for you
  • Answer questions based on a website's articles
  • New form of web search

Big Idea: LLM as a reasoning engine: which is to think of the LLM not as a knowledge store, but instead as a reasoning engine

  • LLMs have a lot of general knowledge, but they don't know everything
  • By providing relevant context in the prompt, we ask an LLM to read a piece of text, then process it to get an answer
  • We're using it as a reasoning engine to process information, rather than using it as a source of information
相关推荐
qq_411262423 分钟前
四博智联AI开发宝典(2/3):后端部署、OTA与AT+MCP接入
人工智能·ai·四博
QiLinkOS4 分钟前
极客精神与商业思维的融合实践(2)
c语言·c++·人工智能·算法·开源协议
逻辑君8 分钟前
认知神经科学研究报告【20260071】
人工智能·深度学习·机器学习·数学建模
Eloudy10 分钟前
伊辛解码(Ising Decoding)
人工智能·量子计算
财经资讯数据_灵砚智能10 分钟前
基于全球经济类多源新闻的NLP情感分析与数据可视化(日间)2026年6月12日
人工智能·python·ai·信息可视化·自然语言处理·ai编程·灵砚智能
deephub16 分钟前
相关性与因果性:识别伪相关以提升模型在真实环境的可用性
人工智能·机器学习·数据挖掘·数据分析
2601_9555052517 分钟前
行业研究|AI-Ready高质量数据集建设难点与元数据标准化解决方案(基于国家数据局25号文)
人工智能·金融·能源·健康医疗·制造·政务
虾壳云官方18 分钟前
【本地 AI 自动化最新工具】 OpenClaw 2.7.9 Windows 完整部署教程(包含安装包)
人工智能·windows·openclaw·openclaw安装·openclaw一键部署
ai产品老杨20 分钟前
解耦异构安防:基于 Docker 与边缘计算的 AI 视频管理平台,如何实现 GB28181/RTSP 统一接入与全源码交付
人工智能·docker·边缘计算
趋之22 分钟前
千问大模型核心能力与实战效果全景展示
人工智能