Advanced Technologies: Beyond Prompting -- Retrieval Augmented Generation(RAG)

RAG: giving it additional knowledge beyond what it may have learned from data on the Internet or other open sources

Retrieval Augmented Generation (RAG) example

  • General Chatbot
  • Chatbot with RAG
    • Give the LLM additional information
  1. Given a question, search relevant documents for answer (Company documents, most relevant to this question)
  2. Incorporate retrieved text into an updated prompt (take the relevant text from documentation and put that into my prompt;)
    1. limitations to the prompt length or the input length for a large language model
    2. Pull out just the part of the document that's most relevant to the question
  3. Generate answer from the new prompt with additional context
    1. generate an answer to this, but we're going to augment how we generate text by retrieving the relevant context or the relevant information and augmenting the prompt with that additional text
    2. also add a link to the original source document that led to this answer being generated
    3. can go back and read the original source document and double-check the answer for themseleves

Example of RAG applications

  • Chat with PDF files
    • upload pdf and then ask questions
    • use RAG to generate answers for you
  • Answer questions based on a website's articles
  • New form of web search

Big Idea: LLM as a reasoning engine: which is to think of the LLM not as a knowledge store, but instead as a reasoning engine

  • LLMs have a lot of general knowledge, but they don't know everything
  • By providing relevant context in the prompt, we ask an LLM to read a piece of text, then process it to get an answer
  • We're using it as a reasoning engine to process information, rather than using it as a source of information
相关推荐
Bruce_Liuxiaowei1 小时前
WorkBuddy案例——自动化内容创作平台
人工智能·ai·大模型·智能体·workbuddy
狒狒热知识1 小时前
软文营销底层逻辑重构专业发稿平台成品牌流量核心抓手
人工智能
fan65404141 小时前
GEO服务商技术评估的四维量化框架:以杭州文澜天下科技为例
运维·人工智能·科技
沪漂阿龙1 小时前
面试题详解:大模型思维链 CoT 推理模型训练全解析——如何微调出带思维链的 LLM、优缺点、SFT 与 RLHF 能不能跳过
人工智能
2601_958352901 小时前
A-68双麦波束模组深度解析:90dB降噪、60°夹角、3-5米拾音,一篇讲透
人工智能·语音识别·嵌入式开发·音频降噪·回音消除
事变天下1 小时前
奥哲重构电建新范式!AI赋能电力建设全场景智能化升级
大数据·人工智能
翼龙云_cloud1 小时前
腾讯云代理商:腾讯云 AI 算力成本全解析 从技术原理到价格优化策略
人工智能·云计算·腾讯云
浩瀚之水_csdn1 小时前
机器学习性能评估指数之Dice系数
人工智能·计算机视觉·目标跟踪
闵孚龙1 小时前
AI 应用开发到底在开发什么?
人工智能