Advanced Technologies: Beyond Prompting - Fine-tuning

Whereas RAG gives you one way to give additional information to a large language model;

Fine-tuning is another way to give it more information.

  • In particular, if you have a bigger context that can fit into the input length, or the input context window length of the LM.
  • Get an LM to absorb this information
  • Getting the LM ot output text in a certain given style

Pretraining and Fine-tuning

  • Pre-training: Learns from a lot of data
  • Fine-tuning: cause the LM to do a little bit more learning to change its outputs to be
    • a set of sentences or a set of text
    • Create an additional data set
    • Fine-tuning to this relatively modest-sized data set

Why fine-tune?

  • To carry out a task that isn't easy to define in a prompt.
    • A summary of this specific style
    • Mimicking a writing or speaking style
  • To help LLM gain specific knowledge
    • gain a domain of knowledge: medical、legal、financial
  • To get a smaller model to perform a task.
    • larger model: 100B +parameters
    • small model: 1B parameters, Lower cost/latency to deploy
    • can run on mobile/laptop(edge devices)

  • Use it for tasks that are hard to specify in a prompt.
  • gain a body of knowledge
  • Get a smaller and cheaper-to-run LM

RAG is just a modification of your prompt

fine tuning

pre-training

相关推荐
盼小辉丶1 小时前
PyTorch强化学习实战(10)——强化学习高级组件
人工智能·pytorch·python·强化学习
mydeman1 小时前
智能体工程化演进:架构收敛、协议标准化与安全边界下沉
人工智能·架构·软件工程·ai编程
星辰AI1 小时前
长文本处理技术综述:突破上下文限制
人工智能·ai·语言模型
xwz小王子1 小时前
Nature 正刊:可穿戴膝关节机器人,重量仅为0.96 kg!让脊髓性肌萎缩症患儿重获站立能力
人工智能·机器人
白露与泡影1 小时前
自己用 ai 写了个链接 mysql 数据库的 mcp 工具
数据库·人工智能·mysql
掘根1 小时前
【openCV】键盘响应,像素逻辑操作,通道分离合并,抠像
人工智能·opencv·计算机视觉
一条泥憨鱼1 小时前
让AI从“死记硬背”到“开卷考试”:详解RAG技术的奥秘
人工智能·ai·语言模型·机器人·rag
霍格沃兹测试学院-小舟畅学1 小时前
高质量测试 Skill 编写手册 -- 渐进式披露
人工智能
MediaTea1 小时前
DL:生成对抗网络的基本原理与 PyTorch 实现
人工智能·pytorch·深度学习·神经网络·生成对抗网络