Advanced Technologies: Beyond Prompting - Fine-tuning

Whereas RAG gives you one way to give additional information to a large language model;

Fine-tuning is another way to give it more information.

  • In particular, if you have a bigger context that can fit into the input length, or the input context window length of the LM.
  • Get an LM to absorb this information
  • Getting the LM ot output text in a certain given style

Pretraining and Fine-tuning

  • Pre-training: Learns from a lot of data
  • Fine-tuning: cause the LM to do a little bit more learning to change its outputs to be
    • a set of sentences or a set of text
    • Create an additional data set
    • Fine-tuning to this relatively modest-sized data set

Why fine-tune?

  • To carry out a task that isn't easy to define in a prompt.
    • A summary of this specific style
    • Mimicking a writing or speaking style
  • To help LLM gain specific knowledge
    • gain a domain of knowledge: medical、legal、financial
  • To get a smaller model to perform a task.
    • larger model: 100B +parameters
    • small model: 1B parameters, Lower cost/latency to deploy
    • can run on mobile/laptop(edge devices)

  • Use it for tasks that are hard to specify in a prompt.
  • gain a body of knowledge
  • Get a smaller and cheaper-to-run LM

RAG is just a modification of your prompt

fine tuning

pre-training

相关推荐
oyyanghh6 分钟前
AI编码模式实测:两款主流工具交互式开发能力对比
人工智能
代码小库13 分钟前
【2026前端转 AI 全栈指南】第 1 章:前言 · 后端架构 · 章节导览
前端·人工智能·架构
极光代码工作室14 分钟前
基于深度学习的手写数字识别系统
人工智能·python·深度学习·神经网络·机器学习
Tbisnic17 分钟前
AI大模型学习第十三天:让AI学会查资料、记数据、看图和听声
人工智能·ai·大模型开发·rag·coze
blue_dou24 分钟前
灵活拓展能力对决:多款CRM自定义与数据互通实测
大数据·人工智能
女神下凡27 分钟前
这是 Cursor(Composer) 的五种核心交互模式
服务器·人工智能·windows·vscode·microsoft
AI创界者30 分钟前
告别云端限制!Sulphur 2 本地文生视频/图生视频整合包,本地部署,解压即用,保姆级部署与工作流实战
人工智能·python·aigc·音视频
蓝星空200033 分钟前
GPT-Image-2 实战教程:一段提示词生成专业分镜图(含 9 格脚本模板,附一键同款)
人工智能·gpt·image2·imagen
用户3379225456835 分钟前
从字节跳动 DeerFlow 源码看 Agent 平台设计(二):工具系统设计 — 从全量绑定到按需加载
人工智能
IT 行者42 分钟前
GitHub Spec Kit 实战(四):读懂和干预 /speckit.plan——AI 最自由发挥的一步
java·人工智能·github·ai编程·claude