Advanced Technologies: Beyond prompting - Tool use and agents(optional)

How LLMs are starting to really use tools, and then also discuss a cutting-edge topic of agents, which is where we'll let LLMs try to decide for themselves what action they want to take next.

Tool use for food order taking--chatbot

  • LLM can't just say...., because it needs to take some action to actually send xxx to you
  • An LLM might output this response, order xxxx for the user, and then also say the user message is to say, " Okay, it's on its way."
    • An LLM that's been fine-tuned to output text like this will be able to generate an order, which in this case would trigger a software application ( passes a request)
    • What is shown to the user is not the full LLM output, but rather set to the user as the response
    • A better user interface would be to pop up a verification dialogue;
    • And clearly, given that LLMs' outputs are not completely reliable for any safety-critical or mission-critical action, it would be a good idea to let a user confirm that that's the right action before letting the LLM trigger some potentially costly mistake by itself

Tools for reasoning

  • LLMs are not great at precise math
    • It turns out LLMs, having learned to predict the next words or maybe even instruction tunes, are not great at precise math
    • So, rather than having the LLM output the answer directly, if the LLM were to output this, after compounding and so on, you would have a calculator
    • This can be interpreted as a command to call an external calculator program to explicitly compute the right answer, and plug the result back into the text to give the user the correct data figure
    • So by giving LLMs the ability to call tools in their output, we can significantly extend the reasoning or the action-taking capabilities of LLMs
    • Make sure that tools aren't triggered in a way that causes harm or irreversible damage

Agents

  • Use LLM to choose and carry out complex sequences of actions

    • Going beyond tools into a more experimental area, which goes beyond triggering a tool to carry out a single action. But it's exploring whether LLMs can choose and carry out complex sequences of actions
  • Cutting-edge area of AI research

    • It's not yet mature enough to count on for the most important applications
    • An agent uses an LLM as a reasoning engine to figure out what are the steps it needs to carry out to do the task.
    • And this reasoning engine, the LLM, might decide it needs to search for the list of the xxx,
    • Then visit the website for each xxx, and finally, for each xxx, write a summary based on the homepage content.
    1. And then perhaps by making a sequence of calls to this reasoning engine, it may figure out how to search the xxx, it has to trigger a tool to call a web search engine on the query xxx
    2. It may visit the websites of some of the xxx to download their homepages
    3. Call an LLM yet again to summarize the text that it found on the website
相关推荐
朱大喜3 小时前
BI 平台搭建:从数仓到自助分析的实战路径
人工智能
一切皆是因缘际会3 小时前
LLM轻量化联邦微调机理
数据结构·人工智能·数学建模·ai
Lkstar3 小时前
万字长文Query改写与多路召回实战|从HyDE到RRF融合,召回率提升22%的完整方案
数据库·人工智能·llm
星辰AI打工人3 小时前
Agent-Reach 源码级解析:一个 30-200 行的插件系统凭什么治理 14 个平台
人工智能
张彦峰ZYF4 小时前
从嵌入、表征到潜空间:理解大模型向量世界的三种视角
人工智能·大模型·向量空间
咕咕AI学堂4 小时前
Python 异步数据库驱动优化:从连接池到 uvloop 的全链路性能调优
人工智能
老H科研技术4 小时前
第 07 篇:OAuth 2.1 与授权架构 —— AS/RS 分离的正确姿势
人工智能·mcp
闵孚龙4 小时前
PyTorch 系列 之 nn.Module:所有模型的骨架
人工智能·pytorch·python
海天一色y4 小时前
深入理解 Function Calling、MCP 与 Skills:AI Agent 的三层能力架构
人工智能·mcp·skills