今天扯一扯在github上看到的一个项目
gpt-pilot
,声称"首个AI程序员"。本来打算玩一下,结果需要配置大语言模型的API,并且只支持OpenAI和claude(Qwen呢)。有没有玩过的老哥说一下好不好用!!(对了,为啥博客现在都没有人留言交流了。。。)
多角色模拟
通过它的说明文档中对其工作流程的描述,可以看到它在创建应用程序的过程中模拟了真实项目中所需的各种角色:
Here are the steps GPT Pilot takes to create an app:You enter the app name and the description.
Product Owner agent like in real life, does nothing.
Specification Writer agent asks a couple of questions to understand the requirements better if project >description is not good enough.
Architect agent writes up technologies that will be used for the app and checks if all technologies are >installed on the machine and installs them if not.
Tech Lead agent writes up development tasks that the Developer must implement.
Developer agent takes each task and writes up what needs to be done to implement it. The description is >in human-readable form.
Code Monkey agent takes the Developer's description and the existing file and implements the changes.
Reviewer agent reviews every step of the task and if something is done wrong Reviewer sends it back to >Code Monkey.
Troubleshooter agent helps you to give good feedback to GPT Pilot when something is wrong.
Debugger agent hate to see him, but he is your best friend when things go south.
Technical Writer agent writes documentation for the project.
根据它的介绍视频描述,它不是生成代码,而是生成应用,这就很牛X了。按照目前对各种大语言模型的使用体验来看,很重要的一点是需要交互,通过不断的交互来筛选出答案。需要多次交互的原因一方面是问题描述可能不准确,另一方面是AI的回答并不总是对的。我猜想gpt-pilot应该也不是那种一键生成,需要通过不断的交互对产出内容进行校准,过程应该类似敏捷开发。
提示词模版
看了下项目代码,对各种角色提供了提示词模版。我知道提示词可以提高返回结果的准确率,但具体的执行效果如何还需体验。