ChatGPT 4V image recognition feature

1. Get the image's sig and file_id

2e0edc6e489ed13a3f32f0dd87527d77.jpg is the file name of the local image.

Capture the authentication headers yourself with the browser's developer tools (F12).


https://chat.openai.com/backend-api/files

Authorization:Bearer eyJhbGc****************5V-lztYwLb9hr6LP7g
Cookie:  *************************D


{"file_name":"2e0edc6e489ed13a3f32f0dd87527d77.jpg","file_size":51010,"use_case":"multimodal"}

Response:
{
    "status": "success",
    "upload_url": "https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A24%3A03Z&sp=c&sv=2021-08-06&sr=b&sig=oXzcBB7Q8HWyZr6JUSbuUYtgwgOWVdia7EiO8ALBe%2Bw%3D",
    "file_id": "file-KVRbTP1Xy0NP1WKKZRDKSPL1"
}
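
The request and response above can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the capture does not name the HTTP method, so POST with a JSON body is assumed; the helper names `build_create_file_request` and `parse_upload_info` are hypothetical; and the token is a placeholder you capture yourself. The code only builds the request and parses a response of the shape shown above. It sends nothing.

```python
import json
import urllib.request
from urllib.parse import urlparse, parse_qs

FILES_URL = "https://chat.openai.com/backend-api/files"


def build_create_file_request(token, file_name, file_size):
    """Build (but do not send) the step 1 request.

    Assumption: the endpoint expects POST with a JSON body.
    """
    body = json.dumps({
        "file_name": file_name,
        "file_size": file_size,
        "use_case": "multimodal",
    }).encode("utf-8")
    return urllib.request.Request(
        FILES_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


def parse_upload_info(response_text):
    """Return (file_id, upload_url, sig) from the step 1 response."""
    data = json.loads(response_text)
    if data.get("status") != "success":
        raise ValueError(f"unexpected status: {data.get('status')!r}")
    # The sig is the percent-encoded "sig" query parameter of upload_url.
    query = parse_qs(urlparse(data["upload_url"]).query)
    return data["file_id"], data["upload_url"], query["sig"][0]
```

In practice the upload_url is used as a whole in the next steps, so extracting sig separately is mostly useful for inspection.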

2. Upload the binary image

https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A24%3A03Z&sp=c&sv=2021-08-06&sr=b&sig=oXzcBB7Q8HWyZr6JUSbuUYtgwgOWVdia7EiO8ALBe%2Bw%3D


Accept: application/json, text/plain, */*
Accept-Encoding: gzip, deflate, br
Accept-Language: zh-CN,zh;q=0.9,en;q=0.8,en-GB;q=0.7,en-US;q=0.6
Connection: keep-alive

Content-Type: image/jpeg
Host: fileserviceuploadsperm.blob.core.windows.net
Origin: https://chat.openai.com
Sec-Fetch-Dest: empty
Sec-Fetch-Mode: cors
Sec-Fetch-Site: cross-site
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/117.0.0.0 Safari/537.36 Edg/117.0.2045.60
sec-ch-ua: "Microsoft Edge";v="117", "Not;A=Brand";v="8", "Chromium";v="117"
sec-ch-ua-mobile: ?0
sec-ch-ua-platform: "Windows"
x-ms-blob-type: BlockBlob
x-ms-version: 2020-04-08
No response body.

3. PUT the image

https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A24%3A03Z&sp=c&sv=2021-08-06&sr=b&sig=oXzcBB7Q8HWyZr6JUSbuUYtgwgOWVdia7EiO8ALBe%2Bw%3D
Accept: application/json, text/plain, */*
Accept-Encoding: gzip, deflate, br
Accept-Language: zh-CN,zh;q=0.9,en;q=0.8,en-GB;q=0.7,en-US;q=0.6
Connection: keep-alive
Content-Length: 51010
Content-Type: image/jpeg
Host: fileserviceuploadsperm.blob.core.windows.net
Origin: https://chat.openai.com
Sec-Fetch-Dest: empty
Sec-Fetch-Mode: cors
Sec-Fetch-Site: cross-site
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/117.0.0.0 Safari/537.36 Edg/117.0.2045.60
sec-ch-ua: "Microsoft Edge";v="117", "Not;A=Brand";v="8", "Chromium";v="117"
sec-ch-ua-mobile: ?0
sec-ch-ua-platform: "Windows"
x-ms-blob-type: BlockBlob
x-ms-version: 2020-04-08

Body (form-data): the local image file.
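
A minimal Python sketch of this PUT, assuming the request body is the raw bytes of the image file. The capture labels the body as form-data, but the `Content-Type: image/jpeg` and `Content-Length: 51010` headers suggest the file is sent as-is, so treat that as an assumption. The helper name `build_blob_put_request` is hypothetical. Only the blob-specific headers from the capture are set; the browser-generated ones (User-Agent, sec-ch-ua, and so on) are left out.

```python
import urllib.request


def build_blob_put_request(upload_url, image_bytes, content_type="image/jpeg"):
    """Build (but do not send) the PUT that uploads the image to upload_url.

    Assumption: the body is the raw file content, not a multipart form.
    urllib fills in Content-Length from the data when the request is sent.
    """
    return urllib.request.Request(
        upload_url,
        data=image_bytes,
        method="PUT",
        headers={
            "Content-Type": content_type,
            "x-ms-blob-type": "BlockBlob",
            "x-ms-version": "2020-04-08",
        },
    )
```

To send it, read the local file in binary mode, pass the bytes to the helper, and hand the resulting request to `urllib.request.urlopen`.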

4. Mark the upload as complete

https://chat.openai.com/backend-api/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1/uploaded

Authorization:Bearer eyJ*****tYwLb9hr6LP7g
Cookie:  *HS%2BvXw%3D

Submit an empty JSON body:
{}

Response:
{
    "status": "success",
    "download_url": "https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A34%3A11Z&sp=r&sv=2021-08-06&sr=b&rscd=attachment%3B%20filename%3D2e0edc6e489ed13a3f32f0dd87527d77.jpg&sig=LJbEvMeV6KtlkTi6Y1udDd%2BK3YSbgfUWnsGCEi5rGSs%3D",
    "metadata": null
}
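
This call can be sketched in Python as below. It is a sketch under assumptions: POST is assumed because the capture does not name the method, and the helper names `build_uploaded_request` and `parse_download_url` are hypothetical. The code builds the request and reads `download_url` from a response of the shape shown above, without sending anything.

```python
import json
import urllib.request


def build_uploaded_request(token, file_id):
    """Build (but do not send) the request that reports the upload as done.

    Assumption: the endpoint expects POST with an empty JSON object.
    """
    url = f"https://chat.openai.com/backend-api/files/{file_id}/uploaded"
    return urllib.request.Request(
        url,
        data=b"{}",
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


def parse_download_url(response_text):
    """Return download_url from the response, or raise if status is not success."""
    data = json.loads(response_text)
    if data.get("status") != "success":
        raise ValueError(f"unexpected status: {data.get('status')!r}")
    return data["download_url"]
```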

5. Attach the image to a ChatGPT question and ask

https://chat.openai.com/backend-api/conversation
Authorization:Bearer eyJhbGciOiJSU******%3D


{
	"action": "next",
	"messages": [{
		"id": "aaa27270-f62e-454a-962c-f62794152450",
		"author": {
			"role": "user"
		},
		"content": {
			"content_type": "multimodal_text",
			"parts": [{
				"asset_pointer": "file-service://file-KVRbTP1Xy0NP1WKKZRDKSPL1",
				"size_bytes": 239505,
				"width": 1706,
				"height": 1280
			}, "再看看"]
		},
		"metadata": {}
	}],
	"conversation_id": "41181b17-b71d-4748-8db5-3a80d7a27a33",
	"parent_message_id": "49bd950f-372f-45f6-9f72-b6f27fa04373",
	"model": "gpt-4",
	"timezone_offset_min": -480,
	"suggestions": [],
	"history_and_training_disabled": false,

	"force_paragen": false
}

The response contains the answer.
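
The payload above can be assembled programmatically. The following Python sketch mirrors the fields of the captured JSON; the helper names `build_image_message` and `build_conversation_payload` are hypothetical, and generating the message id with `uuid.uuid4()` is an assumption (the capture only shows that the id is UUID-shaped). The file_id comes from step 1, and the image size and dimensions must be supplied by the caller.

```python
import json
import uuid


def build_image_message(file_id, size_bytes, width, height, prompt):
    """Build one user message that pairs an uploaded image with a text prompt."""
    return {
        "id": str(uuid.uuid4()),  # assumption: any fresh UUID is accepted
        "author": {"role": "user"},
        "content": {
            "content_type": "multimodal_text",
            "parts": [
                {
                    "asset_pointer": f"file-service://{file_id}",
                    "size_bytes": size_bytes,
                    "width": width,
                    "height": height,
                },
                prompt,
            ],
        },
        "metadata": {},
    }


def build_conversation_payload(message, conversation_id, parent_message_id,
                               model="gpt-4", timezone_offset_min=-480):
    """Wrap a message in the request body for /backend-api/conversation."""
    return {
        "action": "next",
        "messages": [message],
        "conversation_id": conversation_id,
        "parent_message_id": parent_message_id,
        "model": model,
        "timezone_offset_min": timezone_offset_min,
        "suggestions": [],
        "history_and_training_disabled": False,
        "force_paragen": False,
    }
```

Serialize the result with `json.dumps` and send it with the same Authorization header as in the earlier steps.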