ChatGPT 4V image recognition

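1. Get the image's sig and file_id

2e0edc6e489ed13a3f32f0dd87527d77.jpg is the file name of the local image.

Capture the authentication headers (Authorization and Cookie) yourself from the browser's developer tools (F12).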


https://chat.openai.com/backend-api/files

Authorization:Bearer eyJhbGc****************5V-lztYwLb9hr6LP7g
Cookie:  *************************D


{"file_name":"2e0edc6e489ed13a3f32f0dd87527d77.jpg","file_size":51010,"use_case":"multimodal"}

Response:
{
    "status": "success",
    "upload_url": "https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A24%3A03Z&sp=c&sv=2021-08-06&sr=b&sig=oXzcBB7Q8HWyZr6JUSbuUYtgwgOWVdia7EiO8ALBe%2Bw%3D",
    "file_id": "file-KVRbTP1Xy0NP1WKKZRDKSPL1"
}
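
As a rough sketch of step 1, the request body can be built from the local file, and the `sig` and `file_id` can be pulled out of the response. The function names here are hypothetical, and the field names are simply the ones visible in the capture above; the backend endpoint is undocumented and may change.

```python
import json
import os
from urllib.parse import parse_qs, urlsplit

# Endpoint taken from the capture above.
FILES_URL = "https://chat.openai.com/backend-api/files"


def build_create_file_payload(file_name: str, file_size: int) -> str:
    """Return the JSON body for the step 1 request."""
    return json.dumps(
        {"file_name": file_name, "file_size": file_size, "use_case": "multimodal"}
    )


def payload_for_local_file(path: str) -> str:
    """Build the payload from a file on disk (name and size in bytes)."""
    return build_create_file_payload(os.path.basename(path), os.path.getsize(path))


def parse_create_file_response(body: str) -> dict:
    """Extract file_id, upload_url, and the SAS parameters from the response."""
    data = json.loads(body)
    if data.get("status") != "success":
        raise ValueError(f"unexpected status: {data.get('status')!r}")
    # parse_qs percent-decodes the values, so %2B becomes "+" and %3D becomes "=".
    query = parse_qs(urlsplit(data["upload_url"]).query)
    return {
        "file_id": data["file_id"],
        "upload_url": data["upload_url"],
        "sig": query["sig"][0],
        "expires": query["se"][0],
    }
```

The `upload_url` should be reused as-is in the upload step; the decoded `sig` is only useful for inspection, and `expires` shows that the URL is valid for a limited time.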

2. Upload the binary image

https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A24%3A03Z&sp=c&sv=2021-08-06&sr=b&sig=oXzcBB7Q8HWyZr6JUSbuUYtgwgOWVdia7EiO8ALBe%2Bw%3D


Accept: application/json, text/plain, */*
Accept-Encoding: gzip, deflate, br
Accept-Language: zh-CN,zh;q=0.9,en;q=0.8,en-GB;q=0.7,en-US;q=0.6
Connection: keep-alive

Content-Type: image/jpeg
Host: fileserviceuploadsperm.blob.core.windows.net
Origin: https://chat.openai.com
Sec-Fetch-Dest: empty
Sec-Fetch-Mode: cors
Sec-Fetch-Site: cross-site
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/117.0.0.0 Safari/537.36 Edg/117.0.2045.60
sec-ch-ua: "Microsoft Edge";v="117", "Not;A=Brand";v="8", "Chromium";v="117"
sec-ch-ua-mobile: ?0
sec-ch-ua-platform: "Windows"
x-ms-blob-type: BlockBlob
x-ms-version: 2020-04-08
No response body.

3. PUT the image

https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A24%3A03Z&sp=c&sv=2021-08-06&sr=b&sig=oXzcBB7Q8HWyZr6JUSbuUYtgwgOWVdia7EiO8ALBe%2Bw%3D
Accept: application/json, text/plain, */*
Accept-Encoding: gzip, deflate, br
Accept-Language: zh-CN,zh;q=0.9,en;q=0.8,en-GB;q=0.7,en-US;q=0.6
Connection: keep-alive
Content-Length: 51010
Content-Type: image/jpeg
Host: fileserviceuploadsperm.blob.core.windows.net
Origin: https://chat.openai.com
Sec-Fetch-Dest: empty
Sec-Fetch-Mode: cors
Sec-Fetch-Site: cross-site
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/117.0.0.0 Safari/537.36 Edg/117.0.2045.60
sec-ch-ua: "Microsoft Edge";v="117", "Not;A=Brand";v="8", "Chromium";v="117"
sec-ch-ua-mobile: ?0
sec-ch-ua-platform: "Windows"
x-ms-blob-type: BlockBlob
x-ms-version: 2020-04-08

Request body (form-data):
the local image file, selected by its path
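
The PUT above could be reproduced roughly as follows. This is a sketch under assumptions: it sends the raw image bytes as the request body, which is how Azure's Put Blob operation receives content, and it keeps only the three headers that look essential. `build_blob_put_request` is a hypothetical helper name, and the browser-specific headers from the capture are left out.

```python
import urllib.request


def build_blob_put_request(
    upload_url: str, image_bytes: bytes, content_type: str = "image/jpeg"
) -> urllib.request.Request:
    """Prepare (but do not send) the PUT request for the upload_url from step 1."""
    headers = {
        "Content-Type": content_type,
        "x-ms-blob-type": "BlockBlob",  # value taken from the capture
        "x-ms-version": "2020-04-08",   # value taken from the capture
    }
    # urllib adds Content-Length automatically for a bytes body when sending.
    return urllib.request.Request(
        upload_url, data=image_bytes, headers=headers, method="PUT"
    )


# Usage (performs a real network call, so it is left commented out):
# with open("2e0edc6e489ed13a3f32f0dd87527d77.jpg", "rb") as f:
#     req = build_blob_put_request(upload_url, f.read())
# urllib.request.urlopen(req)
```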

4. Confirm the upload

https://chat.openai.com/backend-api/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1/uploaded

Authorization:Bearer eyJ*****tYwLb9hr6LP7g
Cookie:  *HS%2BvXw%3D

Submit an empty body:
{}

Response:
{
    "status": "success",
    "download_url": "https://fileserviceuploadsperm.blob.core.windows.net/files/file-KVRbTP1Xy0NP1WKKZRDKSPL1?se=2023-10-16T11%3A34%3A11Z&sp=r&sv=2021-08-06&sr=b&rscd=attachment%3B%20filename%3D2e0edc6e489ed13a3f32f0dd87527d77.jpg&sig=LJbEvMeV6KtlkTi6Y1udDd%2BK3YSbgfUWnsGCEi5rGSs%3D",
    "metadata": null
}
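
A minimal sketch of this step, assuming the request is a POST with a JSON content type (the capture does not show the method, so both are guesses). The helper names are hypothetical, and the token and cookie values must come from your own browser session.

```python
import json
import urllib.request


def build_uploaded_request(
    file_id: str, bearer_token: str, cookie: str
) -> urllib.request.Request:
    """Prepare (but do not send) the request to the .../uploaded endpoint."""
    url = f"https://chat.openai.com/backend-api/files/{file_id}/uploaded"
    headers = {
        "Authorization": f"Bearer {bearer_token}",
        "Cookie": cookie,
        "Content-Type": "application/json",  # assumption, not shown in the capture
    }
    # The body is the empty JSON object, as in the capture.
    return urllib.request.Request(url, data=b"{}", headers=headers, method="POST")


def parse_uploaded_response(body: str) -> str:
    """Return download_url from the response, or raise if the status is not success."""
    data = json.loads(body)
    if data.get("status") != "success":
        raise ValueError(f"unexpected status: {data.get('status')!r}")
    return data["download_url"]
```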

5. Attach the image to a ChatGPT question and ask

https://chat.openai.com/backend-api/conversation
Authorization:Bearer eyJhbGciOiJSU******%3D


{
	"action": "next",
	"messages": [{
		"id": "aaa27270-f62e-454a-962c-f62794152450",
		"author": {
			"role": "user"
		},
		"content": {
			"content_type": "multimodal_text",
			"parts": [{
				"asset_pointer": "file-service://file-KVRbTP1Xy0NP1WKKZRDKSPL1",
				"size_bytes": 239505,
				"width": 1706,
				"height": 1280
			}, "再看看"]
		},
		"metadata": {}
	}],
	"conversation_id": "41181b17-b71d-4748-8db5-3a80d7a27a33",
	"parent_message_id": "49bd950f-372f-45f6-9f72-b6f27fa04373",
	"model": "gpt-4",
	"timezone_offset_min": -480,
	"suggestions": [],
	"history_and_training_disabled": false,

	"force_paragen": false
}

The response contains the answer.
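
The conversation body above can be assembled programmatically. The sketch below mirrors the captured fields; `build_conversation_payload` is a hypothetical helper, and generating the message id with `uuid.uuid4()` is an assumption, since the capture only shows that the id is UUID-shaped. The image dimensions and byte size are passed in rather than computed.

```python
import json
import uuid


def build_conversation_payload(
    file_id: str,
    size_bytes: int,
    width: int,
    height: int,
    question: str,
    conversation_id: str,
    parent_message_id: str,
    model: str = "gpt-4",
) -> dict:
    """Build the request body that links an uploaded image to a question."""
    image_part = {
        "asset_pointer": f"file-service://{file_id}",
        "size_bytes": size_bytes,
        "width": width,
        "height": height,
    }
    message = {
        "id": str(uuid.uuid4()),  # assumption: any fresh UUID is accepted
        "author": {"role": "user"},
        # The image part comes first, followed by the text of the question.
        "content": {"content_type": "multimodal_text", "parts": [image_part, question]},
        "metadata": {},
    }
    return {
        "action": "next",
        "messages": [message],
        "conversation_id": conversation_id,
        "parent_message_id": parent_message_id,
        "model": model,
        "timezone_offset_min": -480,
        "suggestions": [],
        "history_and_training_disabled": False,
        "force_paragen": False,
    }


payload = build_conversation_payload(
    "file-KVRbTP1Xy0NP1WKKZRDKSPL1", 239505, 1706, 1280, "再看看",
    "41181b17-b71d-4748-8db5-3a80d7a27a33",
    "49bd950f-372f-45f6-9f72-b6f27fa04373",
)
body = json.dumps(payload, ensure_ascii=False)
```

The resulting `body` string would be sent to the conversation endpoint with the same Authorization header as in the earlier steps.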