唯品会 item_search_img 接口深度分析及 Python 实现

唯品会的 item_search_img 接口（又称 "唯品会拍立淘"）是唯品会开放平台提供的图像搜索接口，支持通过图片（URL 或本地图片）搜索唯品会平台上的同款或相似商品。该接口基于图像识别技术，能够快速匹配视觉特征相似的商品，广泛应用于商品导购、同款比价、库存查找等场景。

一、接口核心特性分析

接口功能与定位

核心功能：通过图片特征匹配唯品会平台商品，返回相似商品列表及详细信息
技术原理：基于深度学习的图像特征提取与比对，支持商品主体识别和特征匹配
应用场景：
- 商品导购：用户上传图片快速找到同款商品
- 比价工具：通过图片找到同款商品进行价格对比
- 库存查询：商家通过样品图片查询平台库存情况
- 市场分析：根据热销商品图片分析市场趋势

认证机制

唯品会开放平台采用 appkey + access_token 的认证方式：

开发者在唯品会开放平台注册应用，获取 appkey 和 appsecret
使用 appkey 和 appsecret 获取 access_token（有有效期限制）
每次接口调用时，在请求参数中携带 access_token 进行身份验证

核心参数与响应结构

请求参数

参数名	类型	是否必填	说明
`image_url`	String	二选一	图片 URL（公网可访问）
`image_base64`	String	二选一	本地图片 Base64 编码（不含前缀）
`page`	Integer	否	页码，默认 1
`page_size`	Integer	否	每页数量，默认 20，最大 50
`sort`	String	否	排序方式：`similarity`（相似度）、`price_asc`（价格升序）等
`access_token`	String	是	访问令牌

响应核心字段

分页信息：总商品数、总页数、当前页码
商品列表：每个商品包含
- 基础信息：商品 ID、标题、主图、详情链接
- 价格信息：原价、折扣价、折扣率
- 相似度信息：与搜索图片的匹配度
- 店铺信息：店铺名称、类型
- 库存信息：是否有货、大致库存范围

二、Python 脚本实现

以下是调用唯品会 item_search_img 接口的完整实现，支持图片 URL 和本地图片两种搜索方式： import requests import time import json import base64 import logging from typing import Dict, Optional, List from requests.exceptions import RequestException from pathlib import Path

配置日志 logging.basicConfig( level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s" )

class VipImageSearchAPI: def init (self, appkey: str, appsecret: str): """ 初始化唯品会图片搜索API客户端 :param appkey: 唯品会开放平台appkey :param appsecret: 唯品会开放平台appsecret """ self.appkey = appkey self.appsecret = appsecret self.base_url = "api.vip.com" self.access_token = None self.token_expires_at = 0 # token过期时间戳 self.session = requests.Session() self.session.headers.update({ "Content-Type": "application/json", "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36" })

python 复制代码

def _get_access_token(self) -> Optional[str]:
    """获取访问令牌"""
    # 检查token是否有效
    if self.access_token and self.token_expires_at > time.time() + 60:
        return self.access_token
        
    logging.info("获取新的access_token")
    url = f"{self.base_url}/oauth2/token"
    
    params = {
        "grant_type": "client_credentials",
        "client_id": self.appkey,
        "client_secret": self.appsecret
    }
    
    try:
        response = self.session.get(url, params=params, timeout=10)
        response.raise_for_status()
        result = response.json()
        
        if "access_token" in result:
            self.access_token = result["access_token"]
            self.token_expires_at = time.time() + result.get("expires_in", 3600)
            return self.access_token
        else:
            logging.error(f"获取access_token失败: {result.get('error_description', '未知错误')}")
            return None
            
    except RequestException as e:
        logging.error(f"获取access_token请求异常: {str(e)}")
        return None

def _local_image_to_base64(self, image_path: str) -> Optional[str]:
    """将本地图片转换为Base64编码"""
    image_file = Path(image_path)
    
    # 检查文件是否存在
    if not image_file.exists() or not image_file.is_file():
        logging.error(f"图片文件不存在: {image_path}")
        return None
        
    # 检查文件格式
    valid_extensions = ['.jpg', '.jpeg', '.png', '.gif']
    if image_file.suffix.lower() not in valid_extensions:
        logging.error(f"不支持的图片格式: {image_file.suffix}，支持格式: {valid_extensions}")
        return None
        
    try:
        # 读取并编码图片
        with open(image_path, 'rb') as f:
            image_data = f.read()
            # 检查图片大小，避免过大
            if len(image_data) > 10 * 1024 * 1024:  # 10MB
                logging.error("图片大小超过10MB限制")
                return None
            base64_str = base64.b64encode(image_data).decode('utf-8')
        return base64_str
    except Exception as e:
        logging.error(f"图片编码失败: {str(e)}")
        return None

def search_by_image(self, 
                   image_url: Optional[str] = None, 
                   image_path: Optional[str] = None,
                   page: int = 1, 
                   page_size: int = 20,
                   sort: str = "similarity") -> Optional[Dict]:
    """
    按图片搜索唯品会商品
    :param image_url: 图片URL（二选一）
    :param image_path: 本地图片路径（二选一）
    :param page: 页码
    :param page_size: 每页数量
    :param sort: 排序方式
    :return: 搜索结果
    """
    # 验证图片参数
    if not image_url and not image_path:
        logging.error("必须提供image_url或image_path参数")
        return None
        
    # 获取有效的access_token
    if not self._get_access_token():
        return None
        
    url = f"{self.base_url}/item/search/img"
    
    # 构建请求参数
    params = {
        "page": page,
        "page_size": page_size,
        "sort": sort,
        "access_token": self.access_token
    }
    
    # 添加图片参数
    if image_url:
        params["image_url"] = image_url
    else:
        # 处理本地图片
        base64_image = self._local_image_to_base64(image_path)
        if not base64_image:
            return None
        params["image_base64"] = base64_image
        
    try:
        response = self.session.get(url, params=params, timeout=20)  # 图片搜索耗时较长
        response.raise_for_status()
        result = response.json()
        
        # 检查响应状态
        if result.get("code") == 0:
            # 格式化响应数据
            return self._format_response(result.get("data", {}))
        else:
            logging.error(f"图片搜索失败: {result.get('message', '未知错误')} (错误码: {result.get('code')})")
            return None
            
    except RequestException as e:
        logging.error(f"图片搜索请求异常: {str(e)}")
        return None
    except json.JSONDecodeError:
        logging.error(f"图片搜索响应解析失败: {response.text[:200]}...")
        return None

def _format_response(self, response_data: Dict) -> Dict:
    """格式化响应数据"""
    # 分页信息
    pagination = {
        "total_items": int(response_data.get("total", 0)),
        "total_pages": (int(response_data.get("total", 0)) + int(response_data.get("page_size", 20)) - 1) // int(response_data.get("page_size", 20)),
        "current_page": int(response_data.get("page", 1)),
        "page_size": int(response_data.get("page_size", 20))
    }
    
    # 格式化商品列表
    products = []
    for item in response_data.get("items", []):
        products.append({
            "goods_id": item.get("goods_id"),
            "title": item.get("title"),
            "main_image": item.get("main_image"),
            "detail_url": item.get("detail_url"),
            "price": {
                "original_price": float(item.get("original_price", 0)),
                "vip_price": float(item.get("vip_price", 0)),
                "discount": float(item.get("discount", 0))
            },
            "similarity": float(item.get("similarity", 0)),  # 相似度（0-100）
            "stock_status": item.get("stock_status"),  # 库存状态
            "shop_info": {
                "shop_id": item.get("shop_id"),
                "shop_name": item.get("shop_name"),
                "shop_type": item.get("shop_type")
            },
            "brand_info": {
                "brand_id": item.get("brand_id"),
                "brand_name": item.get("brand_name")
            }
        })
        
    return {
        "pagination": pagination,
        "products": products,
        "search_id": response_data.get("search_id")  # 搜索ID，用于后续操作
    }

def search_all_pages(self, image_url: Optional[str] = None, 
                    image_path: Optional[str] = None,
                    max_pages: int = 5) -> List[Dict]:
    """
    获取多页搜索结果
    :param image_url: 图片URL
    :param image_path: 本地图片路径
    :param max_pages: 最大页数限制
    :return: 所有商品列表
    """
    all_products = []
    page = 1
    
    while page <= max_pages:
        logging.info(f"获取第 {page} 页搜索结果")
        result = self.search_by_image(
            image_url=image_url,
            image_path=image_path,
            page=page,
            page_size=50  # 使用最大页大小减少请求次数
        )
        
        if not result or not result["products"]:
            break
            
        all_products.extend(result["products"])
        
        # 检查是否已到最后一页
        if page >= result["pagination"]["total_pages"]:
            break
            
        page += 1
        # 添加延迟，避免触发频率限制
        time.sleep(2)  # 图片搜索接口限制更严格，延迟更长一些
        
    return all_products

示例调用 if name == "main": # 替换为实际的appkey和appsecret（从唯品会开放平台获取） APPKEY = "your_appkey" APPSECRET = "your_appsecret"

python 复制代码

# 初始化API客户端
api = VipImageSearchAPI(APPKEY, APPSECRET)

# 方式1：通过图片URL搜索
# search_result = api.search_by_image(
#     image_url="https://img.vip.com/xxx.jpg",  # 替换为实际图片URL
#     page=1,
#     page_size=20,
#     sort="similarity"
# )

# 方式2：通过本地图片搜索
search_result = api.search_by_image(
    image_path="./product_sample.jpg",  # 替换为本地图片路径
    page=1,
    page_size=20,
    sort="similarity"
)

# 方式3：获取多页结果
# search_result = api.search_all_pages(
#     image_path="./product_sample.jpg",
#     max_pages=3
# )

if isinstance(search_result, dict) and "products" in search_result:
    print(f"共找到 {search_result['pagination']['total_items']} 件相似商品")
    print(f"当前第 {search_result['pagination']['current_page']}/{search_result['pagination']['total_pages']} 页\n")
    
    # 打印前5件商品信息
    for i, product in enumerate(search_result["products"][:5], 1):
        print(f"{i}. {product['title']} (相似度: {product['similarity']}%)")
        print(f"   品牌: {product['brand_info']['brand_name']}")
        print(f"   原价: {product['price']['original_price']}元 → 折扣价: {product['price']['vip_price']}元 ({product['price']['discount']}折)")
        print(f"   店铺: {product['shop_info']['shop_name']} ({product['shop_info']['shop_type']})")
        print(f"   库存状态: {product['stock_status']}")
        print(f"   链接: {product['detail_url']}")
        print("-" * 100)
elif isinstance(search_result, list):
    # 处理多页结果
    print(f"共获取到 {len(search_result)} 件商品")

三、接口调用关键技术与注意事项

图片处理最佳实践

图片质量：清晰的商品主体图片（无水印、无遮挡）识别效果最佳
图片尺寸：建议图片尺寸在 500x500 到 1000x1000 像素之间
图片格式：优先使用 JPG 格式，识别成功率高于 PNG 和 GIF
Base64 处理：
- 必须移除 Base64 编码前缀（如data:image/jpeg;base64,）
- 图片大小控制在 10MB 以内，过大可能导致请求失败
URL 图片：确保图片 URL 为公网可访问，响应速度快的图片源识别效率更高

常见错误及解决方案

错误码	说明	解决方案
400	请求参数错误	检查图片参数是否正确提供，格式是否符合要求
401	未授权或 token 无效	重新获取 access_token
403	权限不足	检查应用是否已申请图片搜索接口权限
413	请求实体过大	图片大小超过限制，压缩图片后重试
429	调用频率超限	降低调用频率，增加请求间隔
500	服务器内部错误	稍后重试，或联系唯品会技术支持
10001	图片处理失败	检查图片是否损坏，尝试更换图片

性能优化建议

请求频率控制：图片搜索接口 QPS 限制通常较低（5 次 / 秒左右），需严格控制调用频率
图片预处理：对本地图片进行压缩和裁剪，突出商品主体，提高识别准确率
结果缓存：相同图片的搜索结果可缓存 30-60 分钟，减少重复调用
异步处理：对于批量图片搜索，采用异步队列方式处理，避免超时
相似度过滤：可设置相似度阈值（如 > 70%），过滤低相似度结果四、应用场景与扩展典型应用场景
智能导购系统：用户上传图片快速找到同款商品
跨平台比价工具：通过图片识别不同平台的同款商品进行价格对比
库存查询系统：商家通过样品图片查询平台库存情况
电商选品工具：根据市场热销商品图片寻找类似款式商品扩展建议
结合商品详情接口获取更完整的产品信息
实现相似度排序和过滤，只展示高相似度商品
开发批量图片搜索功能，支持多张图片同时查询
添加图片编辑功能，允许用户裁剪、旋转图片以提高识别率
实现搜索历史记录功能，方便用户查看之前的搜索结果通过合理使用唯品会图片搜索接口，开发者可以构建便捷的商品查找工具，提升用户体验，同时为电商运营提供市场分析数据支持。使用时需遵守唯品会开放平台的使用规范，确保数据使用的合法性。