报错 No available slot found for the embedding model

gs801402024-11-16 10:03

报错内容

Server error: 503 - [address=0.0.0.0:12781, pid=304366] No available slot found for the embedding model. We recommend to launch the embedding model first, and then launch the LLM models.

目前GPU占用情况如下

解决办法: 关闭大模型, 先把 embedding models 启动起来, 然后再启动 LLM 模型

启动 EMBBEDDING MODEL后的效果

启动LLM后的效果

上一篇：飞书文档只读限制复制

下一篇：c# 在10万条数据中判断是否存在很慢问题

热门推荐

01Qwen3-Coder 快速上手教程 | Qwen Code + Claude Code 02全球最强模型Grok4，国内已可免费使用！（附教程）03vue数据变化但页面不变 04KGG转MP3工具|非KGM文件|解密音频 05sqli-labs 靶场 less-8、9、10 第八关到第十关详解：布尔注入，时间注入 06扣子开源本地部署教程丨Coze智能体小白喂饭级指南 07干翻 Typora！MilkUp：完全免费的桌面端 Markdown 编辑器！08【2025.7.18】更新vscode后所有.vue文件template标签后报红的临时解决办法，Vue - Official 插件3.0.2导致 09ChatGPT Agent 完全使用指南：2025年7月最新功能详解 10《魔兽世界》提示lua警告的含义及解决方法