本地/笔记本/纯 cpu 部署、使用类 gpt 大模型

使用 web UI + 大模型文件，即可在笔记本上部署、使用类 gpt 大模型。

sh 复制代码

conda create -n textgen python=3.11
conda activate textgen

System	GPU	Command
Linux/WSL	NVIDIA	`pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121`
Linux/WSL	CPU only	`pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu`
Linux	AMD	`pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm5.6`
MacOS + MPS	Any	`pip3 install torch torchvision torchaudio`
Windows	NVIDIA	`pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121`
Windows	CPU only	`pip3 install torch torchvision torchaudio`

sh 复制代码

pip install -r <requirements file according to table below>

GPU	CPU	requirements file to use
NVIDIA	has AVX2	`requirements.txt`
NVIDIA	no AVX2	`requirements_noavx2.txt`
AMD	has AVX2	`requirements_amd.txt`
AMD	no AVX2	`requirements_amd_noavx2.txt`
CPU only	has AVX2	`requirements_cpu_only.txt`
CPU only	no AVX2	`requirements_cpu_only_noavx2.txt`
Apple	Intel	`requirements_apple_intel.txt`
Apple	Apple Silicon	`requirements_apple_silicon.txt`

TheBloke 是 hugging face 社区的一个用户， ta 提供了许多预量化大模型的下载。

在该用户的 model 库中搜索需要的模型，常用关键词是 7b-gguf。

在具体模型页面的 Provided files 部分可以看到该模型的不同量化版本、文件大小、预计内存占用、推荐与否。点击具体量化版本的模型即可下载。

打开 conda 命令行窗口，运行以下命令，并保持窗口开启：

sh 复制代码

conda activate textgen
cd text-generation-webui
python server.py

打开 127.0.0.1:7860 网页链接，model 页面，按上图进行模型加载即可（大概需要几十秒）。

打开 chat 页面，即可进行对话。