1.简介
组织机构:百川智能(前搜狗CEO王小川创立)
模型:baichuan-inc/Baichuan2-7B-Chat-4bits
下载:https://huggingface.co/baichuan-inc/Baichuan2-7B-Chat-4bits
镜像下载:https://aliendao.cn/models/baichuan-inc/Baichuan2-7B-Chat-4bits
硬件环境:暗影精灵7Plus
Windows版本:Windows 11家庭中文版 Insider Preview 22H2
内存 32G
GPU显卡:Nvidia GTX 3080 Laptop (16G)
参考资料:https://zhuanlan.zhihu.com/p/658999248
2.代码和模型下载
下载代码仓:
data:image/s3,"s3://crabby-images/56910/5691045f3b7036a827c51666088fd301dfc80c09" alt=""
打开 https://huggingface.co/baichuan-inc/Baichuan2-7B-Chat-4bits 下载模型
并将其拷贝到 d:\Baichuan2\baichuan-inc\Baichuan2-7B-Chat-4bits 目录
data:image/s3,"s3://crabby-images/c6729/c67298cdcdb7a861809cf94bde8b8142f86e051d" alt=""
3.安装依赖
创建conda环境
conda create -n baichuan2 python=3.10
conda activate baichuan2
data:image/s3,"s3://crabby-images/58c74/58c745d90c2aa6f9714b152fc943632191a34484" alt=""
cd Baichuan2
安装量化包:
pip install bitsandbytes --prefer-binary --extra-index-url=https://jllllll.github.io/bitsandbytes-windows-webui
data:image/s3,"s3://crabby-images/0bec5/0bec51693bc13499a808a62db0762b9c33ff7557" alt=""
data:image/s3,"s3://crabby-images/08396/08396740da6393452064a00d6e6bcfcf32ab601b" alt=""
安装Pytorch 2.0.1 for CUDA
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
data:image/s3,"s3://crabby-images/73b7e/73b7edf63ec95fe65ce51035a7a14a19b77e212a" alt=""
data:image/s3,"s3://crabby-images/5608d/5608d6275e6452d63a0f49f69b7acde0c861694a" alt=""
安装加速包:
pip install xformers
data:image/s3,"s3://crabby-images/6b87d/6b87d8f178ea33ac722826ccec1547277bad2a44" alt=""
将代码 cli_demo.py 改成4bit量化的模型:
data:image/s3,"s3://crabby-images/d15fe/d15fee9a6e3a2d912762b8228c52bca12ec91f03" alt=""
d:
cd d:\Baichuan2
pip install -r requirements.txt
data:image/s3,"s3://crabby-images/1636d/1636da9e7b904de874f351988bec7747573447a3" alt=""
4.部署验证
运行命令行模式:
python cli_demo.py
data:image/s3,"s3://crabby-images/7136b/7136b29f5503bc9d2bb304aa48b565a06216048b" alt=""
做一些简单的交互:
data:image/s3,"s3://crabby-images/7a008/7a00832f10a4df6410ba50e38eb3a3f660f3213f" alt=""
修改web_demo.py文件:
data:image/s3,"s3://crabby-images/c4f91/c4f91a7c4cedd9790c41c5df545a039d0bdf239e" alt=""
运行网页模式:
python web_demo.py
data:image/s3,"s3://crabby-images/d5f02/d5f023e2da585cacdb38bfc3f6b059a54942cd0d" alt=""
这里好像哪里不对,但是系统提示可以使用streamlit运行:
streamlit run web_demo.py
系统自动打开浏览器:
data:image/s3,"s3://crabby-images/b6b82/b6b82691c0830e22ad2be814916ce9774a412e4a" alt=""
data:image/s3,"s3://crabby-images/4233a/4233a38c061e29b4529f5048e33ee854501a590d" alt=""
做一些简单的交互:
data:image/s3,"s3://crabby-images/52c17/52c17ad459b855d344b60eac97c336f49c0d4865" alt=""
data:image/s3,"s3://crabby-images/d5f80/d5f802f3fc85d19e6f898ff6e58b7bbe56d48cbd" alt=""
(全文完,谢谢阅读)