whisper.cpp: PC端测试 -- 电脑端部署音频大模型

带电的小王2024-12-22 9:49

whisper.cpp: PC端测试

1.环境需要

以下是经实验验证可行的环境参考，也可尝试其他版本。

（1）PC：Ubuntu 22.04.4

（2）软件环境：如下表所示

工具	版本	安装
Anaconda	2021.05	参考：Anaconda3安装及使用
cmake	3.26.0	conda install -c conda-forge cmake ; conda install -c conda-forge c-compiler ; conda install -c conda-forge cxx-compiler
make	4.4.1	conda install -c conda-forge make
ndk	android-ndk-r26c	官网下载解压，设置环境变量
adb	1.0.41	conda install -c conda-forge adb
git	2.34.1	sudo apt-get update ; sudo apt-get install git
git-lfs	3.6.0	sudo apt-get install git-lfs

2.构建项目

（1）克隆项目：

shell 复制代码

git lfs install
git clone https://github.com/ggerganov/whisper.cpp.git
cd whisper.cpp

（2）下载模型：

shell 复制代码

# 修改脚本./models/download-ggml-model.sh：
#      替换huggingface.co为镜像网站hf-mirror.com（因不能访问外网）
#src="https://huggingface.co/ggerganov/whisper.cpp"
src="https://hf-mirror.com/ggerganov/whisper.cpp"
    #src="https://huggingface.co/akashmjn/tinydiarize-whisper.cpp"
    src="https://hf-mirror.com/akashmjn/tinydiarize-whisper.cpp"

# 下载模型：下载其中一个 Whisper 模型 转换为 ggml 格式。例如：
sh ./models/download-ggml-model.sh base.en

（3）PC构建：

shell 复制代码

# build the main example
cmake -B build
cmake --build build --config Release

3.PC测试

shell 复制代码

# transcribe an audio file
./build/bin/main -f samples/jfk.wav

注意，main示例仅使用 16 位 WAV 运行，因此请确保音频是此格式文件，如不是请转换格式。例如：

shell 复制代码

# mp3转wav
ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav

说明：要进行快速演示，只需运行 make base.en 与以上操作等效。

声明：资源可能存在第三方来源，若有侵权请联系删除！