1. Video demo
2. Project: go-llama.cpp
Download the project and build it:
```bash
git clone --recurse-submodules https://github.com/go-skynet/go-llama.cpp
cd go-llama.cpp
make libbinding.a
```
The project also applies a patch to ...
The build succeeds; there are a few compiler warnings, but nothing serious.
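Besides the bundled examples, libbinding.a can also be linked into your own Go program through the go-llama.cpp package. The sketch below is based on the llama.New / Predict API shown in the go-llama.cpp README; the prompt, context size, and token count are placeholder values of my own, and the same LIBRARY_PATH / C_INCLUDE_PATH environment variables are needed when building it.

```go
package main

// Minimal sketch of calling the go-llama.cpp binding directly, assuming the
// llama.New / Predict API from the project's README. Run it from inside the
// go-llama.cpp checkout with the same environment as the examples:
//   LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run .
import (
	"fmt"
	"log"

	llama "github.com/go-skynet/go-llama.cpp"
)

func main() {
	// Load the GGUF model; the path matches the one used below,
	// the context size is a placeholder.
	l, err := llama.New(
		"/data/home/test/hf_cache/llama-2-7b-chat.Q2_K.gguf",
		llama.SetContext(512),
	)
	if err != nil {
		log.Fatalf("Loading the model failed: %v", err)
	}
	defer l.Free()

	// Generate a reply with 14 threads, mirroring the -t 14 flag used below.
	out, err := l.Predict(
		"Hello, who are you?",
		llama.SetThreads(14),
		llama.SetTokens(128),
	)
	if err != nil {
		log.Fatalf("Prediction failed: %v", err)
	}
	fmt.Println(out)
}
```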
3. Then run the llama-2-7b-chat model
```bash
LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/data/home/test/hf_cache/llama-2-7b-chat.Q2_K.gguf" -t 14
LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/data/home/test/hf_cache/qwen1_5-0_5b-chat-q6_k.gguf" -t 14
```
The first command, with llama-2-7b-chat, runs; the second one fails, because the llama.cpp revision bundled with go-llama.cpp does not recognize the 'qwen2' architecture:

```bash
error loading model: unknown model architecture: 'qwen2'
llama_load_model_from_file: failed to load model
llama_init_from_gpt_params: error: failed to load model '/data/home/test/hf_cache/qwen1_5-0_5b-chat-q6_k.gguf'
load_binding_model: error: unable to load model
Loading the model failed: failed loading model
```
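The architecture that llama.cpp complains about is the general.architecture key stored in the GGUF file's metadata. If you want to confirm what a given .gguf file declares before loading it, the metadata can be read directly; the following is an illustrative sketch of the GGUF v2/v3 layout (header, then length-prefixed key/value pairs), not code taken from go-llama.cpp.

```go
package main

// Dump the string-valued metadata keys of a GGUF file, including
// general.architecture. Illustrative sketch of the GGUF v2/v3 format.
import (
	"encoding/binary"
	"fmt"
	"io"
	"log"
	"os"
)

// readString reads a GGUF string: uint64 length followed by raw bytes.
func readString(r io.Reader) string {
	var n uint64
	binary.Read(r, binary.LittleEndian, &n)
	buf := make([]byte, n)
	io.ReadFull(r, buf)
	return string(buf)
}

// skipValue consumes a metadata value of the given GGUF type code.
func skipValue(r io.Reader, typ uint32) {
	// Fixed-size type codes: u8, i8, u16, i16, u32, i32, f32, bool, u64, i64, f64.
	sizes := map[uint32]int64{0: 1, 1: 1, 2: 2, 3: 2, 4: 4, 5: 4, 6: 4, 7: 1, 10: 8, 11: 8, 12: 8}
	switch typ {
	case 8: // string
		readString(r)
	case 9: // array: element type, count, then elements
		var elemType uint32
		var count uint64
		binary.Read(r, binary.LittleEndian, &elemType)
		binary.Read(r, binary.LittleEndian, &count)
		for i := uint64(0); i < count; i++ {
			skipValue(r, elemType)
		}
	default:
		io.CopyN(io.Discard, r, sizes[typ])
	}
}

func main() {
	if len(os.Args) < 2 {
		log.Fatal("usage: ggufinfo <model.gguf>")
	}
	f, err := os.Open(os.Args[1])
	if err != nil {
		log.Fatal(err)
	}
	defer f.Close()

	// GGUF header: magic, version, tensor count, metadata key/value count.
	var header struct {
		Magic   [4]byte
		Version uint32
		Tensors uint64
		KVCount uint64
	}
	if err := binary.Read(f, binary.LittleEndian, &header); err != nil || string(header.Magic[:]) != "GGUF" {
		log.Fatal("not a GGUF file")
	}

	for i := uint64(0); i < header.KVCount; i++ {
		key := readString(f)
		var typ uint32
		binary.Read(f, binary.LittleEndian, &typ)
		if typ == 8 { // print string values, e.g. general.architecture = qwen2
			fmt.Printf("%s = %s\n", key, readString(f))
		} else {
			skipValue(f, typ)
		}
	}
}
```

Seeing "general.architecture = qwen2" in the output confirms the model file is fine; the problem is simply that the llama.cpp submodule pinned by go-llama.cpp predates qwen2 support.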