LLM之RAG实战（四十四）| rag-chatbot：支持Huggingface和Ollama任意模型的多PDF本地RAG方案

wshzd2024-10-26 21:48

特点：

支持本地运行和Kaggle (new)运行
支持Huggingface 和Ollama 的任意模型
Process multiple PDF inputs.
Chat with multiples languages (Coming soon).
Simple UI with Gradio.

一、安装使用

1.1 Kaggle（推荐）

Step1：把https://github.com/datvodinh/rag-chatbot/blob/main/notebooks/kaggle.ipynb脚本导入到Kaggle。

Step2：把<YOUR_NGROK_TOKEN>替换为自己的token。

1.2 本地安装

a）克隆项目

复制代码

git clone https://github.com/datvodinh/rag-chatbot.gitcd rag-chatbot

b）安装

Docker方式

复制代码

docker compose up --build

脚本方式（Ollama, Ngrok, python package）

复制代码

source ./scripts/install_extra.sh

手动安装

`Step1：Ollama`

MacOS, Window: Download
Linux

curl -fsSL https://ollama.com/install.sh | sh

`Step2：Ngrok`

Macos

brew install ngrok/ngrok/ngrok
Linux

curl -s https://ngrok-agent.s3.amazonaws.com/ngrok.asc | sudo tee /etc/apt/trusted.gpg.d/ngrok.asc >/dev/null && echo "deb https://ngrok-agent.s3.amazonaws.com buster main" | sudo tee /etc/apt/sources.list.d/ngrok.list && sudo apt update && sudo apt install ngrok

Step3：安装rag_chatbot包

复制代码

source ./scripts/install.sh

c）启动

复制代码

source ./scripts/run.sh

或者

复制代码

python -m rag_chatbot --host localhost

使用Ngrok

复制代码

source ./scripts/run.sh --ngrok

此时，会下载大模型

大模型的配置文件：https://github.com/datvodinh/rag-chatbot/blob/main/rag_chatbot/setting/setting.py

LLM默认是：llama3:8b-instruct-q8_0

Embedding模型默认是：BAAI/bge-large-en-v1.5

此时，登录http://0.0.0.0:7860即可访问：

参考文献：

1\] https://github.com/datvodinh/rag-chatbot

上一篇：基于SpringBoot的高校体测管理系统设计与实现（源码+定制+开发）高校体测记录系统设计、高校体测信息管理平台、智能体测管理系统开发、高校体测记录系统设计

下一篇：基于neo4j的新冠治疗和新冠患者轨迹的知识图谱问答系统

热门推荐

01UV安装并设置国内源 02Qwen3-Coder 快速上手教程 | Qwen Code + Claude Code 03KGG转MP3工具|非KGM文件|解密音频 04【2025.08.06最新版】Android Studio下载、安装及配置记录（自动下载sdk）052025最新国内服务器可用docker源仓库地址大全（2025年8月更新）06蜘蛛磁力搜索引擎大全，如何使用蜘蛛磁力查找磁力链接 07TRAE Rules 实践：为项目配置 6A 工作流 08全球最强模型Grok4，国内已可免费使用！（附教程）09NVIDIA显卡驱动、CUDA、cuDNN 和 TensorRT 版本匹配指南 10TRAE 规则（Rules）配置指南：个人习惯、团队规范与最佳实践