elasticsearch8.12.0安装分词

上篇说到,安装了es后正常运行

es分词下载地址

从 GitHub Release 下载(推荐)

👉 https://github.com/medcl/elasticsearch-analysis-ik/releases

https://release.infinilabs.com/analysis-ik/stable/

安装:

选择与你 ES 版本匹配的包,例如:

复制代码
elasticsearch-analysis-ik-8.12.0.zip

下载命令:

复制代码
cd /tmp
wget https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v8.12.0/elasticsearch-analysis-ik-8.12.0.zip

⚠️ 注意:不要下载 source code,要下载 assets 里的 .zip 文件。


3. 创建 plugins 目录(如果不存在)

Elasticsearch 插件默认安装在:

复制代码
$ES_HOME/plugins/ik/

创建目录:

复制代码
mkdir -p $ES_HOME/plugins/ik

4. 解压插件到 plugins 目录

复制代码
unzip elasticsearch-analysis-ik-8.12.0.zip -d $ES_HOME/plugins/ik/

$ES_HOME 是你的 Elasticsearch 安装目录,例如 /data/isee/apps/elasticsearch-8.12.0


5. 检查目录结构

安装完成后,目录结构应如下:

复制代码
$ES_HOME/plugins/ik/
├── plugin-descriptor.properties
├── plugin-security.policy
├── config/
│   ├── IKAnalyzer.cfg.xml
│   ├── main.dic
│   └── stopword.dic
└── lib/
    ├── elasticsearch-analysis-ik-8.12.0.jar
    └── commons-codec-1.9.jar
    └── ...

6. 修改配置文件(可选)

配置文件路径:

复制代码
$ES_HOME/plugins/ik/config/IKAnalyzer.cfg.xml

你可以添加自定义词典:

复制代码
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE properties SYSTEM "http://java.sun.com/dtd/properties.dtd">
<properties>
    <comment>IK Analyzer 扩展配置</comment>
    <entry key="ext_dict">custom.dic</entry>
    <entry key="ext_stopwords">stopwords.dic</entry>
</properties>

然后在 config/ 目录下创建 custom.dic,添加自定义词汇:

复制代码
人工智能
大模型
阿里云
Qwen

7. 设置权限(重要)

确保 Elasticsearch 用户有权限读取插件:

复制代码
chown -R isee:isee $ES_HOME/plugins/ik
# 或你运行 ES 的用户

8. 重启 Elasticsearch

复制代码
# 先停止
ps aux | grep elasticsearch
kill <pid>

# 启动
bin/elasticsearch -d

✅ 三、验证插件是否安装成功

1. 检查日志

查看 $ES_HOME/logs/isee_cluster.log,确认没有插件加载错误。

2. 调用分词 API 测试

bash 复制代码
# curl -X GET  -u elastic:9yZWp=3UnEVkBxYBhnlS "https://10.10.10.10:9200/_analyze" -H "Content-Type: application/json" -d'
> {
>   "analyzer": "ik_smart",
>   "text": "阿里巴巴推出通义千问大模型"
> }'
curl: (60) Peer's certificate issuer has been marked as not trusted by the user.
More details here: http://curl.haxx.se/docs/sslcerts.html

curl performs SSL certificate verification by default, using a "bundle"
 of Certificate Authority (CA) public keys (CA certs). If the default
 bundle file isn't adequate, you can specify an alternate file
 using the --cacert option.
If this HTTPS server uses a certificate signed by a CA represented in
 the bundle, the certificate verification probably failed due to a
 problem with the certificate (it might be expired, or the name might
 not match the domain name in the URL).
If you'd like to turn off curl's verification of the certificate, use
 the -k (or --insecure) option.

才想起来,我们是https的服务,有ca证书,先不认证证书,-k

bash 复制代码
# curl -k -X GET  -u elastic:9yZWp=3UnEVkBxYBhnlS "https://10.10.10.10:9200/_analyze" -H "Content-Type: application/json" -d'
> {
>   "analyzer": "ik_smart",
>   "text": "阿里巴巴推出通义千问大模型"
> }'
{"tokens":[{"token":"阿里巴巴","start_offset":0,"end_offset":4,"type":"CN_WORD","position":0},{"token":"推出","start_offset":4,"end_offset":6,"type":"CN_WORD","position":1},{"token":"通义","start_offset":6,"end_offset":8,"type":"CN_WORD","position":2},{"token":"千","start_offset":8,"end_offset":9,"type":"TYPE_CNUM","position":3},{"token":"问","start_offset":9,"end_offset":10,"type":"CN_CHAR","position":4},{"token":"大模型","start_offset":10,"end_offset":13,"type":"CN_WORD","position":5}]}[isee@host-10-15-32-71 elasticsearch-8.12.0]$ 

分词安装成功。

相关推荐
大树883 天前
金刚石散热越强,管路越先见顶
大数据·运维·服务器·人工智能·ai
摇滚侠3 天前
Linux CentOS7 rpm 安装 MySQL 5.7
linux·运维·mysql
霸道流氓气质3 天前
领域驱动设计(DDD)在 Spring Boot 微服务中的实践指南
运维·spring boot·微服务
Inhand陈工3 天前
基于台达PLC与映翰通IG502的智慧水产养殖精准投喂与远程运维解决方案
运维·人工智能·物联网·阿里云·信息与通信
酣大智3 天前
ARP代理--工作原理
运维·网络·arp·arp代理
shushangyun_3 天前
2026年快消品B2B系统推荐:支持终端门店订货、促销政策自动化的工具?
java·运维·网络·数据库·人工智能·spring·自动化
施努卡机器视觉3 天前
SNK施努卡侧滑门锁上滑轮总成自动化装配线,从零件到组件,全流程精密制造方案
运维·自动化·制造
AC赳赳老秦3 天前
用 OpenClaw 搭建服务器故障应急响应系统,自动处理 80% 常见运维故障
android·运维·服务器·python·rxjava·deepseek·openclaw
java_cj3 天前
深入kube-apiserver认证机制:从Bearer Token到mTLS的完整认证链解析
linux·运维·服务器·云原生·容器·kubernetes
lsyeei3 天前
linux 系统目录详解
linux·运维·服务器