Elasticsearch - Distributed Search and Analytics Engine

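Contents

- I. About Elasticsearch
- II. Installation and Setup
  - 1. Cloud Deployment
  - 2. Local Development Environment
- III. Usage Examples
  - 1. Basic Operations
  - 2. Python Client
  - 3. Data Operations
- IV. Advanced Topics
  - 1. Building from Source
  - 2. Upgrading
- Other
  - Related Projects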

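I. About Elasticsearch

1. Project Overview

Elasticsearch is a distributed search and analytics engine, scalable data store, and vector database, optimized for speed and relevance on production-scale workloads. As the core component of the Elastic Stack, it supports:

- Near-real-time search over massive datasets
- Vector search
- Integration with generative AI applications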

2. Links and Resources

GitHub: https://github.com/elastic/elasticsearch
Website: https://www.elastic.co/products/elasticsearch
Official documentation: https://www.elastic.co/guide/en/elasticsearch/reference/current/index.html
Elasticsearch Service: https://www.elastic.co/cloud/as-a-service
Downloads: https://www.elastic.co/downloads/elasticsearch
Community support: Elastic Forum | Slack
License: Elastic License

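3. Features

Multiple search modes
- Full-text search
- Vector search
- Hybrid search
Data analytics
- Log analytics
- Metrics monitoring
- Application performance monitoring (APM)
- Security log analytics
AI integration
- Retrieval-augmented generation (RAG)
- Machine learning applications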

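II. Installation and Setup

1. Cloud Deployment

Elasticsearch Service is the recommended way to quickly create a managed cluster.

2. Local Development Environment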

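# Quick start with Docker (development and testing only)
curl -fsSL https://elastic.co/start-local | sh

Note: this setup is intended for local development only; production environments require separate configuration.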
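
The commands later in this article read the password from the `ES_LOCAL_PASSWORD` environment variable. The following is a minimal sketch of loading that value in Python, assuming the setup script left behind a file of plain `KEY=VALUE` lines. The helper name `parse_env` and the file location mentioned in the note below are assumptions for illustration, not something the article specifies.

```python
def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        # Ignore anything that is not a KEY=VALUE pair.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values
```

For example, if the file is at `elastic-start-local/.env` (an assumed path), `parse_env(open("elastic-start-local/.env", encoding="utf-8").read()).get("ES_LOCAL_PASSWORD")` would return the password, or `None` if the key is missing.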

III. Usage Examples

1. Basic Operations

# Create an index
curl -u "elastic:$ES_LOCAL_PASSWORD" -X PUT "http://localhost:9200/my-new-index"
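
The same request can be assembled with Python's standard library, which makes it clearer what the curl flags do: `-u` becomes an HTTP Basic `Authorization` header and `-X PUT` sets the method. This is a rough sketch; the function name `build_create_index_request` is hypothetical, and the request is only built, not sent.

```python
import base64
import urllib.request


def build_create_index_request(base_url, index, user, password):
    """Build (but do not send) a PUT request that creates `index`."""
    # HTTP Basic auth is "user:password", base64-encoded.
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/{index}",
        method="PUT",
        headers={"Authorization": f"Basic {token}"},
    )
```

Passing the returned object to `urllib.request.urlopen` sends it, which requires the local cluster to be running.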

2. Python Client

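import os

from elasticsearch import Elasticsearch

# Connect to the local cluster; the password is read from the environment.
client = Elasticsearch(
    "http://localhost:9200",
    basic_auth=("elastic", os.getenv("ES_LOCAL_PASSWORD")),
)
# Print basic cluster information to confirm the connection works.
print(client.info())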
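
Note that `os.getenv` returns `None` when `ES_LOCAL_PASSWORD` is unset, so a missing variable only shows up later as an authentication error. One possible way to fail earlier is sketched below. The helper `connection_settings` is hypothetical, and the host and user name simply mirror the values used above.

```python
import os


def connection_settings(env=None):
    """Return keyword arguments for Elasticsearch(...), failing early if the password is unset."""
    env = os.environ if env is None else env
    password = env.get("ES_LOCAL_PASSWORD")
    if not password:
        raise RuntimeError("ES_LOCAL_PASSWORD is not set")
    return {
        "hosts": "http://localhost:9200",
        "basic_auth": ("elastic", password),
    }
```

With the client installed, this would be used as `client = Elasticsearch(**connection_settings())`.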

3. Data Operations

// Add a document
POST /customer/_doc/1
{
  "firstname": "Jennifer",
  "lastname": "Walters"
}

// Bulk import
PUT /customer/_bulk
{ "create": { } }
{ "firstname": "Monica", "lastname": "Rambeau" }
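
The bulk body is newline-delimited JSON: each document is preceded by an action line, and the body has to end with a newline. A small sketch of generating such a body in Python follows. The function name `build_bulk_body` is an assumption for illustration, and the sample documents reuse the names from the requests above.

```python
import json


def build_bulk_body(docs):
    """Serialize docs as NDJSON: one `create` action line, then one source line, per document."""
    lines = []
    for doc in docs:
        lines.append(json.dumps({"create": {}}))
        lines.append(json.dumps(doc))
    # The bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


body = build_bulk_body([
    {"firstname": "Jennifer", "lastname": "Walters"},
    {"firstname": "Monica", "lastname": "Rambeau"},
])
print(body, end="")
```

When sending this body over HTTP directly, the request is normally made with the `Content-Type: application/x-ndjson` header.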

IV. Advanced Topics

1. Building from Source

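# Build a distribution for the local platform
./gradlew localDistro

# Build a distribution for a specific platform (here, a Linux tar archive)
./gradlew :distribution:archives:linux-tar:assemble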

2. Upgrading

See the official upgrade guide.

Other

Related Projects

elasticsearch-labs - sample code and example applications

伊织 xAI 2025-09-23 (Tue)