Elasticsearch index creation: shard and replica sizing recommendations

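In Elasticsearch, the **shard** and **replica** settings directly affect cluster performance, fault tolerance, and scalability. The following is a best-practice guide:

| Type | Description | Modifiable? |
| --- | --- | --- |
| Primary shard | The basic unit of data storage; each index is split into one or more primary shards | ❌ Fixed once the index is created (changing it means splitting, shrinking, or reindexing) |
| Replica shard | A full copy of a primary shard, providing data redundancy and read load balancing | ✅ Can be adjusted dynamically at any time |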

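Create the index with 5 primary shards (`number_of_shards`) and 2 replicas per primary shard (`number_of_replicas`):

    PUT /your_index
    {
      "settings": {
        "number_of_shards": 5,
        "number_of_replicas": 2
      }
    }

Change the replica count later on a live index:

    PUT /your_index/_settings
    {
      "index.number_of_replicas": 1
    }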
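The two settings multiply, which is easy to overlook when sizing a cluster. The following is a minimal sketch of that arithmetic; the function names are made up for illustration and are not part of any Elasticsearch client.

```python
def total_shard_copies(primaries: int, replicas: int) -> int:
    """Total shard copies in one index: every primary plus its replicas."""
    if primaries < 1 or replicas < 0:
        raise ValueError("primaries must be >= 1 and replicas >= 0")
    return primaries * (1 + replicas)


def min_nodes_for_full_allocation(replicas: int) -> int:
    """Elasticsearch never puts two copies of the same shard on one node,
    so assigning every copy needs at least replicas + 1 data nodes."""
    if replicas < 0:
        raise ValueError("replicas must be >= 0")
    return replicas + 1


print(total_shard_copies(5, 2))          # 15
print(min_nodes_for_full_allocation(2))  # 3
```

With 5 primaries and 2 replicas the cluster holds 15 shard copies for this one index, and at least 3 data nodes are needed before all of them can be assigned.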

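**Hot-warm architecture**

Keep new data on hot nodes. `data_type` is a custom node attribute, so each hot node has to declare `node.attr.data_type: hot` in its `elasticsearch.yml`:

    PUT /your_index/_settings
    {
      "index.routing.allocation.require.data_type": "hot"
    }

**Automatic shard balancing**

    # elasticsearch.yml
    # Shard balance factor (default 0.45). A lower value weakens the tendency
    # to equalize the number of shards on each node.
    cluster.routing.allocation.balance.shard: 0.3

**Shard allocation constraints**

Spread the copies of each shard across racks. Every node has to declare its own `node.attr.rack_id` for this to take effect:

    PUT _cluster/settings
    {
      "persistent": {
        "cluster.routing.allocation.awareness.attributes": "rack_id"
      }
    }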

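View how shards are distributed across nodes:

    GET _cat/allocation?v&s=node

Locate large shards. This lists indices by total size; dividing `pri.store.size` by `pri` gives the average primary shard size:

    GET _cat/indices/*?v&h=index,pri,rep,store.size,pri.store.size&s=store.size:desc

Track shard movements and recoveries in progress:

    GET _cat/recovery?v&active_only=true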
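When the cluster has many indices, it can be easier to filter per-shard sizes in a script than to read the tables by eye. The sketch below assumes rows shaped like the JSON returned by `GET _cat/shards?format=json&bytes=gb&h=index,shard,prirep,store,node`, with sizes reported as strings in GB. The sample rows are invented, and `find_large_shards` is a hypothetical helper, not a client API.

```python
def find_large_shards(rows, limit_gb=50.0):
    """Return (index, shard, prirep, size_gb) tuples above limit_gb, largest first."""
    large = []
    for row in rows:
        store = row.get("store")
        if store is None:  # unassigned shards carry no store size
            continue
        size_gb = float(store)
        if size_gb > limit_gb:
            large.append((row["index"], row["shard"], row["prirep"], size_gb))
    return sorted(large, key=lambda item: item[3], reverse=True)


# Made-up rows in the assumed response shape (sizes in GB, as strings).
sample_rows = [
    {"index": "logs-a", "shard": "0", "prirep": "p", "store": "62", "node": "node-1"},
    {"index": "logs-a", "shard": "0", "prirep": "r", "store": "61", "node": "node-2"},
    {"index": "logs-b", "shard": "1", "prirep": "p", "store": "18", "node": "node-3"},
    {"index": "logs-b", "shard": "1", "prirep": "r", "store": None, "node": None},
]

for hit in find_large_shards(sample_rows):
    print(hit)
```

The 50 GB default mirrors the threshold used in this article; pass a different `limit_gb` if your own target shard size differs.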

**Problem 1: Shards are too large (>50 GB)**

👉 Solution:
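One commonly used option is the split index API, which creates a new index with more primary shards from an existing one. The sketch below picks a target shard count by repeated doubling and builds the request. It assumes the source index was created with the default `number_of_routing_shards` (which, on 7.x and later, permits splitting by factors of 2 up to 1024 shards) and that the source has already been made read-only with `index.blocks.write: true`. The helper names and the target index name are hypothetical.

```python
def split_target_shards(current_primaries, primary_store_gb,
                        max_shard_gb=50.0, max_shards=1024):
    """Smallest primary count, reached by doubling, whose average
    primary shard size is at most max_shard_gb."""
    if current_primaries < 1:
        raise ValueError("current_primaries must be >= 1")
    target = current_primaries
    while primary_store_gb / target > max_shard_gb:
        if target * 2 > max_shards:
            raise ValueError("size target not reachable within max_shards")
        target *= 2
    return target


def build_split_request(source, target, shards):
    """Return the request line and JSON body for the split index API."""
    body = {"settings": {"index.number_of_shards": shards}}
    return f"POST /{source}/_split/{target}", body


# 5 primaries holding 600 GB in total: 120 GB each, so double twice.
shards = split_target_shards(5, 600)
print(shards)  # 20
print(build_split_request("your_index", "your_index_split", shards))
```

For time-based data such as logs, rolling over to a new index before shards grow this large avoids the split entirely.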

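**Problem 2: Shards are unevenly distributed across nodes**

👉 Solution: make sure rebalancing is enabled for all shard types. `all` is the default, so this request only matters if rebalancing was restricted earlier:

    PUT _cluster/settings
    {
      "transient": {
        "cluster.routing.rebalance.enable": "all"
      }
    }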
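To check whether rebalancing has actually evened things out, count shard copies per node. This is a rough sketch that assumes rows shaped like the output of `GET _cat/shards?format=json&h=index,shard,prirep,state,node`; the sample rows and node names are invented, and both helpers are hypothetical.

```python
from collections import Counter


def shards_per_node(rows, nodes=()):
    """Count started shard copies per node; `nodes` seeds nodes that hold none."""
    counts = Counter({name: 0 for name in nodes})
    for row in rows:
        if row.get("state") == "STARTED" and row.get("node"):
            counts[row["node"]] += 1
    return dict(counts)


def shard_spread(counts):
    """Gap between the most and least loaded node (0 if there are no nodes)."""
    if not counts:
        return 0
    return max(counts.values()) - min(counts.values())


# Made-up rows in the assumed response shape.
sample_rows = [
    {"index": "logs-a", "shard": "0", "prirep": "p", "state": "STARTED", "node": "node-1"},
    {"index": "logs-a", "shard": "1", "prirep": "p", "state": "STARTED", "node": "node-1"},
    {"index": "logs-a", "shard": "2", "prirep": "p", "state": "STARTED", "node": "node-1"},
    {"index": "logs-a", "shard": "0", "prirep": "r", "state": "STARTED", "node": "node-2"},
    {"index": "logs-a", "shard": "1", "prirep": "r", "state": "UNASSIGNED", "node": None},
]

counts = shards_per_node(sample_rows, nodes=("node-1", "node-2", "node-3"))
print(counts, shard_spread(counts))
```

A large spread that persists after recoveries have finished suggests that an allocation filter, awareness rule, or disk watermark is holding shards in place, which rebalancing alone will not fix.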

**Problem 3: Replica synchronization lag**

👉 Optimization:

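| Estimated data volume | Data growth rate | Nodes | Recommended primary shards | Recommended replicas |
| --- | --- | --- | --- | --- |
| 500 GB | Low (5%/month) | 3 | 10-15 | 1-2 |
| 5 TB | Medium (10%/month) | 8 | 100-150 | 2-3 |
| 50 TB | High (20%/month) | 20+ | 500+ | 2-3 |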
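The shard counts in the table roughly correspond to dividing the data volume by a target shard size. The sketch below makes that calculation explicit. The 40 GB target, the decimal unit conversion (1 TB = 1000 GB), and the function names are assumptions chosen for illustration, not values taken from Elasticsearch itself.

```python
import math


def projected_size_gb(current_gb, monthly_growth, months):
    """Compound the current data volume by a monthly growth rate."""
    return current_gb * (1 + monthly_growth) ** months


def recommended_primaries(data_gb, target_shard_gb=40.0):
    """Primary shard count that keeps the average shard near target_shard_gb."""
    if data_gb < 0 or target_shard_gb <= 0:
        raise ValueError("data_gb must be >= 0 and target_shard_gb > 0")
    return max(1, math.ceil(data_gb / target_shard_gb))


print(recommended_primaries(500))   # 13
print(recommended_primaries(5000))  # 125

# Size for where the data will be in a year, not where it is today.
future_gb = projected_size_gb(500, 0.05, 12)
print(round(future_gb), recommended_primaries(future_gb))
```

Since the primary shard count is fixed at index creation, sizing against the projected volume rather than the current one reduces the chance of having to split or reindex later.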