ElasticSearch04-高级操作

零、文章目录

ElasticSearch04-高级操作

1、文档添加

(1)生成文档ID
  • 不指定 ID,即自动生成ID,ID 是一行数据的唯一键。
  • 语法:POST /index/_doc
json 复制代码
# 创建索引
PUT testid

# 默认情况下自动生成ID
POST /testid/_doc
{
  "test_field": "test"
}

# 返回
{
  "_index" : "testid",
  "_type" : "_doc",
  "_id" : "pyxXs5MBW--orQEFRZhH",
  "_version" : 1,
  "result" : "created",
  "_shards" : {
    "total" : 2,
    "successful" : 1,
    "failed" : 0
  },
  "_seq_no" : 0,
  "_primary_term" : 1
}
  • 场景:如果导入 ES 的数据想使用外部定义的 ID,需要在添加文档数据的时候指定 ID。
  • 语法:POST /index/_doc/id
json 复制代码
# 指定ID
POST /testid/_doc/1
{
  "test_field": "testid"
}

# 返回 1是指定的ID
{
  "_index" : "testid",
  "_type" : "_doc",
  "_id" : "1",
  "_version" : 1,
  "result" : "created",
  "_shards" : {
    "total" : 2,
    "successful" : 1,
    "failed" : 0
  },
  "_seq_no" : 1,
  "_primary_term" : 1
}

2、文档条件查询

(1)准备数据
json 复制代码
# 创建索引
PUT teacher

POST /teacher/_doc/001
{
  "name":"zhangsan",
  "nickname":"zhangsan",
  "sex":"男",
  "age":30
}

POST /teacher/_doc/002
{
  "name":"lisi",
  "nickname":"lisi",
  "sex":"男",
  "age":20
}

POST /teacher/_doc/003
{
  "name":"wangwu1",
  "nickname":"wangwu2",
  "sex":"女",
  "age":40
}

POST /teacher/_doc/004
{
  "name":"zhaoliu1",
  "nickname":"zhaoliu2",
  "sex":"女",
  "age":50
}

POST /teacher/_doc/005
{
  "name":"zhangsan2",
  "nickname":"zhangsan2",
  "sex":"女",
  "age":30
}

POST /teacher/_doc/006
{
  "name":"zhangsan222",
  "nickname":"zhangsan222",
  "sex":"女",
  "age":30
}
(2)查询全部(match_all)
  • ES中提供了一种强大的检索数据方式,这种检索方式称之为Query DSL,Query DSL是利用Rest API传递JSON格式的请求体与ES进行交互,这种方式的丰富查询语法让ES检索变得更强大,更简洁。
  • 使用_search 表示查询
json 复制代码
# 请求
Get /teacher/_search
{
  "query": {
    "match_all": {}
  }
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "002",
        "_score" : 1.0,
        "_source" : {
          "name" : "lisi",
          "nickname" : "lisi",
          "sex" : "男",
          "age" : 20
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "003",
        "_score" : 1.0,
        "_source" : {
          "name" : "wangwu1",
          "nickname" : "wangwu2",
          "sex" : "女",
          "age" : 40
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "004",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhaoliu1",
          "nickname" : "zhaoliu2",
          "sex" : "女",
          "age" : 50
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "005",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan2",
          "nickname" : "zhangsan2",
          "sex" : "女",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "006",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan222",
          "nickname" : "zhangsan222",
          "sex" : "女",
          "age" : 30
        }
      }
    ]
  }
}
  • 返回的结果说明:
    • took:这个字段表示查询执行所花费的毫秒数。在这个例子中,查询执行花费了0毫秒。
    • timed_out:这个字段表示查询是否超时。false表示查询没有超时。
    • _shards:这个字段包含了关于分片的信息。
      • total:查询被分发到的分片总数。
      • successful:成功处理的分片数。
      • skipped:被跳过的分片数。
      • failed:处理失败的分片数。
    • hits:这个字段包含了查询结果的相关信息。
      • total:匹配的文档总数。value表示总数,relation表示总数的类型,这里是eq,表示精确匹配。
      • max_score:所有匹配文档中最高的相关性得分,这里是1.0。
      • hits:实际匹配的文档列表,每个文档包含以下信息:
        • _index:文档所在的索引名称。
        • _type:文档的类型(在Elasticsearch 7.x之后,类型被标记为废弃)。
        • _id:文档的唯一标识符。
        • _score:文档的相关性得分。
        • _source:文档的原始数据,包含了文档的所有字段和值。
    • hits数组中,每个元素都是一个文档,包含了文档的元数据(如索引名、类型、ID、得分)和文档的内容(在_source字段中)。这个例子中返回了6个文档,每个文档都有namenicknamesexage字段。
(3)分词查询(match)
  • match查询用于在单个字段上执行全文检索。它会使用指定字段的分词器(analyzer)来处理查询文本,然后搜索与这些词元匹配的文档。
  • 只有 text 类型的字段才会进行分词查询。
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "match": {
      "name": "zhangsan"
    }
  }
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      }
    ]
  }
}
(4)精确查询(term)
  • term查询用于精确匹配,不进行分词处理,适用于关键字(keyword)、数字、日期等精确值字段。
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "term": {
      "name": {
        "value": "zhangsan"
      }
    }
  }
}

# 返回
{
  "took" : 1,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      }
    ]
  }
}
(5)范围查询(range)
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "range": {
      "age": {
        "gte": 30,
        "lte": 30
      }
    }
  }
}

# 返回
{
  "took" : 1,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 3,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "005",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan2",
          "nickname" : "zhangsan2",
          "sex" : "女",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "006",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan222",
          "nickname" : "zhangsan222",
          "sex" : "女",
          "age" : 30
        }
      }
    ]
  }
}
(6)前缀查询(prefix)
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "prefix": {
      "name": {
        "value": "zhangsan"
      }
    }
  }
}

# 返回
{
  "took" : 7,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 3,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "005",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan2",
          "nickname" : "zhangsan2",
          "sex" : "女",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "006",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan222",
          "nickname" : "zhangsan222",
          "sex" : "女",
          "age" : 30
        }
      }
    ]
  }
}
(7)通配符查询(wildcard)
  • ?:匹配任意一个字符
  • *:匹配任意多个字符
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "wildcard": {
      "name": {
        "value": "*san*"
      }
    }
  }
}

# 返回
{
  "took" : 6,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 3,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "005",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan2",
          "nickname" : "zhangsan2",
          "sex" : "女",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "006",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan222",
          "nickname" : "zhangsan222",
          "sex" : "女",
          "age" : 30
        }
      }
    ]
  }
}
(8)多id查询(ids)
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "ids": {
      "values": ["001","003","005"]
    }
  }
}

#返回
{
  "took" : 1,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 3,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "003",
        "_score" : 1.0,
        "_source" : {
          "name" : "wangwu1",
          "nickname" : "wangwu2",
          "sex" : "女",
          "age" : 40
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "005",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan2",
          "nickname" : "zhangsan2",
          "sex" : "女",
          "age" : 30
        }
      }
    ]
  }
}
(9)模糊查询(fuzzy)
  • Elasticsearch中的fuzzy查询允许用户搜索包含与指定查询词相似的文档,即使它们不完全相同。这种查询特别适用于处理近似或不精确的搜索词,例如用户可能打错字或输入同一词的不同变体的情况。
  • 以下是fuzzy查询的一些关键点:
    • 编辑距离(Edit Distance)fuzzy查询使用Levenshtein编辑距离来衡量搜索词之间的相似度。编辑距离是指将一个词转换成另一个词所需的单字符更改次数,包括改变一个字符、删除一个字符、插入一个字符或交换两个相邻字符。
    • 创建变体(Creating Variations)fuzzy查询在指定的编辑距离内创建搜索词的所有可能变体,然后返回每个变体的精确匹配。
    • 参数配置
      • fuzziness:允许的最大编辑距离,默认为AUTO。可以设置为AUTO或一个整数,表示允许的编辑次数。
      • max_expansions:创建的变体的最大数量,默认为50。高值可能会导致性能问题,因为会检查大量的变体。
      • prefix_length:在创建变体时保持不变的开始字符数,默认为0。
      • transpositions:是否包括两个相邻字符的置换作为编辑,默认为true
      • rewrite:用于重写查询的方法,如果fuzziness参数不为0,默认使用top_terms_blended_freqs_${max_expansions}
    • 性能考虑 :由于fuzzy查询可能会生成大量的变体,特别是当max_expansions参数值较高且prefix_length参数值为0时,可能会影响查询性能。因此,合理配置这些参数对于保持Elasticsearch查询性能至关重要。
    • 使用场景fuzzy查询是term查询的模糊等价物,通常不会直接使用,但理解其工作原理有助于在更高级的match查询中使用模糊匹配。
    • 不进行分析 :与match查询不同,fuzzy查询是一个term级别的查询,不会对查询词进行分析,它直接在词典中查找与指定fuzziness范围内的所有项。
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "fuzzy": {
      "name": "zhangsan"
    }
  }
}

# 返回
{
  "took" : 2,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "005",
        "_score" : 1.3478894,
        "_source" : {
          "name" : "zhangsan2",
          "nickname" : "zhangsan2",
          "sex" : "女",
          "age" : 30
        }
      }
    ]
  }
}
(10)组合查询(bool)
  • must: 相当于&& 同时成立
  • should: 相当于|| 成立一个就行
  • must_not: 相当于! 不能满足任何一个
bash 复制代码
# 请求
GET /teacher/_search
{
    "query": {
        "bool": {
            "must": [
                {
                    "match": {
                        "name": "zhangsan"
                    }
                }
            ],
            "must_not": [
                {
                    "match": {
                        "age": "40"
                    }
                }
            ],
            "should": [
                {
                    "match": {
                        "sex": "男"
                    }
                }
            ]
        }
    }
}

# 返回
{
  "took" : 3,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 2.5700645,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 2.5700645,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      }
    ]
  }
}
(11)多字段查询(multi_match)
  • multi_match查询允许你在多个字段上执行全文检索。与match查询类似,它也会进行分词,但是可以在多个字段上搜索给定的文本
json 复制代码
# 请求
GET /teacher/_search
{
    "query": {
        "multi_match": {
            "query": "zhangsan",
            "fields": ["name","nickname"]
        }
    }
}

# 返回
{
  "took" : 6,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      }
    ]
  }
}
(12)多值查询(terms)
  • terms 查询和 term 查询一样,但它允许你指定多值进行匹配。类似 SQL 中的 in。
json 复制代码
# 请求
GET /teacher/_search
{
    "query": {
        "terms": {
            "name": ["zhangsan","lisi"]
        }
    }
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "002",
        "_score" : 1.0,
        "_source" : {
          "name" : "lisi",
          "nickname" : "lisi",
          "sex" : "男",
          "age" : 20
        }
      }
    ]
  }
}
(13)复杂查询(query_string)
  • Elasticsearch中的query_string查询是一个强大的全文搜索查询,它允许用户使用一个查询字符串来构造复杂的查询。这个查询字符串可以包含多个字段、逻辑操作符(如AND, OR, NOT)以及通配符等。query_string查询非常灵活,可以模拟其他查询类型,如matchbool等。
  • 以下是query_string查询的一些关键点:
    • 查询语法:
      • 支持布尔操作符:AND, OR, NOT
      • 支持通配符查询:*(匹配任意字符)。
      • 支持短语搜索:使用双引号""包围短语。
      • 支持前缀搜索:使用问号?表示单个字符,使用星号*表示任意长度的字符。
      • 支持正则表达式:使用正斜杠/包围正则表达式。
    • 字段选择:可以在查询中指定一个或多个字段进行搜索。
    • 分析器:默认使用字段的搜索分析器(search analyzer),但也可以使用查询分析器(query analyzer)。
    • 默认操作:默认的布尔操作是AND,意味着如果查询字符串中没有指定布尔操作符,那么查询会将所有条件视为AND连接。
    • 字段名和值的转义:如果需要在查询中使用特殊字符,可以使用反斜杠\进行转义。
    • 性能:由于query_string查询的复杂性,它可能比简单的match查询更消耗资源。因此,在性能敏感的应用中需要谨慎使用。
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "query_string": {
      "query": "name:zhansan OR sex:男"
    }
  }
}

# 返回
{
  "took" : 9,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 1.0296195,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0296195,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "002",
        "_score" : 1.0296195,
        "_source" : {
          "name" : "lisi",
          "nickname" : "lisi",
          "sex" : "男",
          "age" : 20
        }
      }
    ]
  }
}
  • 分析器的使用:这个查询使用standard分析器来处理查询字符串。
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "query_string": {
      "query": "name:zhangsan",
      "analyzer": "standard"
    }
  }
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      }
    ]
  }
}

3、文档查询结果集处理

(1)查询结果高亮(highlight)
  • Elasticsearch中的高亮(highlight)功能允许你在搜索结果中突出显示与查询条件匹配的文本片段,以便用户能够快速定位到关键信息。
  • 以下是关于Elasticsearch高亮功能的详细介绍:
    • 高亮参数:
      • Elasticsearch提供了多种高亮参数,包括字段设置、自定义标签、片段大小等。
      • 默认情况下,Elasticsearch使用<em></em>标签来标记高亮的关键字。
    • 自定义高亮片段:
      • 用户可以自定义高亮标签,比如使用<b></b>标签来代替默认的<em></em>标签。
      • 通过pre_tagspost_tags参数,用户可以指定高亮前后的标签。
    • 多字段高亮:
      • 高亮功能可以应用于一个或多个字段,只需在highlight字段中添加相应的字段名称。
    • 高亮性能分析:
      • 高亮功能需要字段的实际内容,如果字段未存储(映射未设置storetrue),则需要从_source中加载实际值并从中提取相关字段。
  • 高亮查询示例:
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "term": {
      "name": {
        "value": "zhangsan"
      }
    }
  },
  "highlight": {
    "fields": {
      "*":{}
    }
  }
}

# 返回
{
  "took" : 33,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        },
        "highlight" : {
          "name" : [
            "<em>zhangsan</em>"
          ]
        }
      }
    ]
  }
}
  • 自定义标签
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "term": {
      "name": {
        "value": "zhangsan"
      }
    }
  },
  "highlight": {
        "pre_tags": "<font color='red'>",
        "post_tags": "</font>",
        "fields": {
            "name": {}
        }
    }
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 1.540445,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.540445,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        },
        "highlight" : {
          "name" : [
            "<font color='red'>zhangsan</font>"
          ]
        }
      }
    ]
  }
}
(2)指定返回条数(size)
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "match_all": {}
  },
  "size": 2
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "002",
        "_score" : 1.0,
        "_source" : {
          "name" : "lisi",
          "nickname" : "lisi",
          "sex" : "男",
          "age" : 20
        }
      }
    ]
  }
}
(3)分页查询(from)
  • size:取多少条数据
  • from:从哪条数据开始
  • size+from 合起来实现分页效果
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "match_all": {}
  },
  "size": 2,
  "from": 2
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "003",
        "_score" : 1.0,
        "_source" : {
          "name" : "wangwu1",
          "nickname" : "wangwu2",
          "sex" : "女",
          "age" : 40
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "004",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhaoliu1",
          "nickname" : "zhaoliu2",
          "sex" : "女",
          "age" : 50
        }
      }
    ]
  }
}
(4)指定字段排序(sort)
  • 文本字段(text字段)默认不适用于需要逐文档字段数据的操作,如聚合(aggregations)和排序(sorting)。这是因为text字段会被分词器分词,存储的是词元(tokens),而不是完整的原始字符串。
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "terms": {
      "name":["zhangsan","lisi"]
    }
  },
  "sort": [
    {
      "age": {
        "order": "desc"
      }
    }
  ]
}

# 返回
{
  "took" : 5,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : null,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan",
          "sex" : "男",
          "age" : 30
        },
        "sort" : [
          30
        ]
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "002",
        "_score" : null,
        "_source" : {
          "name" : "lisi",
          "nickname" : "lisi",
          "sex" : "男",
          "age" : 20
        },
        "sort" : [
          20
        ]
      }
    ]
  }
}
(5)指定返回字段(_source)
  • includes:来指定想要显示的字段
  • excludes:来指定不想要显示的字段
json 复制代码
# 请求
GET /teacher/_search
{
  "query": {
    "terms": {
      "name":["zhangsan","lisi"]
    }
  },
  "_source": {
      "includes": ["name","nickname"]
  }
}

# 返回
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 2,
      "relation" : "eq"
    },
    "max_score" : 1.0,
    "hits" : [
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "001",
        "_score" : 1.0,
        "_source" : {
          "name" : "zhangsan",
          "nickname" : "zhangsan"
        }
      },
      {
        "_index" : "teacher",
        "_type" : "_doc",
        "_id" : "002",
        "_score" : 1.0,
        "_source" : {
          "name" : "lisi",
          "nickname" : "lisi"
        }
      }
    ]
  }
}

4、文档聚合查询

(1)返回最大值(max)
  • aggs:表示聚合
    • max_age:自定义返回聚合字段名
      • max:取最大值
        • field:聚合的字段是 age
  • size:0 表示返回聚合结果不返回详细数据
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "max_age":{
            "max":{"field":"age"}
        }
    },
    "size":0
}

# 返回
{
  "took" : 4,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "max_age" : {
      "value" : 50.0
    }
  }
}
(2)返回最小值(min)
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "min_age":{
            "min":{"field":"age"}
        }
    },
    "size":0
}

# 返回值
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "min_age" : {
      "value" : 20.0
    }
  }
}
(3)返回平均值(avg)
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "avg_age":{
            "avg":{"field":"age"}
        }
    },
    "size":0
}

# 返回值
{
  "took" : 3,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "avg_age" : {
      "value" : 33.333333333333336
    }
  }
}
(4)返回和(sum)
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "sum_age":{
            "sum":{"field":"age"}
        }
    },
    "size":0
}

# 返回值
{
  "took" : 0,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "sum_age" : {
      "value" : 200.0
    }
  }
}
(5)返回去重行数(cardinality)
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "cardinality_age":{
            "cardinality":{"field":"age"}
        }
    },
    "size":0
}

# 返回值
{
  "took" : 10,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "cardinality_age" : {
      "value" : 4
    }
  }
}
(6)返回全部聚合(stats)
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "stats_age":{
            "stats":{"field":"age"}
        }
    },
    "size":0
}

# 返回值
{
  "took" : 2,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "stats_age" : {
      "count" : 6,
      "min" : 20.0,
      "max" : 50.0,
      "avg" : 33.333333333333336,
      "sum" : 200.0
    }
  }
}
(7)分组聚合(terms)
  • 默认情况下,分组统计的是每个分组的行数
json 复制代码
# 请求
GET /teacher/_search
{
    "aggs":{
        "terms_age":{
            "terms":{"field":"age"}
        }
    },
    "size":0
}

# 返回值
{
  "took" : 2,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "terms_age" : {
      "doc_count_error_upper_bound" : 0,
      "sum_other_doc_count" : 0,
      "buckets" : [
        {
          "key" : 30,
          "doc_count" : 3
        },
        {
          "key" : 20,
          "doc_count" : 1
        },
        {
          "key" : 40,
          "doc_count" : 1
        },
        {
          "key" : 50,
          "doc_count" : 1
        }
      ]
    }
  }
}
(8)分组指定聚合(terms+aggs)
json 复制代码
# 请求
GET /teacher/_search?size=0
{
  "aggs": {
    "group_by_field": {
      "terms": {
        "field": "age"
      },
      "aggs": {
        "average_value": {
          "avg": {
            "field": "age"
          }
        }
      }
    }
  }
}

# 返回值
{
  "took" : 1,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "group_by_field" : {
      "doc_count_error_upper_bound" : 0,
      "sum_other_doc_count" : 0,
      "buckets" : [
        {
          "key" : 30,
          "doc_count" : 3,
          "average_value" : {
            "value" : 30.0
          }
        },
        {
          "key" : 20,
          "doc_count" : 1,
          "average_value" : {
            "value" : 20.0
          }
        },
        {
          "key" : 40,
          "doc_count" : 1,
          "average_value" : {
            "value" : 40.0
          }
        },
        {
          "key" : 50,
          "doc_count" : 1,
          "average_value" : {
            "value" : 50.0
          }
        }
      ]
    }
  }
}
(9)分组聚合排序(terms+aggs+sort)
json 复制代码
GET /teacher/_search?size=0
{
  "aggs": {
    "group_by_field": {
      "terms": {
        "field": "age",
        "order": {"average_value": "desc"}
      },
      "aggs": {
        "average_value": {
          "avg": {
            "field": "age"
          }
        }
      }
    }
  }
}

# 返回值
{
  "took" : 4,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 6,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  },
  "aggregations" : {
    "group_by_field" : {
      "doc_count_error_upper_bound" : 0,
      "sum_other_doc_count" : 0,
      "buckets" : [
        {
          "key" : 50,
          "doc_count" : 1,
          "average_value" : {
            "value" : 50.0
          }
        },
        {
          "key" : 40,
          "doc_count" : 1,
          "average_value" : {
            "value" : 40.0
          }
        },
        {
          "key" : 30,
          "doc_count" : 3,
          "average_value" : {
            "value" : 30.0
          }
        },
        {
          "key" : 20,
          "doc_count" : 1,
          "average_value" : {
            "value" : 20.0
          }
        }
      ]
    }
  }
}
相关推荐
唐可盐2 小时前
如何进行mysql慢查询日志设置以及日志管理与分析
运维·数据库·mysql
梁辰兴2 小时前
汽车租赁系统数据库 E-R 图设计
数据库·ea·汽车租赁系统·er图设计·er
白云如幻3 小时前
MySQL数据表的管理
数据库·mysql
上辈子杀猪这辈子学IT4 小时前
【Hive数据仓库】Hive部署、Hive数据库操作(增删改查)、表操作(内部表、外部表、分区表、桶表)
大数据·数据库·数据仓库·hive·hadoop
web安全工具库4 小时前
探索 Python 中的字典:序列解包与复杂数据结构
前端·数据库·python
万能的编程语言4 小时前
ORACLE逗号分隔的字符串字段,关联表查询
数据库·oracle
33三 三like4 小时前
数据库基础知识点(选填/问答题)
数据库·oracle
jjw_zyfx4 小时前
Django结合websocket实现分组的多人聊天
数据库·websocket·django
不会编程的猫星人4 小时前
oracle控制文件发生变化的情况
数据库·oracle