ElasticSearch中實(shí)現(xiàn)模糊查詢效果(類似數(shù)據(jù)庫(kù)中l(wèi)ike功能)
場(chǎng)景:
業(yè)務(wù)要求提供一個(gè)es查詢功能,實(shí)現(xiàn)類似模糊查詢效果,并且命中字段顯示紅色。舉例說(shuō)明:
es中字段內(nèi)容 | 輸入(即關(guān)鍵字) | 是否輸出 |
---|---|---|
你好,中國(guó),強(qiáng)大的祖國(guó) | 中國(guó) | 是(則‘中國(guó)’兩個(gè)字飄紅) |
你好,中國(guó),強(qiáng)大的祖國(guó) | 俄國(guó) | 否 |
你好,中國(guó),強(qiáng)大的祖國(guó) | 最大 | 否 |
實(shí)現(xiàn)方式:
這種實(shí)現(xiàn)方式主要是用es的query_string查詢方式,不過(guò)需要對(duì)輸入條件做區(qū)分處理才能實(shí)現(xiàn)模糊查詢效果。
首先,先復(fù)習(xí)一下query_string查詢方式的特點(diǎn):
{
"query": {
"query_string": {
"query": "中國(guó)"
}
},
"size": 10,
"from": 0,
"sort": []
}
pom依賴:
我用的elasticsearch版本是6.8.1,springboot版本是2.3.5.RELEASE,因此pom依賴的版本不對(duì),需要先排除再引入正確的包。
<dependency>
<groupId>org.springframework.boot</groupId>
<artifactId>spring-boot-starter-data-elasticsearch</artifactId>
<exclusions>
<exclusion>
<artifactId>spring-data-elasticsearch</artifactId>
<groupId>org.springframework.data</groupId>
</exclusion>
</exclusions>
</dependency>
<dependency>
<groupId>org.springframework.data</groupId>
<artifactId>spring-data-elasticsearch</artifactId>
<version>3.2.6.RELEASE</version>
</dependency>
代碼:
import org.apache.commons.lang3.StringUtils;
import org.elasticsearch.action.search.SearchRequest;
import org.elasticsearch.action.search.SearchResponse;
import org.elasticsearch.client.RequestOptions;
import org.elasticsearch.client.RestHighLevelClient;
import org.elasticsearch.client.core.CountRequest;
import org.elasticsearch.client.core.CountResponse;
import org.elasticsearch.index.query.BoolQueryBuilder;
import org.elasticsearch.index.query.QueryBuilder;
import org.elasticsearch.index.query.QueryBuilders;
import org.elasticsearch.search.SearchHit;
import org.elasticsearch.search.builder.SearchSourceBuilder;
import org.elasticsearch.search.fetch.subphase.highlight.HighlightBuilder;
import org.elasticsearch.search.fetch.subphase.highlight.HighlightField;
import org.springframework.stereotype.Service;
import javax.annotation.Resource;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedList;
import java.util.List;
import java.util.Map;
/**
* 彈性搜索核心服務(wù)
*
* @author lukou
* @date 2023/05/15
*/
@Service
public class ElasticSearchCoreService {
/**
* 常量和
*/
private static final String CONSTANT_AND = " AND ";
/**
* 常量不
*/
private static final String CONSTANT_NOT = " NOT ";
/**
* 常量或
*/
private static final String CONSTANT_OR = " OR ";
@Resource
private RestHighLevelClient restHighLevelClient;
/**
* 通過(guò)query_string方法查詢統(tǒng)計(jì)
*
* @param index 指數(shù)
* @param keyword 關(guān)鍵字
* @return {@link CountResponse}
* @throws IOException ioexception
*/
public CountResponse queryCount(String index, String keyword) throws IOException {
SearchSourceBuilder searchSourceBuilder = new SearchSourceBuilder();
searchSourceBuilder.query(this.queryString(keyword));
CountRequest request = new CountRequest(index);
request.source(searchSourceBuilder);
return restHighLevelClient.count(request, RequestOptions.DEFAULT);
}
/**
* 通過(guò)query_string方法查詢搜索
*
* @param index 指數(shù)
* @param keyword 關(guān)鍵字
* @param from 起始位置
* @param size 大小
* @return {@link SearchResponse}
* @throws IOException ioexception
*/
public SearchResponse querySearch(String index, String keyword, int from, int size) throws IOException {
SearchSourceBuilder searchSourceBuilder = new SearchSourceBuilder();
searchSourceBuilder.query(this.queryString(keyword)).highlighter(redHighlightBuilder())
.from(from)
.size(size);
SearchRequest request = new SearchRequest(index);
// preference解決分頁(yè)數(shù)據(jù)不準(zhǔn)確的問(wèn)題(和分片有關(guān)系)
request.source(searchSourceBuilder).preference(String.valueOf(keyword.hashCode()));
return restHighLevelClient.search(request, RequestOptions.DEFAULT);
}
/**
* 通過(guò)query_string方法查詢字符串
* 按照邏輯表達(dá)式切割(AND OR NOT)
*
* <pre>
* "123 AND abc"
* "123 OR abc"
* "123 NOT abc"
* "NOT 123 NOT abc"
* </pre>
*
* @param key 關(guān)鍵
* @return {@link QueryBuilder}
*/
public QueryBuilder queryString(String key) {
//Bool查找
BoolQueryBuilder boolQueryBuilder = QueryBuilders.boolQuery();
if (StringUtils.contains(key, CONSTANT_AND)) {
// 包含AND
String[] keys = key.split(CONSTANT_AND);
for (String s : keys) {
boolQueryBuilder.must(QueryBuilders.queryStringQuery(convert(s)));
}
return boolQueryBuilder;
} else if (StringUtils.contains(key, CONSTANT_OR)) {
String[] keys = key.split(CONSTANT_OR);
for (String s : keys) {
boolQueryBuilder.should(QueryBuilders.queryStringQuery(convert(s)));
}
return boolQueryBuilder;
} else if (StringUtils.contains(key, CONSTANT_NOT)) {
List<String> keys = new ArrayList<>(Arrays.asList(key.split(CONSTANT_NOT)));
if (keys.get(0).startsWith("NOT ")) {
keys.set(0, keys.get(0).replaceAll("Not ", ""));
for (String s : keys) {
boolQueryBuilder.mustNot(QueryBuilders.queryStringQuery(convert(s)));
}
return boolQueryBuilder;
}
String keyword = keys.remove(0);
boolQueryBuilder.must(QueryBuilders.queryStringQuery(convert(keyword)));
for (String s : keys) {
boolQueryBuilder.mustNot(QueryBuilders.queryStringQuery(convert(s)));
}
return boolQueryBuilder;
}
boolQueryBuilder.must(QueryBuilders.queryStringQuery(convert(key)));
return boolQueryBuilder;
}
/**
* 轉(zhuǎn)換
* 判斷是不是字母、數(shù)字、漢字
*
* @param key 關(guān)鍵
* @return {@link String}
*/
public String convert(String key) {
//在執(zhí)行查詢時(shí),搜索的詞不會(huì)被分詞器分詞,而是直接以一個(gè)短語(yǔ)的形式查詢
String res = "\"" + key + "\"";
if (key.matches("^[A-Za-z0-9]*$")) {
res = "*" + key + "*";
}
if (key.matches("^[\u4e00-\u9fa5][A-Za-z0-9]*$")) {
res = key + "*";
}
return res;
}
/**
* 紅色突出顯示生成器
*
* @return {@link HighlightBuilder}
*/
public HighlightBuilder redHighlightBuilder() {
HighlightBuilder highlightBuilder = new HighlightBuilder();
//高亮的字段
highlightBuilder.field("*");
//是否多個(gè)字段都高亮
highlightBuilder.requireFieldMatch(true);
//前綴后綴
highlightBuilder.preTags("<span style='color:red'>");
highlightBuilder.postTags("</span>");
return highlightBuilder;
}
/**
* 構(gòu)建突出標(biāo)記
*
* @param hits 支安打
* @return {@link List}<{@link Map}<{@link String}, {@link Object}>>
*/
public List<Map<String, Object>> buildHighlightTags(SearchHit[] hits) {
//解析結(jié)果
List<Map<String, Object>> result = new LinkedList<>();
for (SearchHit hit : hits) {
//解析高亮的字段
//獲取高亮字段
Map<String, HighlightField> highlightFields = hit.getHighlightFields();
Map<String, Object> sourceAsMap = hit.getSourceAsMap();
for (String s : highlightFields.keySet()) {
if (s != null) {
sourceAsMap.put(s, highlightFields.get(s).getFragments()[0].toString());//替換掉原來(lái)的內(nèi)容
}
}
result.add(sourceAsMap);
}
return result;
}
}
測(cè)試:
import org.elasticsearch.action.search.SearchResponse;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;
import javax.annotation.Resource;
import java.io.IOException;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
@RestController
public class TestController {
@Resource
private ElasticSearchCoreService elasticSearchCoreService;
@GetMapping("/test")
public Map<String, Object> test(String index, String keyword) throws IOException {
SearchResponse searchResponse = elasticSearchCoreService.querySearch(index, keyword, 0, 10);
long total = searchResponse.getHits().getTotalHits();
List<Map<String, Object>> mapList = elasticSearchCoreService.buildHighlightTags(searchResponse.getHits().getHits());
Map<String, Object> result = new HashMap<>();
result.put("total", total);
result.put("data", mapList);
return result;
}
}
造數(shù)據(jù):
新建索引tmp_1以及插入5條數(shù)據(jù)文章來(lái)源:http://www.zghlxwxcb.cn/news/detail-532717.html
PUT http://localhost:9200/tmp_1
{
"settings": {
"number_of_shards": 1,
"number_of_replicas": 0
},
"mappings": {
"_doc": {
"properties": {
"@timestamp": {
"type": "date"
},
"@version": {
"type": "text",
"fields": {
"keyword": {
"type": "keyword",
"ignore_above": 256
}
}
},
"name": {
"type": "keyword"
},
"sfz": {
"type": "text"
},
"content": {
"type": "text"
},
"address": {
"type": "text"
}
}
}
}
}
POST http://localhost:9200/tmp_1/_doc
{
"@version": "1",
"@timestamp": "2020-06-19T09:06:26.446Z",
"name": "唐伯虎",
"sfz": "3212111990018989",
"content": "當(dāng)你無(wú)話可說(shuō)的時(shí)候就別說(shuō)話,在你不知如何回答別人的話的時(shí)候就保持沉默,這就是生活中一個(gè)很好的策略",
"address": "江蘇省南京市"
}
POST http://localhost:9200/tmp_1/_doc
{
"@version": "1",
"@timestamp": "2020-06-19T09:06:26.446Z",
"name": "唐伯龍",
"sfz": "3212111990018989",
"content": "凡事都有偶然的湊巧,結(jié)果卻又如宿命的必然",
"address": "江蘇省無(wú)錫市"
}
POST http://localhost:9200/tmp_1/_doc
{
"@version": "1",
"@timestamp": "2020-06-19T09:06:26.446Z",
"name": "唐小虎",
"sfz": "321211199709227654",
"content": "一個(gè)人如果刻意逃避他所懼怕的東西,到頭來(lái)會(huì)發(fā)現(xiàn)自己只是抄了近路去見(jiàn)它",
"address": "江蘇省蘇州市"
}
POST http://localhost:9200/tmp_1/_doc
{
"@version": "1",
"@timestamp": "2020-06-19T09:06:26.446Z",
"name": "李小龍",
"sfz": "1234211186709222348",
"content": "蟲(chóng)子被踩后縮起來(lái),這是明智的,它借此減少重新被踩的概率。用道德的語(yǔ)言就叫:謙恭",
"address": "江蘇省常州市"
}
POST http://localhost:9200/tmp_1/_doc
{
"@version": "1",
"@timestamp": "2020-06-19T09:06:26.446Z",
"name": "李四",
"sfz": "436754187709087623",
"content": "你好,1234,你好5678",
"address": "上海市"
}
調(diào)用接口:
http://localhost:8081/test?index=tmp_1&keyword=你好
# 響應(yīng)
{
"total": 1,
"data": [
{
"@timestamp": "2020-06-19T09:06:26.446Z",
"address": "上海市",
"sfz": "436754187709087623",
"@version": "1",
"name": "李四",
"content": "<span style='color:red'>你</span><span style='color:red'>好</span>,1234,<span style='color:red'>你</span><span style='color:red'>好</span>5678"
}
]
}
http://localhost:8081/test?index=tmp_1&keyword=唐小虎
# 響應(yīng)
{
"total": 1,
"data": [
{
"@timestamp": "2020-06-19T09:06:26.446Z",
"address": "江蘇省蘇州市",
"sfz": "321211199709227654",
"@version": "1",
"name": "<span style='color:red'>唐小虎</span>",
"content": "一個(gè)人如果刻意逃避他所懼怕的東西,到頭來(lái)會(huì)發(fā)現(xiàn)自己只是抄了近路去見(jiàn)它"
}
]
}
測(cè)試場(chǎng)景沒(méi)有全面覆蓋,如有錯(cuò)誤,歡迎指正。文章來(lái)源地址http://www.zghlxwxcb.cn/news/detail-532717.html
到了這里,關(guān)于ElasticSearch中實(shí)現(xiàn)模糊查詢效果(類似數(shù)據(jù)庫(kù)中l(wèi)ike功能)的文章就介紹完了。如果您還想了解更多內(nèi)容,請(qǐng)?jiān)谟疑辖撬阉鱐OY模板網(wǎng)以前的文章或繼續(xù)瀏覽下面的相關(guān)文章,希望大家以后多多支持TOY模板網(wǎng)!