2023年4月11日 07:21:29go评论56阅读模式

英文:

Substring match in elasticsearch with conditions

问题

以下是您要翻译的部分：

我正在尝试执行一个Elasticsearch查询，我应该得到所有餐馆的子串中包含“pizz”的餐馆名称，但不包含“pizza”和“pizzeria”。

我为此目的编写的查询如下：

GET my_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "wildcard": {
            "RestaurantName": {
              "value": "*pizz*"
            }
          }
        }
      ],
      "must_not": [
        {
          "match": {
            "RestaurantName": "pizza"
          }
        },
        {
          "match": {
            "RestaurantName": "pizzeria"
          }
        }
      ]
    }
  }
}

这个查询匹配了类似“Instapizza”的字段，这是错误的。它应该匹配任何组合或大写的情况，如“Fozzie's Pizzaiolo”、“PizzaVito”、“Pizzalicious”。我如何修改查询以避免匹配不需要的字段？对此有任何帮助将非常棒。

英文:

I am trying to perform an Elasticsearch query, where I am supposed to get all restaurants which contain the substring 'pizz' in the restaurant name but do not contain neither 'pizza' nor 'pizzeria'.

The query I wrote for this purpose is this:

GET my_index/_search
{
  &quot;query&quot;: {
    &quot;bool&quot;: {
      &quot;must&quot;: [
        {
          &quot;wildcard&quot;: {
            &quot;RestaurantName&quot;: {
              &quot;value&quot;: &quot;*pizz*&quot;
            }
          }
        }
      ],
      &quot;must_not&quot;: [
        {
          &quot;match&quot;: {
            &quot;RestaurantName&quot;: &quot;pizza&quot;
          }
        },
        {
          &quot;match&quot;: {
            &quot;RestaurantName&quot;: &quot;pizzeria&quot;
          }
        }
      ]
    }
  }
}

This query matches fields like Instapizza which is wrong. It should match anything combined or uppercase cases like: Fozzie's Pizzaiolo, PizzaVito, Pizzalicious. How can I fix the query to lose the match for unwanted fields? Any help with this would be really great.

答案1

得分: 2

以下是翻译好的内容：

当您将'RestaurantName'索引为文本字段时，“标准”分析器包括小写过滤器，“小写”标记过滤器使字段不区分大小写，这意味着Lucene中的所有标记都是小写。

首先，您应该为RestaurantName字段添加额外的关键字类型。

{
"mappings": {
"properties": {
"RestaurantName": {
"type": "text",
"fields": {
"keyword": {
"type": "keyword",
"ignore_above": 256
}
}
}
}
}
}

使用通配符进行搜索，

{
"query": {
"bool": {
"must": [
{
"wildcard": {
"RestaurantName.keyword": {
"value": "Pizz"
}
}
}
],
"must_not": [
{
"match": {
"RestaurantName": "pizza"
}
},
{
"match": {
"RestaurantName": "pizzeria"
}
}
]
}
}
}

结果是，

{
"took": 8,
"timed_out": false,
"_shards": {
"total": 1,
"successful": 1,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 3,
"relation": "eq"
},
"max_score": 1.0,
"hits": [
{
"_index": "pizza",
"_type": "_doc",
"_id": "1L6ob4cB6Rdc8HbDY8vi",
"_score": 1.0,
"_source": {
"RestaurantName": "Fozzie's Pizzaiolo"
}
},
{
"_index": "pizza",
"_type": "_doc",
"_id": "1b6ob4cB6Rdc8HbDg8tA",
"_score": 1.0,
"_source": {
"RestaurantName": "PizzaVito"
}
},
{
"_index": "pizza",
"_type": "_doc",
"_id": "1r6ob4cB6Rdc8HbDmMuJ",
"_score": 1.0,
"_source": {
"RestaurantName": "Pizzalicious"
}
}
}
}
}

英文:

When you index 'RestaurantName' as a text field, the "Standard" analyzer includes the lowercase filter, "lowercase" token filter makes fields case-insensitive, which means all tokens in lucene are lowercase.

first, you should add an extra keyword type to RestaurantName field.

{
    &quot;mappings&quot;: {
        &quot;properties&quot;: {
            &quot;RestaurantName&quot;: {
                &quot;type&quot;: &quot;text&quot;,
                &quot;fields&quot;: {
                    &quot;keyword&quot;: {
                        &quot;type&quot;: &quot;keyword&quot;,
                        &quot;ignore_above&quot;: 256
                    }
                }
            }
        }
    }
}

search with wildcard,

{
    &quot;query&quot;: {
        &quot;bool&quot;: {
            &quot;must&quot;: [
                {
                    &quot;wildcard&quot;: {
                        &quot;RestaurantName.keyword&quot;: {
                            &quot;value&quot;: &quot;*Pizz*&quot;
                        }
                    }
                }
            ],
            &quot;must_not&quot;: [
                {
                    &quot;match&quot;: {
                        &quot;RestaurantName&quot;: &quot;pizza&quot;
                    }
                },
                {
                    &quot;match&quot;: {
                        &quot;RestaurantName&quot;: &quot;pizzeria&quot;
                    }
                }
            ]
        }
    }
}

the result is,

{
    &quot;took&quot;: 8,
    &quot;timed_out&quot;: false,
    &quot;_shards&quot;: {
        &quot;total&quot;: 1,
        &quot;successful&quot;: 1,
        &quot;skipped&quot;: 0,
        &quot;failed&quot;: 0
    },
    &quot;hits&quot;: {
        &quot;total&quot;: {
            &quot;value&quot;: 3,
            &quot;relation&quot;: &quot;eq&quot;
        },
        &quot;max_score&quot;: 1.0,
        &quot;hits&quot;: [
            {
                &quot;_index&quot;: &quot;pizza&quot;,
                &quot;_type&quot;: &quot;_doc&quot;,
                &quot;_id&quot;: &quot;1L6ob4cB6Rdc8HbDY8vi&quot;,
                &quot;_score&quot;: 1.0,
                &quot;_source&quot;: {
                    &quot;RestaurantName&quot;: &quot;Fozzie&#39;s Pizzaiolo&quot;
                }
            },
            {
                &quot;_index&quot;: &quot;pizza&quot;,
                &quot;_type&quot;: &quot;_doc&quot;,
                &quot;_id&quot;: &quot;1b6ob4cB6Rdc8HbDg8tA&quot;,
                &quot;_score&quot;: 1.0,
                &quot;_source&quot;: {
                    &quot;RestaurantName&quot;: &quot;PizzaVito&quot;
                }
            },
            {
                &quot;_index&quot;: &quot;pizza&quot;,
                &quot;_type&quot;: &quot;_doc&quot;,
                &quot;_id&quot;: &quot;1r6ob4cB6Rdc8HbDmMuJ&quot;,
                &quot;_score&quot;: 1.0,
                &quot;_source&quot;: {
                    &quot;RestaurantName&quot;: &quot;Pizzalicious&quot;
                }
            }
        ]
    }
}

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在Elasticsearch中的子字符串匹配与条件。

问题

答案1

春季数据Elasticsearch不会从注释中隐式创建映射。

ElasticSearch: check how analyzers/tokenizers/filters applied to an index split text into tokens?

弹性搜索查询的Java客户端序列化

ElasticSearch通过重定向的URL进行REST HTTP请求

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论