2020年7月23日 21:17:31go评论166阅读模式

英文:

How to match exact Search in Solr with space

问题

如何在两个单词之间有空格时搜索单词的一部分。我的查询如下：

/select?q=*:*&amp;fq=pageType:program&amp;fl=programLocation&amp;rows=100&amp;fq=programLocation:&quot;Mohali&quot;

我得到的结果如下：

&quot;response&quot;:{&quot;numFound&quot;:3,&quot;start&quot;:0,&quot;docs&quot;:[
      {
        &quot;programLocation&quot;:[&quot;Mohali&quot;]},
      {
        &quot;programLocation&quot;:[&quot;Mohali&quot;]},
      {
        &quot;programLocation&quot;:[&quot;Mohali and Hyderabad&quot;]}]

我想要仅检索 "Mohali"，但现在我得到了 "Mohali" 和 "Mohali and Hyderabad"。如何构建查询以仅获取 Mohali？

英文:

How to search a part of the word when space is present between two words. My query this like below

/select?q=*:*&amp;fq=pageType:program&amp;fl=programLocation&amp;rows=100&amp;fq=programLocation:&quot;Mohali&quot;

The result I am getting is as below

&quot;response&quot;:{&quot;numFound&quot;:3,&quot;start&quot;:0,&quot;docs&quot;:[
      {
        &quot;programLocation&quot;:[&quot;Mohali&quot;]},
      {
        &quot;programLocation&quot;:[&quot;Mohali&quot;]},
      {
        &quot;programLocation&quot;:[&quot;Mohali and Hyderabad&quot;]}]

I want to retrieve only "Mohali", but now I am getting both "Mohali" and "Mohali and Hyderabad". How to form a query to only fetch Mohali?

答案1

得分: 1

你需要将string作为fieldtype应用于你的字段programLocation。

将字符串作为fieldtype应用并重新索引数据。

<field name="programLocation" type="string" indexed="true" stored="true" required="true" multiValued="false" />

String将单词/句子存储为精确字符串，不执行任何标记化操作。
它在存储精确匹配时非常有用，例如用于分面、排序。

与string类型相反的是text类型。
Text对数据进行标记化处理，执行诸如转小写等处理。在需要匹配句子的一部分时很有用。

如果您想要实现小写搜索，那么请为您的字段使用以下fieldtype。

<fieldType name="forExactMatch" class="solr.TextField" sortMissingLast="true" omitNorms="true">
  <analyzer>
    <!-- KeywordTokenizer不会实际进行标记化，因此整个
         输入字符串将保留为单个标记
      -->
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <!-- LowerCase TokenFilter按预期执行操作，可用于
         在排序时进行大小写不敏感的匹配
      -->
    <filter class="solr.LowerCaseFilterFactory" />
  </analyzer>
</fieldType>

如果存在空格，您还可以在小写过滤工厂之后使用以下过滤器。

<!-- TrimFilter删除任何前导或尾随空格 -->
<filter class="solr.TrimFilterFactory" />

然后，您的字段定义将如下所示。

<field name="programLocation" type="forExactMatch" indexed="true" stored="true"/>

英文:

You need to use the string as fieldtype for your field programLocation.

Apply the String as fieldtype and reindex the data.

&lt;field name=&quot;programLocation&quot; type=&quot;string&quot; indexed=&quot;true&quot; stored=&quot;true&quot; required=&quot;true&quot; multiValued=&quot;false&quot; /&gt;

String stores the word/sentence as an exact string without performing any tokenization on it.
Its useful in cases for storing exact matches, e.g, for faceting, sorting.

Opposite of string type is text.
Text does the tokenization of data, performs processing such as lower-casing etc. It is helpful in case when we want to match part of a sentence.

If you want achieve the lowercase search then use the below fieldtype for you field.

&lt;fieldType name=&quot;forExactMatch&quot; class=&quot;solr.TextField&quot; sortMissingLast=&quot;true&quot; omitNorms=&quot;true&quot;&gt;
      &lt;analyzer&gt;
        &lt;!-- KeywordTokenizer does no actual tokenizing, so the entire
             input string is preserved as a single token
          --&gt;
        &lt;tokenizer class=&quot;solr.KeywordTokenizerFactory&quot;/&gt;
        &lt;!-- The LowerCase TokenFilter does what you expect, which can be
             when you want your sorting to be case insensitive
          --&gt;
        &lt;filter class=&quot;solr.LowerCaseFilterFactory&quot; /&gt;
      &lt;/analyzer&gt;
    &lt;/fieldType&gt;

If you have spaces the you can also use the below filter after the lowercase filter factory.

&lt;filter class=&quot;solr.TrimFilterFactory&quot; /&gt;

Then your field defination will look like below

&lt;field name=&quot;programLocation&quot; type=&quot;forExactMatch&quot; indexed=&quot;true&quot; stored=&quot;true&quot;/&gt;

答案2

得分: 0

用于在Java中进行与Lucene搜索的精确匹配
`?q=&quot;Your search&quot;~0`
您可以更改整数值以定义单词之间的最大距离
`?q=&quot;Your search&quot;~2`
示例：&quot;这是一个精确匹配的很棒的示例&quot;
- `?q=&quot;awesome exemple&quot;~0` 将匹配
- `?q=&quot;awesome exact match&quot;~0` 不会匹配
- `?q=&quot;awesome exact match&quot;~2` 将匹配

英文:

For an exact match with lucene search in java

?q="Your search"~0

Your can play with the integer to define the max distance between words

?q="Your search"~2

Exemple : "This is an awesome exemple of exact match"

?q="awesome exemple"~0 will match
?q="awesome exact match"~0 won't match
?q="awesome exact match"~2 will match

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在Solr中进行带空格的精确搜索匹配

问题

答案1

答案2

如何在Java中使用静态导入`java.util.Arrays.toString`？

无法从指标查询语言 MQL – GCP 收集数据。

Karaf、cxf 和 jax-rs – 未找到任何服务

获取包含用户所有标签的hmap列表

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。