问题

我想为波斯语文本创建一个“index”，并为其创建词干处理器。以下是如何将“PersianStemmer” Python库实现到Elasticsearch的“analyzer”中的示例：

PUT my_index
{
    "settings": {
        "analysis": {
            "filter": {
                "persian_stemmer": {
                    "type": "stemmer",
                    "name": "persian"
                }
            },
            "analyzer": {
                "persian_analyzer": {
                    "type": "custom",
                    "tokenizer": "standard",
                    "filter": ["lowercase", "persian_stemmer"]
                }
            }
        }
    },
    "mappings": {
        "properties": {
            "description": {
                "type": "text",
                "analyzer": "persian_analyzer"
            }
        }
    }
}

此示例将创建一个名为“persian_analyzer”的自定义分析器，该分析器使用标准分词器，然后应用小写转换和波斯文词干处理器。描述字段使用此分析器进行分析。请确保您已经安装了“PersianStemmer” Python库，并且已将其集成到您的Elasticsearch环境中。

英文:

I want to create an index for persian-language text and I want to create stemmer for that, this is english-stemming for description field

PUT my_index
{
    &quot;mappings&quot;: {
      &quot;properties&quot;: {
        &quot;description&quot;: {
          &quot;type&quot;: &quot;text&quot;,
          &quot;analyzer&quot;: &quot;english&quot;
        }
      }
    }, 
    &quot;settings&quot;: {
      &quot;analysis&quot;:{
        &quot;filter&quot;: {
          &quot;english_stemmer&quot;: {
            &quot;type&quot;:       &quot;stemmer&quot;,
            &quot;language&quot;:   &quot;english&quot;
          }
        }
      }
    }
}

Now I want to know how can implement the PersianStemmer python library to elasticsearch analyzer?

答案1

得分: 1

你需要为此创建自定义分析器：

PUT my_index
{
   "settings": {
      "analysis": {
         "filter": {
            "persian_stemmer": {
               "type": "stemmer",
               "language": "persian"
            }
         },
         "analyzer": {
            "persian_analyzer": {
               "tokenizer": "standard",
               "filter": [
                  "lowercase",
                  "persian_stemmer"
               ]
            }
         }
      }
   },
   "mappings": {
      "properties": {
         "description": {
            "type": "text",
            "analyzer": "persian_analyzer"
         }
      }
   }
}

英文:

You need to create custom analyzer for that:

    PUT my_index
{
  &quot;settings&quot;: {
    &quot;analysis&quot;: {
      &quot;filter&quot;: {
        &quot;persian_stemmer&quot;: {
          &quot;type&quot;: &quot;stemmer&quot;,
          &quot;language&quot;: &quot;persian&quot;
        }
      },
      &quot;analyzer&quot;: {
        &quot;persian_analyzer&quot;: {
          &quot;tokenizer&quot;: &quot;standard&quot;,
          &quot;filter&quot;: [
            &quot;lowercase&quot;,
            &quot;persian_stemmer&quot;
          ]
        }
      }
    }
  },
  &quot;mappings&quot;: {
    &quot;properties&quot;: {
      &quot;description&quot;: {
        &quot;type&quot;: &quot;text&quot;,
        &quot;analyzer&quot;: &quot;persian_analyzer&quot;
      }
    }
  }
}

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用Python库来自定义Elasticsearch中的过滤器分析器。

问题

答案1

覆盖 Numpy 数组内存（In-Place）

Pyautogui无法输入表情符号。

在Python中为多个父类调用Super()init

为什么 pandas 的 `date_range` 会向上取整到下个月？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论