June 25, 2023 20:11:27

Counting by ranges of double values in Elasticsearch with Spring Boot using aggregation

Question

I'm trying to count records in Elasticsearch whose score falls within specific ranges.

I have 3 ranges over a double-valued field:

low (0-4]
medium (4-7]
high (7-10]

The object looks something like this:

{
    "company": "companyName", // string
    "score": 6.2 // double
}

For a given company, say company1, I would like to get the counts for all score ranges, returned as an object like the following:

{"high":20, "medium":10, "low":3}

I have found a way to do it using the API:

public interface ItemRepository extends ElasticsearchRepository<Item, String> {

    @Query("{\"bool\":{\"must\":[{\"match\":{\"userId\":100}},{\"range\":{\"score\":{\"gte\":?0,\"lt\":?1}}}]}}")
    long countByScoreRangeAndUserId(int lowerBound, int upperBound);
}

@Service
public class ItemService {

    @Autowired
    private ItemRepository itemRepository;

    public void getScoreCountRanges() {
        // Each call below is a separate request to Elasticsearch.
        long range1Count = itemRepository.countByScoreRangeAndUserId(0, 40);
        long range2Count = itemRepository.countByScoreRangeAndUserId(40, 70);
        long range3Count = itemRepository.countByScoreRangeAndUserId(70, 101); // inclusive lower bound, exclusive upper bound
        System.out.println("Range 0-39 Count: " + range1Count);
        System.out.println("Range 40-69 Count: " + range2Count);
        System.out.println("Range 70-100 Count: " + range3Count);
    }
}

However, I would like to do it in a single pass over the database rather than counting three times.

Which way is better and faster?

Thanks a lot!

Answer 1

Score: 0

Try using the NativeSearchQueryBuilder:

@Service
public class ItemService {

    @Autowired
    private ElasticsearchOperations elasticsearchOperations;

    public void getScoreCountRanges() {
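        // Single request: the range aggregation buckets the matched documents by score.
        // Each range includes its lower bound and excludes its upper bound.
        SearchQuery searchQuery = new NativeSearchQueryBuilder()
                .withQuery(QueryBuilders.matchQuery("userId", 100))
                .addAggregation(AggregationBuilders.range("score_ranges")
                        .field("score")
                        .addUnboundedTo("low", 4)
                        .addRange("medium", 4, 7)
                        .addUnboundedFrom("high", 7)
                )
                .build();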

        Aggregations aggregations = elasticsearchOperations.query(searchQuery, SearchResponse::getAggregations);
        Range rangeAggregation = aggregations.get("score_ranges");

        long lowCount = rangeAggregation.getBucketByKey("low").getDocCount();
        long mediumCount = rangeAggregation.getBucketByKey("medium").getDocCount();
        long highCount = rangeAggregation.getBucketByKey("high").getDocCount();

        System.out.println("Low Range Count: " + lowCount);
        System.out.println("Medium Range Count: " + mediumCount);
        System.out.println("High Range Count: " + highCount);
    }
}
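
One detail worth checking against the question: the Elasticsearch range aggregation includes the lower bound and excludes the upper bound of each range, while the ranges in the question are written as (0-4], (4-7], (7-10], with the upper bound included. The sketch below is plain Java with no Elasticsearch dependency; the class RangeBucketSketch and its methods are hypothetical names used only to illustrate how the boundaries above would assign scores. It shows that a score of exactly 4.0 is counted as "medium" and 7.0 as "high".

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class RangeBucketSketch {

    // Hypothetical helper: assigns a score to a bucket using half-open ranges,
    // lower bound inclusive and upper bound exclusive.
    static String bucketOf(double score) {
        if (score < 4) {
            return "low";      // everything below 4
        } else if (score < 7) {
            return "medium";   // 4 up to, but not including, 7
        }
        return "high";         // 7 and above
    }

    // Counts scores per bucket, keeping the buckets in a fixed order.
    static Map<String, Long> countByRange(List<Double> scores) {
        Map<String, Long> counts = new LinkedHashMap<>();
        counts.put("low", 0L);
        counts.put("medium", 0L);
        counts.put("high", 0L);
        for (double score : scores) {
            counts.merge(bucketOf(score), 1L, Long::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // 4.0 and 7.0 sit exactly on the boundaries.
        System.out.println(countByRange(List.of(3.9, 4.0, 6.2, 7.0, 9.5)));
        // prints {low=1, medium=2, high=2}
    }
}
```

If the upper bounds really must be inclusive, one option to look into is a filters aggregation with one range query per bucket using gt and lte, which still needs only a single request.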

Answer 2

Score: 0

I will answer this with working code in case someone finds it useful in the future.

    @Override
    public Scores getScoreCountRanges() {
        Scores scores = new Scores();
        String aggregationName = "score_ranges";
        NativeSearchQuery searchQuery = new NativeSearchQueryBuilder()
                .withQuery(QueryBuilders.matchQuery("userId", "DESIRED-USER-ID"))
                .withAggregations(AggregationBuilders.range(aggregationName)
                        .field("scores")
                        .addRange(LOW, LOW_LOWER_BOUND, MEDIUM_LOWER_BOUND)
                        .addRange(MEDIUM, MEDIUM_LOWER_BOUND, HIGH_LOWER_BOUND)
                        .addRange(HIGH, HIGH_LOWER_BOUND, HIGH_UPPER_BOUND)
                )
                .build();

        SearchHits<?> searchHits = operations.search(searchQuery, ClassOfData.class);
        if (!searchHits.hasAggregations())
            return scores;
        AggregationsContainer<?> aggregationsContainer = searchHits.getAggregations();
        if (aggregationsContainer == null) {
            return scores;
        }
        Aggregations aggregations = (Aggregations) aggregationsContainer.aggregations();
        ParsedRange rangeAggregation = aggregations.get(aggregationName);
        rangeAggregation.getBuckets().forEach(bucket -> fillScores(scores, bucket.getKey().toString(), bucket.getDocCount()));
        return scores;
    }

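This code performs what is called a range aggregation. It returns buckets containing the records that match the matchQuery and fall inside the ranges you specified.

We use each bucket's document count to find out how many matching records fall in each specific range.

You can use fillScores as you see fit or do something else; returning a map of key/value pairs is also a good choice.

Hope this helps.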
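
As a rough illustration of the map-based alternative, the sketch below collects bucket keys and document counts into a Map<String, Long>, which matches the {"high":20, "medium":10, "low":3} shape asked for in the question. It is plain Java without the Elasticsearch client: BucketView is a hypothetical stand-in for an aggregation bucket that models only the key and the document count, and ScoreCountsSketch and toCounts are illustrative names, not part of any library.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class ScoreCountsSketch {

    // Hypothetical stand-in for an aggregation bucket: only key and doc count are modelled.
    static final class BucketView {
        final String key;
        final long docCount;

        BucketView(String key, long docCount) {
            this.key = key;
            this.docCount = docCount;
        }
    }

    // Collects the bucket counts into a map keyed by bucket name, preserving bucket order.
    static Map<String, Long> toCounts(List<BucketView> buckets) {
        Map<String, Long> counts = new LinkedHashMap<>();
        for (BucketView bucket : buckets) {
            counts.put(bucket.key, bucket.docCount);
        }
        return counts;
    }

    public static void main(String[] args) {
        List<BucketView> buckets = List.of(
                new BucketView("high", 20),
                new BucketView("medium", 10),
                new BucketView("low", 3));
        System.out.println(toCounts(buckets)); // prints {high=20, medium=10, low=3}
    }
}
```

In the real method, the same loop would run over rangeAggregation.getBuckets(), reading the key and document count from each bucket in place of BucketView.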
