2023年5月11日 20:10:52go评论72阅读模式

英文:

How to return the top 10 more frequent column values in excel?

问题

我有一个包含三列的Excel表格，在“id”列中有一些数字并且有一些重复。如何在Excel中将“id”列中出现频率最高的前10个值获取到一个列表中？

到目前为止，我编写了以下代码：

=INDEX(B1:B10;MATCH(LARGE(FREQUENCY(B1:B10;B1:B10);2);FREQUENCY(B1:B10;B1:B10);0))

范围：B1:B10

但是这只会逐个给出第1、2、...最频繁的值，我希望得到一个包含前10个最频繁值的列表。

附注：我使用的是MacOS。

英文:

I have an excel with three columns and in the column "id" there are some numbers with some repetitions. How can I get in a list the top 10 more frequent column values ("id" column) in excel?

So far I made this code:

 =INDEX(B1:B10;MATCH(LARGE(FREQUENCY(B1:B10;B1:B10);2);FREQUENCY(B1:B10;B1:B10);0))

range: B1:B10

but it gives only one by one the 1th, 2th, ... more frequent value, what I'd like is to have an only list with the top 10 more frequent values.

PS: I have a MacOS

答案1

得分: 1

以下是翻译好的部分：

以下的内容仍然适用，即使有多个具有相同频率的 ID：

=LET(x, A2:A12, top, 3, cnts, COUNTIFS(x, x),
 TAKE(SORT(UNIQUE(HSTACK(x, cnts)), 2, -1), top, 1))

如@JosWoolley在评论中指出的，您可以使用 SORTBY 替代 SORT，这将生成一个排序公式：

=LET(x, A2:A12, top, 3, cnts, COUNTIFS(x, x), TAKE(UNIQUE(SORTBY(x, cnts, -1)), top))

您还可以使用您的方法，但您忘记在 LARGE 的第二个输入参数中使用 SEQUENCE，但它无法选择作为顶部一部分的重复频率：

=LET(A, A2:A12, top, 3, freq, FREQUENCY(A, A),
 INDEX(A, MATCH(LARGE(freq, SEQUENCE(top)), freq, 0)))

这是输出：

上述公式假定使用 Office 365，对于较旧的版本，您可以进行以下替换：

TAKE(x, top, [y]) -> INDEX(x, SEQUENCE(top), [y])
HSTACK(x, y) -> CHOOSE({1,2}, x, y)

英文:

The following works even, there are several ids with the same frequency:

=LET(x, A2:A12,top,3, cnts,COUNTIFS(x,x),
 TAKE(SORT(UNIQUE(HSTACK(x,cnts)),2,-1),top,1))

As @JosWoolley pointed out in the comment section, you can use SORTBY instead of SORT which produces a sorter formula:

=LET(x,A2:A12, top,3, cnts,COUNTIFS(x,x),TAKE(UNIQUE(SORTBY(x,cnts,-1)),top))

You can also use your approach, you missed using SEQUENCE as the second input argument for LARGE, but it doesn't work for repeated frequencies being selected as part of the top:

=LET(A,A2:A12, top,3, freq,FREQUENCY(A,A),
 INDEX(A,MATCH(LARGE(freq,SEQUENCE(top)),freq,0)))

Here is the output:

The above formulas assume O365, for older versions, you can make the following substitutions:

TAKE(x,top,[y]) -> INDEX(x, SEQUENCE(top),[y])
HSTACK(x,y) -> CHOOSE({1,2},x,y)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在Excel中返回前10个最频繁的列值？

问题

答案1

如何在Excel工作表中将具有多个标题行的值矩阵进行”pivot_wider/melt”操作？

如何修改VBA函数以在不打开工作簿的情况下访问另一个工作簿中的数据？

从Outlook收件箱文件夹提取数据到Excel。

如何在VBA Excel中将数据格式化为可变大小的表格？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论