2023-06-29 07:14:55

Filter a table to remove duplicates from one column only, bottom to top

Question

I have some data that looks like this:

Column A	Column B	Column C
a	z	ba
b	c	ba
d	w	vo
g	z	ba

There are duplicate values in columns B and C, but I only want to remove duplicates from col B. The result should look like this, keeping the last row that contains a duplicate in column B:

Column A	Column B	Column C
b	c	ba
d	w	vo
g	z	ba

I need to use =filter and not the Remove Duplicates tool. So far, my formulas only either return col B or display an error. I've tried:

=unique(B2:B5)

=filter(A2:C5,unique(B2:B5)=B2:B5)

Answer 1

Score: 7

Use XMATCH with its fourth parameter (search_mode) set to -1, which searches from last to first:

=LET(
    ζ, A2:C5,
    ξ, INDEX(ζ, , 2),
    FILTER(ζ, XMATCH(ξ, ξ, , -1) = SEQUENCE(ROWS(ζ)))
)
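
For readers who want to see the logic outside Excel, here is a minimal Python sketch of what the formula appears to do: a row is kept only when the last-to-first match position of its column B value equals the row's own position. The helper names (last_match_position, keep_last_per_key) are hypothetical and only illustrate the idea; they are not part of Excel or of the answer above.

```python
def last_match_position(values, target):
    """Return the 1-based position of the last occurrence of target.

    Loosely mimics XMATCH(target, values, , -1), which searches last to first.
    """
    for pos in range(len(values), 0, -1):
        if values[pos - 1] == target:
            return pos
    raise ValueError(f"{target!r} not found")


def keep_last_per_key(rows, key_col):
    """Keep each row whose key's last-match position is the row's own position."""
    keys = [row[key_col] for row in rows]
    return [
        row
        for pos, row in enumerate(rows, start=1)
        if last_match_position(keys, row[key_col]) == pos
    ]


# Sample data from the question (columns A, B, C).
rows = [
    ("a", "z", "ba"),
    ("b", "c", "ba"),
    ("d", "w", "vo"),
    ("g", "z", "ba"),
]

result = keep_last_per_key(rows, key_col=1)
for row in result:
    print(row)
```

On the question's sample data this drops the first "z" row and keeps the remaining three rows in their original order.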

Answer 2

Score: 4

Alternatively, you could try:

Formula in E1:

=CHOOSEROWS(A1:C4,SORT(XMATCH(UNIQUE(B1:B4),B1:B4,,-1)))

Answer 3

Score: 3

Probably there are simpler ways of achieving it:

=LET(in, A1:C4, seq,SEQUENCE(ROWS(in)), B,INDEX(in,,2),
 cnt, MAP(B,seq,LAMBDA(x,s, ROWS(FILTER(B, (seq <= s) * (B=x))))),
 idx,UNIQUE(MAP(B,LAMBDA(x, FILTER(seq, (B=x) * (cnt=MAX(FILTER(cnt, B=x))))))),
 CHOOSEROWS(in,SORT(idx)))

Note: Adding SORT ensures the expected ordering in the output. Credit to @JvdV, whose solution suggested it.

The name cnt uses MAP to compute, for each row, a running count of how many times that row's B value has appeared so far, including the current row. The name idx finds, for each B value, the index position where that running count reaches its maximum, which is the value's last occurrence. Finally, CHOOSEROWS selects the rows at the identified index positions.
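
The running-count idea described above can be sketched in Python as follows. This is only an illustration of the logic under the assumption that the keys are the column B values in row order; the function names (running_counts, last_occurrence_indices) are hypothetical and do not correspond to any Excel function.

```python
def running_counts(keys):
    """For each position, count how often that key has appeared so far (inclusive)."""
    seen = {}
    counts = []
    for key in keys:
        seen[key] = seen.get(key, 0) + 1
        counts.append(seen[key])
    return counts


def last_occurrence_indices(keys):
    """Return sorted 1-based indices where each key's running count is at its maximum."""
    counts = running_counts(keys)
    max_count = {}
    for key, count in zip(keys, counts):
        max_count[key] = max(max_count.get(key, 0), count)
    # A set removes repeats (the role UNIQUE plays); sorted restores row order (SORT).
    return sorted(
        {
            index
            for index, (key, count) in enumerate(zip(keys, counts), start=1)
            if count == max_count[key]
        }
    )


column_b = ["z", "c", "w", "z"]  # column B from the question's sample data
print(running_counts(column_b))
print(last_occurrence_indices(column_b))
```

The returned indices are the row positions that would be passed to CHOOSEROWS in the formula.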
