Question

I would like to display the duplicate rows of a dataframe to understand them better. Specifically, I would like to group the duplicated rows.

This example hopefully clarifies what I want to do. Assume we have the dataframe below:

CC BF FA WC Strength
1  2  3  4  1
2  3  4  5  6
1  2  3  4  8
1  2  3  4  4
2  3  4  5  7

Here rows 1, 3, 4 and rows 2, 5 are duplicates of each other once the Strength column is ignored. I would like to get a new dataframe that displays:

CC BF FA WC Strength_min Strength_max Count
1  2  3  4  1            8            3
2  3  4  5  6            7            2

Answer 1

Score: 4

You need groupby.agg with named aggregations, using the output of Index.difference (every column except Strength) as the grouper:

(df.groupby(list(df.columns.difference(['Strength'], sort=False)))['Strength']
   .agg(**{'Strength_min': 'min', 'Strength_max': 'max', 'Count': 'count'})
   .reset_index()
)

Output:

   CC  BF  FA  WC  Strength_min  Strength_max  Count
0   1   2   3   4             1             8      3
1   2   3   4   5             6             7      2
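
For reference, here is a self-contained sketch of the same approach that can be pasted and run as is. The dataframe is rebuilt from the values in the question; the names key_cols, summary, and duplicates_only are only illustrative and do not come from the original answer.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "CC": [1, 2, 1, 1, 2],
    "BF": [2, 3, 2, 2, 3],
    "FA": [3, 4, 3, 3, 4],
    "WC": [4, 5, 4, 4, 5],
    "Strength": [1, 6, 8, 4, 7],
})

# Group by every column except Strength, keeping the original column order.
key_cols = list(df.columns.difference(["Strength"], sort=False))

# Named aggregation: each keyword becomes an output column.
summary = (
    df.groupby(key_cols)["Strength"]
      .agg(Strength_min="min", Strength_max="max", Count="count")
      .reset_index()
)

# Illustrative extra step: keep only the key combinations that occur more than once.
duplicates_only = summary[summary["Count"] > 1]

print(summary)
```

Note that 'count' counts only non-null Strength values. If Strength may contain NaN and every row should be counted, 'size' is probably the better aggregation.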
