groupby() 根据列 A 进行分组，统计列 C，以及统计列 B 的唯一值数量。

huangapple

117266
文章

0
评论

2023年8月5日 09:52:01go评论118阅读模式

英文:

groupby() Col A, count Col C, and count unique column B

问题

我有一个包含多列的数据框。我想要按列A（这是一个人的名字）分组。然后，我想要按列A分组计算列C中的总行数。我还想要按列A分组计算列B中的唯一行数。

在Python中是否有一种方法可以做到这一点？

英文:

I have a dataframe with multiple columns. I want to groupby column A (which is a person's name). Then I want to count total number of rows in column C grouped by column A. I also want to count number of unique rows in Column B grouped by column A.

Is there a way to do this in Python?

答案1

得分: 1

这：

df.groupby('A').agg({'C':'size', 'B':'nunique'})

尽管实际上，C 中的行数应该与 B 中的行数相同。这也可以工作：

df.groupby('A')['B'].agg(['size','nunique'])

英文:

This:

df.groupby(&#39;A&#39;).agg({&#39;C&#39;:&#39;size&#39;, &#39;B&#39;:&#39;nunique&#39;})

although really, number of rows in C should just be the same with number of rows in B. This should also work

df.groupby(&#39;A&#39;)[&#39;B&#39;].agg([&#39;size&#39;,&#39;nunique&#39;])

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2023年8月5日 09:52:01
转载请务必保留本文链接：https://go.coder-hub.com/76839846.html

count
group-by
pandas
pivot
python

Add the mean in box plots with plotly express?

go 88 05/10

Sphinx不会记录复杂的Enum类。

go 165 01/06

Selecting Item from One table and Iterate in another table to see if It exists and Add a column Label

go 87 07/10

计算Pandas数据帧中与第一个值相关的时间差。

go 87 03/09

groupby() 根据列 A 进行分组，统计列 C，以及统计列 B 的唯一值数量。

问题

答案1

Add the mean in box plots with plotly express?

Sphinx不会记录复杂的Enum类。

Selecting Item from One table and Iterate in another table to see if It exists and Add a column Label

计算Pandas数据帧中与第一个值相关的时间差。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。