2023年5月13日 13:38:58go评论89阅读模式

英文:

Julia: Remove or add key from GroupedDataFrame

问题

Sure, here are the translated parts:

Case 1

df = DataFrame(rand(160, 3), :auto)
rename!(df, [:A, :B, :Z])
@. df.B = ifelse(rand() &lt; 0.5, 1, 2)
@. df.A = ifelse(rand() &lt; 0.5, 1, 2)
# I group here by A and B
gd = groupby(df, [:A, :B])
#=
	我根据A和B对df执行分组操作。
	...但现在我只需要基于B执行操作。
=#

How to remove key A?

gd.removegroup([:A])
gd.removekey([:A])
gd.ungroup([:A])

Case 2

df = DataFrame(rand(160, 3), :auto)
rename!(df, [:A, :B, :Z])
@. df.B = ifelse(rand() &lt; 0.5, 1, 2)
@. df.A = ifelse(rand() &lt; 0.5, 1, 2)
# I group here by B
gd = groupby(df, [:B])
#=
	我根据B对df执行分组操作。
	...但现在我需要基于B和A执行操作。
=#

How to add key A?

groupby(gd, [:A]) ❌❌❌❌
gd.addkey([:A])
gd.addgroup([:A])

Please note that the ❌❌❌❌ indicates that the specific line is not a valid Julia code for adding key A.

英文:

Imagine I have a df on which I need to perform operations based on grouped columns. But I need two perform actions based on two groupings.

Having cols A, B, C I need to do operation x to df grouped by A, B and operation y to df grouped only by B. Do I need to group the dataframe twice?

Case 1

df=DataFrame(rand(160,3), :auto)
rename!(df,[:A,:B,:Z])
@. df.B = ifelse(rand() &lt; 0.5, 1, 2)
@. df.A = ifelse(rand() &lt; 0.5, 1, 2)
# I group here by A and B
gd = groupby(df, [:A, :B])
#=
	My operations with df grouped by A and B.
	... But now I need to perform only with B
=#

How to remove key A?

gd.removegroup([:A])
gd.removekey([:A])
gd.ungroup([:A])

Case 2

df=DataFrame(rand(160,3), :auto)
rename!(df,[:A,:B,:Z])
@. df.B = ifelse(rand() &lt; 0.5, 1, 2)
@. df.A = ifelse(rand() &lt; 0.5, 1, 2)
# I group here by B
gd = groupby(df, [:B])
#=
	My operations with df grouped by B.
	... But now I need to perform with B and A
=#

How to add key A?

groupby(gd, [:A]) ❌❌❌❌
gd.addkey([:A])
gd.addgroup([:A])

答案1

得分: 0

Sure, here's the translation of the provided text:

"我需要两次分组数据框吗？

需要。两次分组与添加/删除分组一样编码量相同。只需执行以下操作：

gd1 = groupby(df, [:A, :B])
gd2 = groupby(df, :B)

由于分组的数据框是源 df 的视图，如果您更改 gd1，更改将自动反映在 gd2 中。

您唯一需要记住的是，在更改 df 时，不应更改列 :A 和 :B，因为更改分组列可能会使分组无效。"

英文:

> Do I need to group the dataframe twice?

Yes. Grouping twice is the same amount of coding as adding/removing group. Just do e.g.:

gd1 = groupby(df, [:A, :B])
gd2 = groupby(df, :B)

Since grouped data frame is a view of source df, if you mutate gd1 the changes will be reflected in gd2 automatically.

The only thing you need to keep in mind that when mutating df you should not mutate columns :A and :B as mutating grouping columns could invalidate groupings.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

从GroupedDataFrame中删除或添加键。

问题

Case 1

Case 2

Case 1

Case 2

答案1

Vectorize the assignment of a column in a pandas dataframe where a custom index has many rows and the column value is set using all rows in the index

如何告诉Julia在集群上启动作业时利用多个节点？

How can I replace values in one column with values from another out of several options, when the first column contains the name of the other column?

如何在标题之前写入包含配置行的数据框到 CSV 文件中

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

发表评论