2023-03-21 01:11:56

Comparing subset of rows in pandas

Question

I was wondering if there is a nice way to compare a subset of rows in pandas.
Let's say I have a df with:

id	in_test	value
1	True	5
2	True	5
1	False	7
2	False	8

I would like the resulting df to contain the id and the difference (or percentage change) in value going from in_test True to in_test False.

I know I could pivot the table and then perform row-wise calculations, or create a filtered df, merge it with another filtered df, and then compute the result row-wise.

I was wondering if there is a Pythonic way of doing this in one line, probably with a pandas function?

The output for the percentage difference would be:

id	value
1	+40%
2	+60%

The output for the difference would be:

id	value
1	2
2	3

(or -2 and -3; I guess I would have to define some kind of order)

Answer 1

Score: 1

df.groupby('id')['value'].apply(lambda x: x.diff().values[1]).reset_index()

difference

df.groupby('id')['value'].apply(lambda x: x.pct_change().values[1] * 100).reset_index()

percentage difference
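
Note that both one-liners rely on the in_test == True row coming before the in_test == False row within each id, as it does in the sample data. If that ordering is not guaranteed, a minimal sketch along the following lines makes the direction explicit. It assumes exactly one True row and one False row per id; the names `test`, `control`, and `result` are only illustrative and do not come from the original post.

```python
import pandas as pd

# Sample data from the question.
df = pd.DataFrame({
    "id": [1, 2, 1, 2],
    "in_test": [True, True, False, False],
    "value": [5, 5, 7, 8],
})

# Split on in_test and index both halves by id so that the arithmetic
# below aligns on id rather than on row position.
test = df[df["in_test"]].set_index("id")["value"]
control = df[~df["in_test"]].set_index("id")["value"]

# Change going from in_test == True to in_test == False.
diff = control - test
pct_change = diff * 100 / test

result = pd.DataFrame({"diff": diff, "pct_change": pct_change}).reset_index()
print(result)
```

For the sample data this should give differences of 2 and 3 and percentage changes of 40 and 60, matching the output asked for in the question. Swapping `test` and `control` in the subtraction reverses the sign.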
