2023年5月17日 23:54:10go评论92阅读模式

英文:

Calculate time difference in seconds in pandas if row in other column matches

问题

# 我在一个pandas DataFrame中使用3列（'time'，'SEC'，'DeviceName'）进行工作。 
# 使用以下代码计算'time'列中行之间的差异，并将结果赋给'SEC'列：
df['SEC'] = df['time'].diff().dt.total_seconds()
# 'DeviceName'列可能包含多种不同的设备，因此我需要修改代码，
# 只有在设备名称与前一行匹配时才执行计算，否则将'SEC'列赋值为0。
# 例如：
# 时间                秒         设备名称
# 4/18/2023 2:43:00             Applied_AA-12
# 4/18/2023 3:13:00   1800      Applied_AA-12  # 计算因为设备名称与前一行匹配
# 4/18/2023 3:35:53   0         Applied_AA-14  # 不计算因为设备名称与前一行不匹配
# 4/18/2023 3:36:03   10        Applied_AA-14  # 计算因为设备名称与前一行匹配

英文:

I'm working with 3 columns ('time', 'SEC', 'DeviceName') in a pandas DataFrame. I'm using the following code to calculate the differences between rows in the 'time' column and assign to the 'SEC' column:

df[&#39;SEC&#39;] = df[&#39;time&#39;].diff().dt.total_seconds()

The 'DeviceName' column can have several different devices, so I need to modify this to only perform the calculation if the device name matches the previous row, otherwise assign a 0 to 'SEC'.

For example:

time                    SEC       DeviceName
4/18/2023 2:43:00                 Applied_AA-12
4/18/2023 3:13:00       1800      Applied_AA-12  # calculate because the device name matches the previous row
4/18/2023 3:35:53       0         Applied_AA-14  # don&#39;t calculate because the device name doesn&#39;t match the previous row
4/18/2023 3:36:03       10        Applied_AA-14  # calculate because the device name matches the previous row

答案1

得分: 1

你可以使用 GroupyBy.diff ：

df["SEC"] = df.groupby("DeviceName")["time"].diff().dt.total_seconds().fillna(0)
df.at[0, "SEC"] = np.nan # 这是可选的吗？

输出：

print(df)
                 time     DeviceName     SEC
0 2023-04-18 02:43:00  Applied_AA-12     NaN
1 2023-04-18 03:13:00  Applied_AA-12 1800.00
2 2023-04-18 03:35:53  Applied_AA-14    0.00
3 2023-04-18 03:36:03  Applied_AA-14   10.00

英文:

You can use GroupyBy.diff :

df[&quot;SEC&quot;] = df.groupby(&quot;DeviceName&quot;)[&quot;time&quot;].diff().dt.total_seconds().fillna(0)
df.at[0, &quot;SEC&quot;] = np.nan # is this optional ?

Output :

print(df)
                 time     DeviceName     SEC
0 2023-04-18 02:43:00  Applied_AA-12     NaN
1 2023-04-18 03:13:00  Applied_AA-12 1800.00
2 2023-04-18 03:35:53  Applied_AA-14    0.00
3 2023-04-18 03:36:03  Applied_AA-14   10.00

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Calculate time difference in seconds in pandas if row in other column matches.

问题

答案1

如何在生产环境中为不同的Python包创建不同的环境

请求的网址未找到。在pythonanywhere中出现了404错误。

You can use scikit-learn K-Means Clustering的时候，如何提取原始数据域中的质心?

Deadlock with Django / MYSQL and filter on select_for_update.

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。