问题

我最初的pandas数据框如下所示：

ID	参数	开始日期	结束日期
1	4	2023年05月05日	2023年05月07日
2	2	2023年05月05日	2023年05月06日
3	1	2023年05月08日	2023年05月09日

能否从数据框中获得以下结构？实际上，我想展开时间段。

ID	参数	日期
1	4	2023年05月05日
1	4	2023年05月06日
1	4	2023年05月07日
2	2	2023年05月05日
2	2	2023年05月06日
3	1	2023年05月08日
3	1	2023年05月09日

英文:

My initial pandas df looks as follows:

ID	Parameter	Start	End
1	4	05.05.2023	07.05.2023
2	2	05.05.2023	06.05.2023
3	1	08.05.2023	09.05.2023

Is it possible to get the following structure out of the dataframe? In fact I want to unpivot the time period.

ID	Parameter	Date
1	4	05.05.2023
1	4	06.05.2023
1	4	07.05.2023
2	2	05.05.2023
2	2	06.05.2023
3	1	08.05.2023
3	1	09.05.2023

答案1

得分: 3

将两列转换为日期时间，使用 to_datetime 函数，通过 Index.repeat 创建新行，将差值列转换为天数，使用 Series.dt.days，然后使用 GroupBy.cumcount 计算最后使用计数器，将其转换为天的时间差，添加到 Start 列中，使用 to_timedelta。DataFrame.pop 在使用后移除列：

df['Start'] = pd.to_datetime(df['Start'], dayfirst=True)
df['End'] = pd.to_datetime(df['End'], dayfirst=True)
df = df.loc[df.index.repeat(df.pop('End').sub(df['Start']).dt.days + 1)]
td = pd.to_timedelta(df.groupby(level=0).cumcount(), unit='d')
df['Date'] = df.pop('Start').add(td).dt.strftime('%d.%m.%Y')
df = df.reset_index(drop=True)
print(df)
   ID  Parameter        Date
0   1          4  05.05.2023
1   1          4  06.05.2023
2   1          4  07.05.2023
3   2          2  05.05.2023
4   2          2  06.05.2023
5   3          1  08.05.2023
6   3          1  09.05.2023

英文:

Convert both columns to datetimes by to_datetime, create new rows by Index.repeat with subtraction columns converted to days by Series.dt.days, last use counter by GroupBy.cumcount and convert to days timedeltas added to Start column by to_timedelta, DataFrame.pop is remove column after using:

df[&#39;Start&#39;] = pd.to_datetime(df[&#39;Start&#39;], dayfirst=True)
df[&#39;End&#39;] = pd.to_datetime(df[&#39;End&#39;], dayfirst=True)
df = df.loc[df.index.repeat(df.pop(&#39;End&#39;).sub(df[&#39;Start&#39;]).dt.days + 1)]
td = pd.to_timedelta(df.groupby(level=0).cumcount(), unit=&#39;d&#39;)
df[&#39;Date&#39;] = df.pop(&#39;Start&#39;).add(td).dt.strftime(&#39;%d.%m.%Y&#39;)
df = df.reset_index(drop=True)
print (df)
   ID  Parameter        Date
0   1          4  05.05.2023
1   1          4  06.05.2023
2   1          4  07.05.2023
3   2          2  05.05.2023
4   2          2  06.05.2023
5   3          1  08.05.2023
6   3          1  09.05.2023

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何根据日期更改df的结构

问题

答案1

Python asyncio sleep is big memory usage.

应用高亮于数据透视表

检测在Polars中未给定唯一性的行。

在Locust中每隔x秒后台运行一个请求

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。