2023-02-06 08:36:56

Python pandas: How to match data between two dataframes

Question

The first DataFrame (df1) looks like this:

Result	A	B	C
2021-12-31	False	True	True
2022-01-01	False	False	True
2022-01-02	False	True	False
2022-01-03	True	False	True

df2 is an updated version of df1: the dates are new, and it may contain additional columns. It looks like this:

Result	A	B	C	D
2022-01-04	False	False	True	True
2022-01-05	True	False	True	True
2022-01-06	False	True	False	True
2022-01-07	False	False	True	True

I want to combine the two DataFrames, but I don't know how to do it.
I would like to get a result like the following:

Result	A	B	C	D
2021-12-31	False	True	True	NaN
2022-01-01	False	False	True	NaN
2022-01-02	False	True	False	NaN
2022-01-03	True	False	True	NaN
2022-01-04	False	False	True	True
2022-01-05	True	False	True	True
2022-01-06	False	True	False	True
2022-01-07	False	False	True	True

Thank you very much!

Answer 1

Score: 0

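Use pd.concat to stack the two DataFrames. Passing ignore_index=True renumbers the rows from 0, which is fine when Result is an ordinary column; if the dates are stored in the index, omit it so they are kept.

df_new = pd.concat([df1, df2], ignore_index=True)

Columns that exist in only one DataFrame (here D) are filled with NaN in the rows that lack them.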
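As a minimal sketch of the approach above, the following rebuilds the question's sample data and concatenates it both ways. The question does not say whether Result is the index or a regular column, so the construction of df1 and df2 here is an assumption made for illustration, as are the names combined and combined_flat.

```python
import pandas as pd

# Hypothetical reconstruction of the question's data, assuming the dates
# are stored in the index (named "Result").
df1 = pd.DataFrame(
    {
        "A": [False, False, False, True],
        "B": [True, False, True, False],
        "C": [True, True, False, True],
    },
    index=pd.Index(
        ["2021-12-31", "2022-01-01", "2022-01-02", "2022-01-03"], name="Result"
    ),
)
df2 = pd.DataFrame(
    {
        "A": [False, True, False, False],
        "B": [False, False, True, False],
        "C": [True, True, False, True],
        "D": [True, True, True, True],
    },
    index=pd.Index(
        ["2022-01-04", "2022-01-05", "2022-01-06", "2022-01-07"], name="Result"
    ),
)

# Dates in the index: concatenate without ignore_index so they are kept.
# Column D is missing from df1, so its rows get NaN there.
combined = pd.concat([df1, df2])

# Dates in an ordinary column: ignore_index=True only renumbers the rows.
combined_flat = pd.concat(
    [df1.reset_index(), df2.reset_index()], ignore_index=True
)

print(combined)
```

Because column D then mixes booleans with NaN, pandas can no longer store it as a plain bool column; whether that matters depends on what is done with the result afterwards.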
