2023年7月20日 19:25:44go评论110阅读模式

英文:

How to add two pandas data frames and keep both indexes

问题

以下是翻译好的部分：

问题： 什么是添加两个 Pandas 数据框并保持两者索引的最优雅和高效的方法？

表格 1

索引	Col1	Col2
Sample1	0.5	1.0
Sample2	0.0	0.5

表格 2

索引	Col1	Col2
Sample3	0.0	1.0
Sample4	1.0	1.0

结果表格

索引	Col1	Col2
[Sample1, Sample3]	0.5	2.0
[Sample2, Sample4]	1.0	1.5

import pandas as pd
table_one = pd.DataFrame({'Col1': [0.5, 1.0],
                          'Col2': [0.0, 0.5]},
                          index=['Sample1', 'Sample2'])
table_two = pd.DataFrame({'Col1': [0.0, 1.0],
                          'Col2': [1.0, 1.0]},
                          index=['Sample3', 'Sample4'])
table_result = pd.DataFrame({'Col1': [0.5, 2.0],
                          'Col2': [1.0, 1.5]},
                          index=[['Sample1','Sample3'], ['Sample2','Sample4']])
# 请在此处插入解决方案...

英文:

What is the most elegant and efficient way of adding two pandas data frames and keep both indexes?

Table 1

Index	Col1	Col2
Sample1	0.5	1.0
Sample2	0.0	0.5

Table 2

Index	Col1	Col2
Sample3	0.0	1.0
Sample4	1.0	1.0

Result Table

Index	Col1	Col2
[Sample1, Sample3]	0.5	2.0
[Sample2, Sample4]	1.0	1.5

import pandas as pd
table_one = pd.DataFrame({&#39;Col1&#39;: [0.5, 1.0],
                          &#39;Col2&#39;: [0.0, 0.5]},
                          index=[&#39;Sample1&#39;, &#39;Sample2&#39;])
table_two = pd.DataFrame({&#39;Col1&#39;: [0.0, 1.0],
                          &#39;Col2&#39;: [1.0, 1.0]},
                          index=[&#39;Sample3&#39;, &#39;Sample4&#39;])
table_result = pd.DataFrame({&#39;Col1&#39;: [0.5, 2.0],
                          &#39;Col2&#39;: [1.0, 1.5]},
                          index=[[&#39;Sample1&#39;,&#39;Sample3&#39;], [&#39;Sample2&#39;,&#39;Sample4&#39;]])
# Please insert solution here...

答案1

得分: 2

我会使用以下代码：

out = table_one.add(table_two.set_axis(table_one.index))
out.index = zip(table_one.index, table_two.index)

对于任意数量的表格泛化：

from functools import reduce
tables = [table_one, table_two]
idx = list(zip(*(d.index for d in tables)))
out = reduce(lambda a, b: a.add(b), (d.set_axis(idx) for d in tables))

输出：

                    Col1  Col2
(Sample1, Sample3)   0.5   1.0
(Sample2, Sample4)   2.0   1.5

英文:

I would use:

out = table_one.add(table_two.set_axis(table_one.index))
out.index = zip(table_one.index, table_two.index)

Generalization to an arbitrary number of tables:

from functools import reduce
tables = [table_one, table_two]
idx = list(zip(*(d.index for d in tables)))
out = reduce(lambda a, b: a.add(b), (d.set_axis(idx) for d in tables))

Output:

                    Col1  Col2
(Sample1, Sample3)   0.5   1.0
(Sample2, Sample4)   2.0   1.5

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何将两个pandas数据帧相加并保留两个索引。

问题

答案1

寻找一种方法可以多次运行一个Python脚本，同时将txt文件转换为csv。

如何计算Python集合中字符的总数

上传图像时，通过Postman使用Printify API上传图像端点出现验证错误。

如何在Jazzmin Django主题中更改注销图标？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。