January 3, 2020 22:56:59

How to swap rows in 2 pandas dataframes which contain lists and have different size?

Question

You can achieve this in Python using the Pandas library. Because df1 and df2 share no index labels (0, 2, 4, ... versus 1, 3, 5, ...), the rows have to be paired by position rather than by index. Here's the code to swap the second element of each row of df1 with the second element of the row at the same position in df2, allowing for the differing number of rows:

import pandas as pd
# Assumes df1 and df2 already exist, each with a 'Path' column of lists
# Only the first n rows have a counterpart in the other dataframe
n = min(len(df1), len(df2))
# Copy every list so the original dataframes are left untouched
paths1 = [list(p) for p in df1['Path']]
paths2 = [list(p) for p in df2['Path']]
# Swap the second element of each positionally paired row
for i in range(n):
    paths1[i][1], paths2[i][1] = paths2[i][1], paths1[i][1]
# Rebuild the dataframes, keeping the original index labels
updated_df1 = pd.Series(paths1, index=df1.index).to_frame('Path')
updated_df2 = pd.Series(paths2, index=df2.index).to_frame('Path')
# The swapped results are now in 'updated_df1' and 'updated_df2'
print(updated_df1)
print(updated_df2)

This code gives the desired output with the second elements swapped in both dataframes, while leaving unchanged any row that has no counterpart in the other dataframe.

Original question (English):

I have two Pandas dataframes, df1 and df2. Each dataframe has one column named 'Path'. Each row has a list. They are like this:

df1

Path
[OAK, ORD, FLL, PBG]
[OAK, SEA, FLL, PBG]
[OAK, AUS, FLL, PBG]
[OAK, LAS, FLL, PBG]
[OAK, LAX, FLL, PBG]
[OAK, DAL, FLL, PBG]
[OAK, MDW, FLL, PBG]
[OAK, BWI, FLL, PBG]

The df1 constructor is:

{'Path': {0: ['OAK', 'ORD', 'FLL', 'PBG'], 2: ['OAK', 'SEA', 'FLL', 'PBG'], 4: ['OAK', 'AUS', 'FLL', 'PBG'], 6: ['OAK', 'LAS', 'FLL', 'PBG'], 8: ['OAK', 'LAX', 'FLL', 'PBG'], 10: ['OAK', 'DAL', 'FLL', 'PBG'], 12: ['OAK', 'MDW', 'FLL', 'PBG'], 14: ['OAK', 'BWI', 'FLL', 'PBG']}}

df2

Path
[OAK, DFW, FLL, PBG]
[OAK, JFK, FLL, PBG]
[OAK, MCI, FLL, PBG]
[OAK, PHX, FLL, PBG]
[OAK, DEN, FLL, PBG]
[OAK, HOU, FLL, PBG]
[OAK, ATL, FLL, PBG]

The df2 constructor is:

{'Path': {1: ['OAK', 'DFW', 'FLL', 'PBG'], 3: ['OAK', 'JFK', 'FLL', 'PBG'], 5: ['OAK', 'MCI', 'FLL', 'PBG'], 7: ['OAK', 'PHX', 'FLL', 'PBG'], 9: ['OAK', 'DEN', 'FLL', 'PBG'], 11: ['OAK', 'HOU', 'FLL', 'PBG'], 13: ['OAK', 'ATL', 'FLL', 'PBG']}}

One problem is that I have a different number of rows in my dataframes. I would like to swap the second element of each row of df1 with the second element of each row of df2. If there is no corresponding row, the row should not be modified or dropped. The desired output is:

df1

Path
[OAK, DFW, FLL, PBG]
[OAK, JFK, FLL, PBG]
[OAK, MCI, FLL, PBG]
[OAK, PHX, FLL, PBG]
[OAK, DEN, FLL, PBG]
[OAK, HOU, FLL, PBG]
[OAK, ATL, FLL, PBG]
[OAK, BWI, FLL, PBG]

df2

Path
[OAK, ORD, FLL, PBG]
[OAK, SEA, FLL, PBG]
[OAK, AUS, FLL, PBG]
[OAK, LAS, FLL, PBG]
[OAK, LAX, FLL, PBG]
[OAK, DAL, FLL, PBG]
[OAK, MDW, FLL, PBG]

How can I do it in Python?

Answer 1

Score: 2

You can use combine_first() here after converting the series of lists into a dataframe:

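import pandas as pd

# Expand each list into its own columns (0..3); rows get a fresh 0..n-1 index
n = pd.DataFrame(df2['Path'].tolist())
m = pd.DataFrame(df1['Path'].tolist())
# Take column 1 from the other frame and fill the remaining cells from this one;
# dropna() discards rows left incomplete because they exist only in the longer frame
df1_final = n[[1]].combine_first(m).dropna().agg(list, axis=1)
df2_final = m[[1]].combine_first(n).dropna().agg(list, axis=1)

print(df1_final)
print('\n')
print(df2_final)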

The result is:

0    [OAK, DFW, FLL, PBG]
1    [OAK, JFK, FLL, PBG]
2    [OAK, MCI, FLL, PBG]
3    [OAK, PHX, FLL, PBG]
4    [OAK, DEN, FLL, PBG]
5    [OAK, HOU, FLL, PBG]
6    [OAK, ATL, FLL, PBG]
7    [OAK, BWI, FLL, PBG]
dtype: object

0    [OAK, ORD, FLL, PBG]
1    [OAK, SEA, FLL, PBG]
2    [OAK, AUS, FLL, PBG]
3    [OAK, LAS, FLL, PBG]
4    [OAK, LAX, FLL, PBG]
5    [OAK, DAL, FLL, PBG]
6    [OAK, MDW, FLL, PBG]
dtype: object
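Note that the combine_first() approach returns results with a fresh 0..n-1 index rather than the original labels. If the original index labels matter, a minimal sketch along the following lines may help. It pairs rows by position and reattaches each dataframe's own index. The helper name `swap_element` and its parameters `col` and `pos` are hypothetical, introduced only for illustration, and the sample data is a shortened copy of the constructors from the question.

```python
import pandas as pd

# Shortened sample data taken from the question's constructors (assumption: 3 and 2 rows).
df1 = pd.DataFrame({'Path': {0: ['OAK', 'ORD', 'FLL', 'PBG'],
                             2: ['OAK', 'SEA', 'FLL', 'PBG'],
                             4: ['OAK', 'AUS', 'FLL', 'PBG']}})
df2 = pd.DataFrame({'Path': {1: ['OAK', 'DFW', 'FLL', 'PBG'],
                             3: ['OAK', 'JFK', 'FLL', 'PBG']}})


def swap_element(left, right, col='Path', pos=1):
    """Hypothetical helper: swap element `pos` of the lists in `col`, pairing rows by position."""
    # Copy each list so the input dataframes are not mutated.
    a = [list(p) for p in left[col]]
    b = [list(p) for p in right[col]]
    # zip() stops at the shorter frame, so unmatched trailing rows stay unchanged.
    for x, y in zip(a, b):
        x[pos], y[pos] = y[pos], x[pos]
    # Reattach the original index labels of each frame.
    return (pd.Series(a, index=left.index, name=col).to_frame(),
            pd.Series(b, index=right.index, name=col).to_frame())


new_df1, new_df2 = swap_element(df1, df2)
print(new_df1)
print(new_df2)
```

Here the third row of df1 has no partner in df2, so it is returned as it was, and neither input dataframe is modified.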
