2023年7月28日 05:57:00go评论98阅读模式

英文:

Check 3 columns in 3 different dataframes and create a new column

问题

我有三个类似这样的数据框：

ID	Name
First	row
Second	row

所有的数据框都有一个共同的ID列。有一个主数据框，其他两个需要与之进行比较。
我想要实现的逻辑是：
1）如果第二个数据框的ID与第一个匹配，则创建一个列并给它一个常量'A'的值。
2）如果第三个数据框的ID与第一个匹配，在相同的列中给它一个常量'B'的值。
3）如果它在其中一个中都没有出现，在相同的列中给它一个常量'I'的值。

我正在尝试使其更高效，而不是创建两个for循环。任何帮助将不胜感激！

英文:

I have three dataframes like this

ID	Name
First	row
Second	row

All the dataframe have ID column in common. There is a main dataframe against which the other two needs to be compared.
Logic I want to implement:

if ID's of second dataframe matches with the first, then create a column and give it a constant 'A' value.
if ID's of third dataframe matches with the first, in the same column give it a constant 'B' value.
if it doesn't appear in either of them,in the same column give it a constant 'I'

I am trying to make it efficient instead of creating two for loops. Any help will be appreciated!

答案1

得分: 1

以下是您想要的内容：

import pandas as pd
df1 = pd.DataFrame({
    'ID': [1, 2, 3],
    'name': [
        "阅读文档",
        "在提问之前检查是否已有相同问题",
        "创建一个MRE"
    ]
})
df2 = pd.DataFrame({
    'ID': [1],
    'url': ["http://pandas.pydata.org/docs/"]
})
df3 = pd.DataFrame({
    'ID': [2],
    'url': ["https://stackoverflow.com/search"]
})
df1['new_col'] = df1['ID'].apply(lambda x: 'A' if x in df2['ID'].values else 'B' if x in df3['ID'].values else 'I')
# 输出:
   ID                              name new_col
0   1                            阅读文档       A
1   2  在提问之前检查是否已有相同问题               B
2   3                            创建一个MRE       I

英文:

Here is what it appears you want:

import pandas as pd
df1 = pd.DataFrame({
    &#39;ID&#39;: [1, 2, 3],
    &#39;name&#39;: [
        &quot;read the documentation&quot;,
        &quot;check your question hasn&#39;t been asked before&quot;,
        &quot;create an MRE&quot;
    ]
})
df2 = pd.DataFrame({
    &#39;ID&#39;: [1],
    &#39;url&#39;: [&quot;http://pandas.pydata.org/docs/&quot;]
})
df3 = pd.DataFrame({
    &#39;ID&#39;: [2],
    &#39;url&#39;: [&quot;https://stackoverflow.com/search&quot;]
})
df1[&#39;new_col&#39;] = df1[&#39;ID&#39;].apply(lambda x: &#39;A&#39; if x in df2[&#39;ID&#39;].values else &#39;B&#39; if x in df3[&#39;ID&#39;].values else &#39;I&#39;)
# Output:
   ID                                          name new_col
0   1                        read the documentation       A
1   2  check your question hasn&#39;t been asked before       B
2   3                                 create an MRE       I

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

检查3个不同数据框中的3列，并创建一个新列。

问题

答案1

Raspberry Pi 没有找到名为 Plotly 的模块。

使用R或Python计算具有路由优先级的有向图中顶点的累积和。

`pytz` 目前或将来在Python中被弃用吗？

Pandas groupby(pd.Grouper) is throwing error for datetime but im running it on a datetime object

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。