2023年3月7日 10:23:48go评论91阅读模式

英文:

Filling null with data flitering by other columns

问题

你好，我有一个问题，如何根据其他列来使用fillna()填充列。例如，如果在"Cabin"和"Destination"列中有缺失值，我想使用在"Last Name"列中具有相同值的行的相同值来填充这两列中的空值。

用其他列数据筛选来填充空值

我不知道如何让这个工作。

英文:

Hi guys I have a question, how can I fillna() on columns filtering by other columns. For example, if I have missing values in "Cabin" and "Destination", I want to fill those null values in those 2 columns by using the same values of a row that have the same value in column "Last Name"

用其他列数据筛选来填充空值

I have no idea how to make this work

答案1

得分: 1

这种方法也有效：

import pandas as pd
import numpy as np

df = pd.DataFrame({
    "A": ["a1", np.nan],
    "B": ["b1", "b1"]
})

df_drop = df.dropna()

df["A"] = df["A"].fillna(
    pd.Series(df["B"].values, index=df.index)
        .replace(df_drop.set_index("B")["A"])
)

英文:

This way also works:

import pandas as pd
import numpy as np

df = pd.DataFrame({
    &quot;A&quot;: [&quot;a1&quot;, np.nan],
    &quot;B&quot;: [&quot;b1&quot;, &quot;b1&quot;]
})

df_drop = df.dropna()

df[&quot;A&quot;] = df[&quot;A&quot;].fillna(
    pd.Series(df[&quot;B&quot;].values, index=df.index)
        .replace(df_drop.set_index(&quot;B&quot;)[&quot;A&quot;])
)

答案2

得分: 0

If missing values in "A", and used "B" column to fillna.

One method is to use "mapping", see:

df = pd.DataFrame({
    "A": ["a1", np.nan],
    "B": ["b1", "b1"]
})

df_drop = df.dropna()

df["A"] = df["A"].fillna(df["B"].map(dict(zip(df_drop["B"], df_drop["A"])))

Hope someone can improve this code or propose a better method.

英文:

If missing values in "A", and used "B" column to fillna.

One method is use "mapping", see:

df = pd.DataFrame({
    &quot;A&quot;: [&quot;a1&quot;, np.nan],
    &quot;B&quot;: [&quot;b1&quot;, &quot;b1&quot;]
})

df_drop = df.dropna()

df[&quot;A&quot;] = df[&quot;A&quot;].fillna(df[&quot;B&quot;].map(dict(zip(df_drop[&quot;B&quot;], df_drop[&quot;A&quot;]))))

Hope someone can improve this code or propose better method.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

用其他列数据筛选来填充空值

问题

答案1

答案2

Python的fillna方法添加.0

如何在groupby的DataFrame中应用带条件的ffill fillna()。

如何同时填充几列中的缺失数值

如何使用另一个数组作为参考来填充一个数组为0（或NaN）？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论