2023年5月11日 15:37:28go评论99阅读模式

英文:

pd.fillna(pd.Series()) can't fill all NaN values

问题

我理解你的问题。问题在于你使用了不同长度的filler来填充df2.month列，这可能导致一些NaN值保留。要确保没有NaN值，你可以按以下方式更改代码：

from numpy.random import default_rng
rng = default_rng()
filler = rng.choice(len(df2), size=len(df2), replace=False)
filler = pd.Series(-abs(filler))
df2.month = df2.month.fillna(filler).astype(int)
df2

这会根据df2的长度生成相同长度的filler，并将NaN值填充为整数，确保没有NaN值在输出中。

英文:

I want to fill the NaNs in a dataframe with random values:

df1 = pd.DataFrame(list(zip([&#39;0001&#39;, &#39;0001&#39;, &#39;0002&#39;, &#39;0003&#39;, &#39;0004&#39;, &#39;0004&#39;],
                            [&#39;a&#39;, &#39;b&#39;, &#39;a&#39;, &#39;b&#39;, &#39;a&#39;, &#39;b&#39;],
                           [&#39;USA&#39;, &#39;USA&#39;, &#39;USA&#39;, &#39;USA&#39;, &#39;USA&#39;, &#39;USA&#39;],
                           [np.nan, np.nan, &#39;Jan&#39;, np.nan, np.nan, &#39;Jan&#39;],
                           [1,2,3,4,5,6])),
                    columns=[&#39;sample ID&#39;, &#39;compound&#39;, &#39;country&#39;, &#39;month&#39;, &#39;value&#39;])
df1

Out:

	sample ID	compound	country	month	value
0	0001	      a	          USA	NaN	     1
1	0001	      b	          USA	NaN	     2
2	0002	      a	          USA	Jan	     3
3	0003	      b	          USA	NaN	     4
4	0004	      a	          USA	NaN	     5 
5	0004	      b	          USA	Jan	     6

I slice the database based on the compound column:

df2 = df1.loc[df1.compound == &#39;a&#39;]
df2

Out:

  sample ID	 compound	country	month	value
0	0001	  a	          USA	NaN	     1
2	0002	  a	          USA	Jan	     3
4	0004	  a           USA	NaN	     5

Then I tried to fillna with non-repeated values using filler:

from numpy.random import default_rng
rng = default_rng()
filler = rng.choice(len(df2.month), size=len(df2.month), replace=False)
filler = pd.Series(-abs(filler))
df2.month.fillna(filler, inplace=True)
df2

Out:

   sample ID	compound	country	month	value
0	0001	       a	     USA	-1.0	1
2	0002	       a	     USA	Jan	    3
4	0004	       a	     USA	NaN	    5

I expected no NaN in the out but actually not, Why?

答案1

得分: 3

问题是，您的 filler 索引与 df2 不同，因为 df2 是通过布尔索引是 df1 的一部分，您可以执行以下操作：

filler = pd.Series(-abs(filler)).set_axis(df2.index)
df2['month'].fillna(filler, inplace=True)

英文:

Problem is that your filler index is different from df2, since df2 is part of df1 by boolean indexing, you can do

filler = pd.Series(-abs(filler)).set_axis(df2.index)
df2[&#39;month&#39;].fillna(filler, inplace=True)

答案2

得分: 1

以下是您提供的内容的中文翻译：

**示例**
s1 = pd.Series([1, 2, None])
s2 = pd.Series([3, 4, 5], index=list('abc'))
运行以下代码
s1.fillna(s2)
输出：
0    1.0
1    2.0
2    NaN
fillna 不能填充不同索引的 NaN
这未必一定是问题的原因。如果这不能解决问题，不要仅仅发布您的代码和目标，创建并提供一个代表您的数据集的最小示例。
https://stackoverflow.com/help/minimal-reproducible-example

英文:

Example

s1 = pd.Series([1, 2, None])
s2 = pd.Series([3, 4, 5], index=list(&#39;abc&#39;))

Run below code

s1.fillna(s2)

output:

0    1.0
1    2.0
2    NaN

fillna cant fill NaN of different index

This may not necessarily be the reason. if this cant solve problem, don't just post your code and goals, create and provide a minimal example representing your dataset.

https://stackoverflow.com/help/minimal-reproducible-example

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

`pd.fillna(pd.Series())` 无法填充所有的 NaN 值。

问题

答案1

答案2

两个数组的词嵌入余弦相似度

不要将重复的值添加到Python中的二叉树（非二叉搜索树）。

Python递归调用开销 – 在达到setrecursionlimit指定限制之前的RecursionError结果

如何在AWS CDK中按标签名称查找NAT网关

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

发表评论