2023年7月13日 11:17:12go评论97阅读模式

英文:

Adding Rows to Pandas Dataframe where for all values below the max value of a Column

问题

请注意以下代码的翻译：

df = pd.DataFrame({"names": ["foo",  "boo", "coo","coo"],"time": [1,4,2,3],"values": [20,10,15,12]})

我想要为每个名称在1和时间列的最大值之间插入所有可能的时间行。所以期望的数据框如下：

df = pd.DataFrame({"names": ["foo","boo","boo", "boo","boo","coo","coo","coo"],"time": [1,1,2,3,4,1,2,3],"values": [20,NaN,NaN,NaN,10,NaN,15,12]})

如何操作？

英文:

Let's consider the dataframe below -

df = pd.DataFrame({&quot;names&quot;: [&quot;foo&quot;,  &quot;boo&quot;, &quot;coo&quot;,&quot;coo&quot;],&quot;time&quot;: [1,4,2,3],&quot;values&quot;: [20,10,15,12]})

I want to insert rows for all possible time between 1 and maximum of time column for each name.
So the desired dataframe would be -

df = pd.DataFrame({&quot;names&quot;: [&quot;foo&quot;,&quot;boo&quot;,&quot;boo&quot;, &quot;boo&quot;,&quot;boo&quot;,&quot;coo&quot;,&quot;coo&quot;,&quot;coo&quot;],&quot;time&quot;: [1,1,2,3,4,1,2,3],&quot;values&quot;: [20,NaN,NaN,NaN,10,NaN,15,12]})

How to do it?

答案1

得分: 0

使用 GroupBy.apply 中的自定义函数，结合 Series.reindex，通过 range：

out = (df.set_index('time')
         .groupby('names', sort=False)['values']
         .apply(lambda x: x.reindex(range(1, x.index.max()+1)))
         .reset_index())
print(out)
  names  time  values
0   foo     1    20.0
1   boo     1     NaN
2   boo     2     NaN
3   boo     3     NaN
4   boo     4    10.0
5   coo     1     NaN
6   coo     2    15.0
7   coo     3    12.0

英文:

Use custom function in GroupBy.apply with Series.reindex by range:

out = (df.set_index(&#39;time&#39;)
         .groupby(&#39;names&#39;, sort=False)[&#39;values&#39;]
         .apply(lambda x: x.reindex(range(1, x.index.max()+1)))
         .reset_index())
print (out)
  names  time  values
0   foo     1    20.0
1   boo     1     NaN
2   boo     2     NaN
3   boo     3     NaN
4   boo     4    10.0
5   coo     1     NaN
6   coo     2    15.0
7   coo     3    12.0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

向Pandas数据帧添加行，其中所有值均低于某一列的最大值。

问题

答案1

如何遍历列表以生成URL的元素。

如何在单击时清除QLineEdit的默认文本？

如何在PyCharm的交互式调试器中评估协程

Python问题涉及到库。我遇到了一些错误。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。