删除基于有效数据百分比的 Pandas 行

huangapple

117266
文章

0
评论

2023年2月6日 09:50:03go评论94阅读模式

英文:

Drop pandas rows based on percentage of valid data

问题

我有一个类似这样的pandas数据帧

Date_Time	level
2018-02-12 13:22:27	5
2018-02-12 13:17:27	7
2018-02-12 13:12:27	2
2018-02-12 13:07:27	6
2018-02-13 13:12:27	4
2018-02-13 13:17:27	5

如何使特定日期的条目少于3个时将其删除，即自2018-03-13起，删除<4个条目，并获取此表

Date_Time	level
2018-02-12 13:22:27	5
2018-02-12 13:17:27	7
2018-02-12 13:12:27	2
2018-02-12 13:07:27	6

我尝试使用for循环，但运行时间太长。

英文:

I have a pandas data frame that looks like this

Date_Time	level
2018-02-12 13:22:27	5
2018-02-12 13:17:27	7
2018-02-12 13:12:27	2
2018-02-12 13:07:27	6
2018-02-13 13:12:27	4
2018-02-13 13:17:27	5

How do I make it so If there is less than 3 entries on a specific date they get removed
i.e since 2018-03-13 < 4 entries remove them and get this table

Date_Time	level
2018-02-12 13:22:27	5
2018-02-12 13:17:27	7
2018-02-12 13:12:27	2
2018-02-12 13:07:27	6

I tried using a for loop but that takes too long to run

答案1

得分: 0

你可以使用 groupby 和 transform 来进行 count 操作，然后使用 ge 来获取你想要的行：

df[df.groupby(df['Date_Time'].dt.date)['Date_Time'].transform('count').ge(4)]

英文:

You can do groupby and transform with count and then use ge to get the rows you wanted:

df[df.groupby(df[&#39;Date_Time&#39;].dt.date)[&#39;Date_Time&#39;].transform(&#39;count&#39;).ge(4)]

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2023年2月6日 09:50:03
转载请务必保留本文链接：https://go.coder-hub.com/75356718.html

pandas
python

在Plotly中在线间隙添加注释。

go 143 04/20

CryptGenRandom使用我的处理器中的RNG吗？

go 119 07/18

关于缺失的分页元素，需要一些爬取指导。

go 134 05/28

Is there a way to reshape a single index pandas DataFrame into a multi index to adapt to time series?

go 99 02/08

删除基于有效数据百分比的 Pandas 行

问题

答案1

在Plotly中在线间隙添加注释。

CryptGenRandom使用我的处理器中的RNG吗？

关于缺失的分页元素，需要一些爬取指导。

Is there a way to reshape a single index pandas DataFrame into a multi index to adapt to time series?

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。