如何根据特定值对数据框中的数据进行分组？

huangapple

117266
文章

0
评论

2020年1月6日 02:46:19go评论103阅读模式

英文:

How do i group data of a dataframe based on certain value?

问题

我想对我的数据框进行分组，以便将时间戳列中具有相同小时的行（该列包含数据，例如2019-01-01 00:00:00.134721167,50,100，其中50是成本，100是百分比）的成本进行求和并计算平均值，以及百分比。

或者更具体地说，我需要为2天的信息生成48行，每小时一行，而现在我有超过500行。我该如何做到这一点？

英文:

I want to group my dataframe so that the rows with the same hour from timestamp column (which has data like 2019-01-01 00:00:00.134721167,50,100 where 50 is the cost, and 100 is percentage) have their cost summed and averaged, as well as percentage.

Or, to be more specific, i need to have 48 rows for 2 days of information, one for each hour, while now i have more than 500 rows. How do I do that?

答案1

得分: 1

以下是已翻译的内容：

这里有一种方法可以做到：

# 样本数据
df = pd.DataFrame({'date': pd.date_range("2019-01-01", freq='H', periods=10),
                   'cost': pd.np.random.randint(10, 100, 10)})

方法 1:

df.set_index('date').resample('H').sum()

方法 2:

df.groupby(pd.Grouper(key='date', freq='H'))['cost'].sum().reset_index()

英文:

Here's a way to do:

# sample data
df = pd.DataFrame({&#39;date&#39;: pd.date_range(&quot;2019-01-01&quot;, freq=&#39;H&#39;, periods = 10),
                  &#39;cost&#39;: pd.np.random.randint(10, 100, 10)})

Method 1:

df.set_index(&#39;date&#39;).resample(&#39;H&#39;).sum()

Method 2:

df.groupby(pd.Grouper(key=&#39;date&#39;, freq=&#39;H&#39;))[&#39;cost&#39;].sum().reset_index()

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2020年1月6日 02:46:19
转载请务必保留本文链接：https://go.coder-hub.com/59603093.html

dataframe
pandas
python

如何根据特定值对数据框中的数据进行分组？

问题

答案1

为什么我在使用pip时一直收到这些错误，该如何解决？

The INCRBY指令在Redis管道中执行时返回对管道的引用，而不是修改的键的值。

“`python pd.DataFrame 如何计算 mean()，同时忽略某些单元格中的 ‘NA’ 字符串 “`

数据停止推送到BigQuery。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。