2023年6月15日 14:19:58go评论112阅读模式

英文:

Pandas groupby(pd.Grouper) is throwing error for datetime but im running it on a datetime object

问题

以下是您要翻译的代码部分：

I'm using pandas in python, and am trying to group a set of dates by month, and determine the highest value in the dates_and_grades["Grade_Values"] column for each month. I wrote the following code attempting to do this:
data = pd.read_csv(input_filepath)
data['Date'] = pd.to_datetime(data['Date'], format='ISO8601')
roped = ["Sport", "Trad"]
YDS_DICT={"N/A":"N/A",'3-4':0,'5':1,'5.0':1,'5.1':2,'5.2':3,'5.3':4,'5.4':5,
      '5.5':6,'5.6':7,'5.7':8,'5.8':9,'5.9':10,
      '5.10a':11,'5.10b':12, '5.10': 12, '5.10c':13,'5.10d':14,
      '5.11a':15,'5.11b':16, '5.11':16, '5.11c':17,'5.11d':18,
      '5.12a':19,'5.12b':20,'5.12c':21,'5.12d':22,
      '5.13a':23,'5.13b':24,'5.13c':25,'5.13d':26,
      '5.14a':27,'5.14b':28,'5.14c':29,'5.14d':30,
      '5.15a':31,'5.15b':32,'5.15c':33,'5.15d':34}
roped_only_naive = data.loc[data['Route Type'].isin(roped)].copy()
roped_only_naive["Rating"] = roped_only_naive['Rating'].map(slash_grade_converter)
roped_only_naive["Rating"] = roped_only_naive['Rating'].map(flatten_plus_and_minus_grades)
roped_only_naive["Rating"] = roped_only_naive['Rating'].map(remove_risk_ratings)
dates_and_grades = roped_only_naive[['Date', 'Rating']]
print(dates_and_grades.dtypes)
dates_and_grades["Grade_Values"] = dates_and_grades["Rating"].map(lambda data: YDS_DICT[data])
print(dates_and_grades.dtypes)
dates_and_grades['Date'] = dates_and_grades['Date'].groupby(pd.Grouper(freq='M'))
print(dates_and_grades)
However, I get the following error when run. 
TypeError: Only valid with DatetimeIndex, TimedeltaIndex or PeriodIndex, but got an instance of 'Index'
What is strange is that when I check the types on my dataframe using 
print(dates_and_grades.dtypes)
I get the following printout
Date            datetime64[ns]
Rating                  object
Grade_Values             int64
So it looks like my Date column is indeed a datetime object. 
My question is then, why doesn't the groupby(pd.Grouper(freq='M')) function work on my dates_and_grades['Date'] column if it does seem like dates_and_grades['Date'] is actually a datetime type?

英文:

I'm using pandas in python, and am trying to group a set of dates by month, and determine the highest value in the dates_and_grades["Grade_Values"] column for each month. I wrote the following code attempting to do this:

data = pd.read_csv(input_filepath)
data[&#39;Date&#39;] = pd.to_datetime(data[&#39;Date&#39;], format = &#39;ISO8601&#39;)
roped = [&quot;Sport&quot;, &quot;Trad&quot;]
YDS_DICT={&quot;N/A&quot;:&quot;N/A&quot;,&#39;3-4&#39;:0,&#39;5&#39;:1,&#39;5.0&#39;:1,&#39;5.1&#39;:2,&#39;5.2&#39;:3,&#39;5.3&#39;:4,&#39;5.4&#39;:5,
&#39;5.5&#39;:6,&#39;5.6&#39;:7,&#39;5.7&#39;:8,&#39;5.8&#39;:9,&#39;5.9&#39;:10,
&#39;5.10a&#39;:11,&#39;5.10b&#39;:12, &#39;5.10&#39;: 12, &#39;5.10c&#39;:13,&#39;5.10d&#39;:14,
&#39;5.11a&#39;:15,&#39;5.11b&#39;:16, &#39;5.11&#39;:16, &#39;5.11c&#39;:17,&#39;5.11d&#39;:18,
&#39;5.12a&#39;:19,&#39;5.12b&#39;:20,&#39;5.12c&#39;:21,&#39;5.12d&#39;:22,
&#39;5.13a&#39;:23,&#39;5.13b&#39;:24,&#39;5.13c&#39;:25,&#39;5.13d&#39;:26,
&#39;5.14a&#39;:27,&#39;5.14b&#39;:28,&#39;5.14c&#39;:29,&#39;5.14d&#39;:30,
&#39;5.15a&#39;:31,&#39;5.15b&#39;:32,&#39;5.15c&#39;:33,&#39;5.15d&#39;:34}
roped_only_naive = data.loc[data[&#39;Route Type&#39;].isin(roped)].copy()
roped_only_naive[&quot;Rating&quot;] = roped_only_naive[&#39;Rating&#39;].map(slash_grade_converter)
roped_only_naive[&quot;Rating&quot;] = roped_only_naive[&#39;Rating&#39;].map(flatten_plus_and_minus_grades)
roped_only_naive[&quot;Rating&quot;] = roped_only_naive[&#39;Rating&#39;].map(remove_risk_ratings)
dates_and_grades = roped_only_naive[[&#39;Date&#39;, &#39;Rating&#39;]]
print(dates_and_grades.dtypes)
dates_and_grades[&quot;Grade_Values&quot;] = dates_and_grades[&quot;Rating&quot;].map(lambda data: YDS_DICT[data])
print(dates_and_grades.dtypes)
dates_and_grades[&#39;Date&#39;] = dates_and_grades[&#39;Date&#39;].groupby(pd.Grouper(freq=&#39;M&#39;))
print(dates_and_grades)

However, I get the following error when run.

TypeError: Only valid with DatetimeIndex, TimedeltaIndex or PeriodIndex, but got an instance of &#39;Index&#39;

What is strange is that when I check the types on my dataframe using

print(dates_and_grades.dtypes)

I get the following printout

Date            datetime64[ns]
Rating                  object
Grade_Values             int64

So it looks like my Date column is indeed a datetime object.

My question is then, why doesn't the groupby(pd.Grouper(freq='M')) function work on my dates_and_grades['Date'] column if it does seem like dates_and_grades['Date'] is actually a datetime type?

答案1

得分: 1

在Grouper中使用参数 key，如果不使用一些转换函数，则不能将groupby的输出分配给新列 - 对于列：

dates_and_grades['new'] = dates_and_grades.groupby(pd.Grouper(freq='M', key='Date'))["Grade_Values"].transform('max')

如果省略 key 参数，Grouper 需要 DatetimeIndex，所以会产生错误。

如果需要每月按最大Grade Values获取行，请使用以下方法：

out = dates_and_grades.loc[dates_and_grades.groupby(pd.Grouper(freq='M', key='Date'))["Grade_Values"].idxmax()]

英文:

Use parameter key in Grouper, also cannot assign ouput of groupby to new column if not used some transformation function - for column :

dates_and_grades[&#39;new&#39;] = dates_and_grades.groupby(pd.Grouper(freq=&#39;M&#39;, key=&#39;Date&#39;))[&quot;Grade_Values&quot;].transform(&#39;max&#39;)

If omit key parameter Grouper need DatetimeIndex, so error is expected.

If need rows by maximal Grade Values per months use:

out = dates_and_grades.loc[dates_and_grades.groupby(pd.Grouper(freq=&#39;M&#39;, key=&#39;Date&#39;))[&quot;Grade_Values&quot;].idxmax())

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Pandas groupby(pd.Grouper) is throwing error for datetime but im running it on a datetime object

问题

答案1

有没有一种方法可以将文件路径字段转换为原地解析的模型？

有关Python中的管道操作符是否有任何PEP？

R – 如何将汇总的行结果放入列中

检查时间是否位于两个时间之间。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。