Python StatsModels: ValueError: Expected frequency D. Got M
Question
I am using statsmodels.graphics to draw a month_plot from time series data in a Kaggle dataset. I have converted the data to daily mean frequency, as required for the plot. However, I get an error saying that the expected data frequency is D but the actual data frequency is M, whereas my data is already D.
import pandas as pd
from statsmodels.graphics.tsaplots import month_plot
import matplotlib.pyplot as plt
df = pd.read_csv('/kaggle/input/hourly-energy-consumption/DOM_hourly.csv')
df.set_index('Datetime', inplace=True, drop=True)
df.index = pd.to_datetime(df.index, format='%Y-%m-%d %H:%M:%S')
# drop duplicated index
df = df[~df.index.duplicated(keep='first')]
# convert df to daily mean frequency dataframe
ddf = df.resample(rule='24H', kind='interval').mean().to_period('d')
# print example dataframe ddf
#
#                 DOM_MW
# Datetime 	
# 2005-05-01 	7812.347826
# 2005-05-02 	8608.083333
# ...         ...
# 2017-12-30 	14079.125000
# 2017-12-31 	15872.833333
# Monthly plot from the Daily frequency data
plt.figure(figsize=(14,4))
month_plot(ddf)
plt.show()
Present output: As you can see above, my ddf clearly has daily frequency. But I get the following strange error, which says that my ddf data is actually M (monthly) while D (daily) is expected.
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-7-675f2911920c> in <module>
      7 
      8 plt.figure(figsize=(14,4))
----> 9 month_plot(ddf)
     10 plt.show()
ValueError: Expected frequency D. Got M
Answer 1
Score: 1
You are trying to plot daily data with month_plot. To use month_plot, you have to resample your data to monthly frequency first:
>>> help(month_plot)
...
    x : array_like
        Seasonal data to plot. If dates is None, x must be a pandas object
        with a PeriodIndex or DatetimeIndex with a **monthly frequency**.
...
month_plot(ddf.resample('M').mean())
plt.show()
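To make the frequency requirement concrete, here is a minimal sketch on synthetic data (the values, the date range, and the helper name to_monthly_mean are made up for illustration and are not part of the original dataset or of statsmodels). It builds an hourly series, takes daily means as in the question, and then aggregates to one value per calendar month so that the index carries the monthly frequency the docstring asks for. Judging from the docstring, the error message in the question seems to name the two frequencies in swapped order: the function wants M and received D.

```python
import numpy as np
import pandas as pd

# Synthetic hourly load standing in for DOM_hourly.csv; the values are made up.
hours = pd.date_range("2015-01-01", periods=2 * 365 * 24,
                      freq=pd.Timedelta(hours=1))
rng = np.random.default_rng(0)
hourly = pd.Series(rng.normal(10_000, 500, len(hours)),
                   index=hours, name="DOM_MW")


def to_monthly_mean(series):
    """Average a datetime-indexed series per calendar month.

    Hypothetical helper: the result has a PeriodIndex with monthly frequency.
    """
    return series.groupby(series.index.to_period("M")).mean()


daily = hourly.resample("D").mean()   # daily means, like ddf in the question
monthly = to_monthly_mean(daily)      # one value per calendar month

print(daily.index.freqstr)    # daily frequency: what month_plot rejects
print(monthly.index.freqstr)  # monthly frequency: what month_plot expects
```

With the real data, passing the monthly series to month_plot in place of ddf should avoid the ValueError. Grouping by to_period("M") is used here instead of resample('M') only because the spelling of the month-end resample alias differs between pandas versions; both give monthly means.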