March 9, 2023, 14:15:10

Troubleshooting a pd.to_timedelta calculation failure

Question

I have previously used the following conditional statement successfully:

for RxDat in df2:
    condition = (df['Tdate'] > RxDat - pd.to_timedelta(46, unit="D")) & (df['Tdate'] < RxDat)

Now I am getting the following error:

> TypeError: unsupported operand type(s) for -: 'str' and 'Timedelta'

I have extracted the following data to illustrate the error:

df['Tdate'] contains

[Timestamp('2004-08-25 00:00:00'), Timestamp('2004-10-13 00:00:00'), Timestamp('2004-12-13 00:00:00'), Timestamp('2005-02-21 00:00:00'), Timestamp('2005-04-28 00:00:00'), Timestamp('2005-08-24 00:00:00')]

df2['RxDate'] contains

[Timestamp('2004-08-20 00:00:00'), Timestamp('2004-08-23 00:00:00'), Timestamp('2004-08-18 00:00:00'), Timestamp('2004-08-15 00:00:00'), Timestamp('2004-08-12 00:00:00'), Timestamp('2004-08-13 00:00:00')]

I have tried looking at this a few ways and cannot see why I get the error.

Answer 1

Score: 3

If you loop over df2 directly, RxDat takes the column names (strings), not the dates:

for RxDat in df2:

Use:

for RxDat in df2['RxDate']:
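
To see the difference concretely, here is a minimal sketch that rebuilds both frames from the data in the question. The construction with pd.to_datetime and the names labels, window, and conditions are assumptions made for illustration, not part of the original code.

```python
import pandas as pd

# Frames rebuilt from the values shown in the question (assumed layout:
# one datetime column per frame).
df = pd.DataFrame({"Tdate": pd.to_datetime([
    "2004-08-25", "2004-10-13", "2004-12-13",
    "2005-02-21", "2005-04-28", "2005-08-24",
])})
df2 = pd.DataFrame({"RxDate": pd.to_datetime([
    "2004-08-20", "2004-08-23", "2004-08-18",
    "2004-08-15", "2004-08-12", "2004-08-13",
])})

# Iterating over the DataFrame itself yields column labels (strings),
# which is why `RxDat - Timedelta` raised the TypeError.
labels = [col for col in df2]

# Iterating over the column yields Timestamp values, so the subtraction works.
window = pd.to_timedelta(46, unit="D")
conditions = []
for RxDat in df2["RxDate"]:
    condition = (df["Tdate"] > RxDat - window) & (df["Tdate"] < RxDat)
    conditions.append(condition)
```

With this particular sample every mask is entirely False, because all Tdate values are later than all RxDate values; the point is only that the loop now runs without the TypeError.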

Non-loop solution with broadcasting; the output is a 2D NumPy array:

a = df['Tdate'].to_numpy()[:, None]
b = df2['RxDate'].sub(pd.to_timedelta(46, unit="D")).to_numpy()
c = df2['RxDate'].to_numpy()

condition = (a > b) & (a < c)
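
The layout of the broadcast result may be easier to follow on a small example. The sketch below uses the first three Tdate values and the first RxDate value from the question, plus two hypothetical RxDate values (2004-09-01 and 2004-11-01) added only so that some comparisons come out True; the name matched is likewise an assumption.

```python
import pandas as pd

df = pd.DataFrame({"Tdate": pd.to_datetime(
    ["2004-08-25", "2004-10-13", "2004-12-13"]
)})
# The last two dates are hypothetical, chosen to produce matches.
df2 = pd.DataFrame({"RxDate": pd.to_datetime(
    ["2004-08-20", "2004-09-01", "2004-11-01"]
)})

# Column vector of Tdate values, shape (3, 1).
a = df["Tdate"].to_numpy()[:, None]
# Window start (RxDate minus 46 days) and window end, each shape (3,).
b = df2["RxDate"].sub(pd.to_timedelta(46, unit="D")).to_numpy()
c = df2["RxDate"].to_numpy()

# Broadcasting (3, 1) against (3,) gives a (3, 3) boolean array:
# condition[i, j] is True when Tdate i lies strictly inside window j.
condition = (a > b) & (a < c)

# Rows of df whose Tdate falls inside at least one window.
matched = condition.any(axis=1)
```

Each row of condition corresponds to a row of df and each column to a row of df2, so reducing with any(axis=1) gives one flag per Tdate, while any(axis=0) would give one flag per RxDate.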
