2023年2月6日 17:44:04go评论110阅读模式

英文:

df.apply(hurst_function) gave TypeError: must be real number, not tuple in, Python

问题

以下是您要翻译的部分：

"I have a column in form of a data-frame that contains the ratio of some numbers.
On that df col, I want to apply hurst function using df.apply() method.

I don't know if the error is with the df.apply or with the hurst_function.
Consider the code which calculates hurst exponent on a col using the df.apply method:

import hurst 
def hurst_function(df_col_slice):
    display(df_col_slice)
    return hurst.compute_Hc(df_col_slice)
def func(df_col):
    
    results = round(df_col.rolling(101).apply(hurst_function)[100:],1)
    return results
func(df_col)

I get the error:

Input In [73], in func(df_col)
---&gt; 32     results = round(df_col.rolling(101).apply(hurst_function)[100:],1)
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:1843, in Rolling.apply(self, func, raw, engine, engine_kwargs, args, kwargs)
   1822 @doc(
   1823     template_header,
   1824     create_section_header(&quot;Parameters&quot;),
   (...)
... (中间部分省略)
TypeError: must be real number, not tuple

What can I do to solve this?

Edit: display(df_col_slice) is giving the following output:

0      0.282043
1      0.103355
2      0.537766
3      0.491976
4      0.535050
         ...   
96     0.022696
97     0.438995
98    -0.131486
99     0.248250
100    1.246463
Length: 101, dtype: float64

英文:

I have a column in form of a data-frame that contains the ratio of some numbers.
On that df col, I want to apply hurst function using df.apply() method.

I don't know if the error is with the df.apply or with the hurst_function.
Consider the code which calculates hurst exponent on a col using the df.apply method:

import hurst 
def hurst_function(df_col_slice):
    display(df_col_slice)
    return hurst.compute_Hc(df_col_slice)
def func(df_col):
    
    results = round(df_col.rolling(101).apply(hurst_function)[100:],1)
    return results
func(df_col)

I get the error:

Input In [73], in func(df_col)
---&gt; 32     results = round(df_col.rolling(101).apply(hurst_function)[100:],1)
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:1843, in Rolling.apply(self, func, raw, engine, engine_kwargs, args, kwargs)
   1822 @doc(
   1823     template_header,
   1824     create_section_header(&quot;Parameters&quot;),
   (...)
   1841     kwargs: dict[str, Any] | None = None,
   1842 ):
-&gt; 1843     return super().apply(
   1844         func,
   1845         raw=raw,
   1846         engine=engine,
   1847         engine_kwargs=engine_kwargs,
   1848         args=args,
   1849         kwargs=kwargs,
   1850     )
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:1315, in RollingAndExpandingMixin.apply(self, func, raw, engine, engine_kwargs, args, kwargs)
   1312 else:
   1313     raise ValueError(&quot;engine must be either &#39;numba&#39; or &#39;cython&#39;&quot;)
-&gt; 1315 return self._apply(
   1316     apply_func,
   1317     numba_cache_key=numba_cache_key,
   1318     numba_args=numba_args,
   1319 )
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:590, in BaseWindow._apply(self, func, name, numba_cache_key, numba_args, **kwargs)
    587     return result
    589 if self.method == &quot;single&quot;:
--&gt; 590     return self._apply_blockwise(homogeneous_func, name)
    591 else:
    592     return self._apply_tablewise(homogeneous_func, name)
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:442, in BaseWindow._apply_blockwise(self, homogeneous_func, name)
    437 &quot;&quot;&quot;
    438 Apply the given function to the DataFrame broken down into homogeneous
    439 sub-frames.
    440 &quot;&quot;&quot;
    441 if self._selected_obj.ndim == 1:
--&gt; 442     return self._apply_series(homogeneous_func, name)
    444 obj = self._create_data(self._selected_obj)
    445 if name == &quot;count&quot;:
    446     # GH 12541: Special case for count where we support date-like types
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:431, in BaseWindow._apply_series(self, homogeneous_func, name)
    428 except (TypeError, NotImplementedError) as err:
    429     raise DataError(&quot;No numeric types to aggregate&quot;) from err
--&gt; 431 result = homogeneous_func(values)
    432 return obj._constructor(result, index=obj.index, name=obj.name)
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:582, in BaseWindow._apply.&lt;locals&gt;.homogeneous_func(values)
    579     return func(x, start, end, min_periods, *numba_args)
    581 with np.errstate(all=&quot;ignore&quot;):
--&gt; 582     result = calc(values)
    584 if numba_cache_key is not None:
    585     NUMBA_FUNC_CACHE[numba_cache_key] = func
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:579, in BaseWindow._apply.&lt;locals&gt;.homogeneous_func.&lt;locals&gt;.calc(x)
    571 start, end = window_indexer.get_window_bounds(
    572     num_values=len(x),
    573     min_periods=min_periods,
    574     center=self.center,
    575     closed=self.closed,
    576 )
    577 self._check_window_bounds(start, end, len(x))
--&gt; 579 return func(x, start, end, min_periods, *numba_args)
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\window\rolling.py:1342, in RollingAndExpandingMixin._generate_cython_apply_func.&lt;locals&gt;.apply_func(values, begin, end, min_periods, raw)
   1339 if not raw:
   1340     # GH 45912
   1341     values = Series(values, index=self._on)
-&gt; 1342 return window_func(values, begin, end, min_periods)
File ~\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\_libs\window\aggregations.pyx:1315, in pandas._libs.window.aggregations.roll_apply()
TypeError: must be real number, not tuple

What can I do to solve this?

Edit: display(df_col_slice) is giving the following output:

0      0.282043
1      0.103355
2      0.537766
3      0.491976
4      0.535050
         ...   
96     0.022696
97     0.438995
98    -0.131486
99     0.248250
100    1.246463
Length: 101, dtype: float64

答案1

得分: 3

hurst.compute_Hc 函数返回一个包含 3 个值的元组：

H，c，vals = compute_Hc(df_col_slice)

其中，H 是赫斯特指数，而 c 是某个常数。

但是，pandas._libs.window.aggregations.roll_apply() 期望其参数（函数）返回一个单一的标量，它是滚动窗口的减小结果。

这就是为什么你的 hurst_function 函数需要从 vals 返回某个特定值。

英文:

hurst.compute_Hc function returns a tuple of 3 values:

H, c, vals = compute_Hc(df_col_slice)

where H is the Hurst exponent , and c - is some constant.

But, pandas._libs.window.aggregations.roll_apply() expects its argument (function) to return a single (scalar) which is the reduced result of a rolling window.

That's why your hurst_function function need to return a certain value from vals.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

df.apply(hurst_function) 报错：必须是实数，而不是元组，在 Python 中。

问题

答案1

如何使装饰器在函数体中缩小类型？

A Python script built upon the requests module throws a KeyError when it goes for the next page after grabbing content from the first page

在Python中将列表附加到基本列表中。

如何在tkinter上创建一个具有3个选项卡的窗口。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。