2023年3月8日 17:10:32go评论126阅读模式

英文:

Pandas update previous records because future peaking is not possible

问题

以下是您要翻译的代码部分：

import numpy as np
import pandas_ta as ta
from pandas import DataFrame, pandas

df = pandas.DataFrame({"color": [None, None, 'blue', None, None, None, 'orange', None, None, None, None],
                       'bottom': [1, 2, 7, 5, 9, 9, 5, 4, 5, 5, 3],
                       'top': [5, 5, 11, 8, 10, 10, 9, 7, 10, 6, 7])

print(df)

# lookback period
N = 3

# Pivot each color to own column and shift
df2 = (df.pivot(columns='color', values=['top', 'bottom'])
         .drop(columns=np.nan, level=1)
         .ffill(limit=N-1).shift()
       )

# compare current top with bottom & top from color occurance
out = df.join((df2['bottom'].le(df['top'], axis=0)
               & df2['top'].ge(df['top'], axis=0)).astype(int))
print(out)

希望这对您有所帮助。如果您有任何其他问题，请随时提出。

英文:

This is what I have so far:

import numpy as np
import pandas_ta as ta
from pandas import DataFrame, pandas

df = pandas.DataFrame({&quot;color&quot;: [None, None, &#39;blue&#39;, None, None, None, &#39;orange&#39;, None, None, None, None],
                       &#39;bottom&#39;: [1, 2, 7, 5, 9, 9, 5, 4, 5, 5, 3],
                       &#39;top&#39;: [5, 5, 11, 8, 10, 10, 9, 7, 10, 6, 7]})

print(df)

&quot;&quot;&quot;
     color  down  top
0     None     1    5
1     None     2    5
2     blue     7   11
3     None     5    8
4     None     9   10
5     None     9   10
6   orange     5    9
7     None     4    7
8     None     5   10
9     None     5    6
10    None     3    7
&quot;&quot;&quot;

# lookback period
N = 3

# Pivot each color to own column and shift
df2 = (df.pivot(columns=&#39;color&#39;, values=[&#39;top&#39;, &#39;bottom&#39;])
         .drop(columns=np.nan, level=1)
         .ffill(limit=N-1).shift()
       )


# compare current top with bottom &amp; top from color occurance
out = df.join((df2[&#39;bottom&#39;].le(df[&#39;top&#39;], axis=0)
               &amp; df2[&#39;top&#39;].ge(df[&#39;top&#39;], axis=0)).astype(int))
print(out)


&quot;&quot;&quot;
     color  bottom  top  blue  orange
0     None       1    5     0       0
1     None       2    5     0       0
2     blue       7   11     0       0
3     None       5    8     1       0
4     None       9   10     1       0
5     None       9   10     1       0
6   orange       5    9     0       0
7     None       4    7     0       1
8     None       5   10     0       0
9     None       5    6     0       1
10    None       3    7     0       0
&quot;&quot;&quot;

Question:

I only want to consume each color once. That means that for every blue or orange occurrence there can only be only one 1 in the upcoming 3 rows.
( 2 blues after each other will result in two 1s. One 1 for every blue.)

&quot;&quot;&quot;
     color  bottom  top  blue  orange
0     None       1    5     0       0
1     None       2    5     0       0
2     blue       7   11     0       0
3     None       5    8     1       0
4     None       9   10     1       0 --&gt; this should be 0, blue already consumed on row 3
5     None       9   10     1       0 --&gt; this should be 0, blue already consumed on row 3
6   orange       5    9     0       0
7     None       4    7     0       1
8     None       5   10     0       0
9     None       5    6     0       1 --&gt; this should be 0, orange already consumed on row 7
10    None       3    7     0       0
&quot;&quot;&quot;

One bottleneck is that for this to function correctly I am not allowed to peak in to the future. So I am not allowed to use .shift(-3) or iloc[-1] for example.

That sort of kills my initial thinking about keeping track of a consumed state by using something like .rolling(-3).max() == 1 .

答案1

得分: 1

你可以对输出进行后处理，只保留每个组的第一个1：

使用循环：

cols = list(df['color'].dropna().unique())

g = out.groupby(df['color'].notna().cumsum())
for c in cols:
    out[c] = np.where(out[c].eq(1) & df.index.isin(g[c].idxmax()), 1, 0)

输出：

     color  bottom  top  blue  orange
0     None       1    5     0       0
1     None       2    5     0       0
2     blue       7   11     0       0
3     None       5    8     1       0
4     None       9   10     0       0
5     None       9   10     0       0
6   orange       5    9     0       0
7     None       4    7     0       1
8     None       5   10     0       0
9     None       5    6     0       0
10    None       3    7     0       0

请注意，上述代码是对给定代码的输出进行后处理以保留每个组的第一个1。

英文:

You can post-process the output to only keep the first 1 per group:

# lookback period
N = 3

# Pivot each color to own column and shift
df2 = (df.pivot(columns=&#39;color&#39;, values=[&#39;top&#39;, &#39;bottom&#39;])
         .drop(columns=np.nan, level=1)
         .ffill(limit=N-1).shift()
       )

# compare current top with bottom &amp; top from color occurance
out = df.join((df2[&#39;bottom&#39;].le(df[&#39;top&#39;], axis=0)
               &amp; df2[&#39;top&#39;].ge(df[&#39;top&#39;], axis=0)).astype(int))

# post process the output to keep only the first 1
cols = list(df[&#39;color&#39;].dropna().unique())

out[cols] = out[cols].mask(out[cols].ne(out.groupby(df[&#39;color&#39;].notna().cumsum())[cols].cumsum()), 0)

Or with a loop:

cols = list(df[&#39;color&#39;].dropna().unique())

g = out.groupby(df[&#39;color&#39;].notna().cumsum())
for c in cols:
    out[c] = np.where(out[c].eq(1) &amp; df.index.isin(g[c].idxmax()), 1, 0)

Output:

     color  bottom  top  blue  orange
0     None       1    5     0       0
1     None       2    5     0       0
2     blue       7   11     0       0
3     None       5    8     1       0
4     None       9   10     0       0
5     None       9   10     0       0
6   orange       5    9     0       0
7     None       4    7     0       1
8     None       5   10     0       0
9     None       5    6     0       0
10    None       3    7     0       0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Pandas 更新先前的记录，因为未来的峰值不可能。

问题

Question:

答案1

PyQt5标签不会自动换行文本

如何获取tkinter Spinbox的值？

循环遍历 n 个 CSV 文件并在 Python 中删除列

‘scikit-learn documentation example: ‘got an unexpected keyword argument”

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论