2023年7月18日 14:46:48go评论99阅读模式

英文:

Conditional shifting pandas column

问题

以下是您要翻译的内容：

import pandas as pd
import numpy as np
# 创建'i'和'price'列的数据
n = 10  # 条目数
i_values = list(range(1, n+1))
price_values = [10.99, 19.99, 5.99, 8.49, 12.99, 15.99, 9.99, 14.99, 6.99, 11.99]
# 创建DataFrame
data = {'i': i_values,
        'price': price_values}
df = pd.DataFrame(data)
df['price_new'] = df.loc[df.i > 6, 'price'].shift(-3)

期望的输出：

n = 10  # 条目数
i_values = list(range(1, n+1))
price_values = [10.99, 19.99, 5.99, 8.49, 12.99, 15.99, 9.99, 14.99, 6.99, 11.99]
new_price_values = [np.NaN, np.NaN, np.NaN, 9.99, 14.99, 6.99, 11.99, np.NaN, np.NaN, np.NaN]
# 创建DataFrame
data = {'i': i_values,
        'price': price_values,
        'new_price': new_price_values}
df = pd.DataFrame(data)

英文:

I want to conditional shift pandas column, would want to shift all columns with i > 6 below is what I am doing and it is not working

import pandas as pd
import numpy as np
# Creating data for &#39;i&#39; and &#39;price&#39; columns
n = 10  # Number of entries
i_values = list(range(1, n+1))
price_values = [10.99, 19.99, 5.99, 8.49, 12.99, 15.99, 9.99, 14.99, 6.99, 11.99]
# Creating DataFrame
data = {&#39;i&#39;: i_values,
        &#39;price&#39;: price_values}
df = pd.DataFrame(data)
df[&#39;price_new&#39;] = df.loc[df.i&gt;6, &#39;price&#39;].shift(-3)

Expected output:

n = 10  # Number of entries
i_values = list(range(1, n+1))
price_values = [10.99, 19.99, 5.99, 8.49, 12.99, 15.99, 9.99, 14.99, 6.99, 11.99]
new_price_values = [np.NaN, np.NaN, np.NaN, 9.99, 14.99, 6.99, 11.99, np.NaN, np.NaN, np.NaN]
# Creating DataFrame
data = {&#39;i&#39;: i_values,
        &#39;price&#39;: price_values,
        &#39;new_price&#39;: new_price_values}
df = pd.DataFrame(data)

答案1

得分: 1

应用偏移量，然后选择您希望保留的单元格。看起来您试图一次完成所有操作，只是在过程中错误地获取了索引。

您所寻求的一行代码

shift_from = 6
shift_by = -3
df['price_new'] = df.loc[df.i>(shift_from+shift_by),'price'].shift(shift_by)

这将产生与您期望的输出完全相同的结果。

为了清晰起见，拆分成两个步骤

带有可舍弃的中间列。

1) 应用偏移

df['price_shift'] = df['price'].shift(shift_by)
df
    i  price  price_shift
0   1  10.99         8.49
1   2  19.99        12.99
2   3   5.99        15.99
3   4   8.49         9.99
4   5  12.99        14.99
5   6  15.99         6.99
6   7   9.99        11.99
7   8  14.99          NaN
8   9   6.99          NaN
9  10  11.99          NaN

2) 选择单元格

df['price_new'] = df.loc[df.i>(shift_from+shift_by), 'price_shift']
df
    i  price  price_shift  price_new
0   1  10.99         8.49        NaN
1   2  19.99        12.99        NaN
2   3   5.99        15.99        NaN
3   4   8.49         9.99       9.99
4   5  12.99        14.99      14.99
5   6  15.99         6.99       6.99
6   7   9.99        11.99      11.99
7   8  14.99          NaN        NaN
8   9   6.99          NaN        NaN
9  10  11.99          NaN        NaN

英文:

Apply the shift, then select the cells you wish to keep. It looks like you're attempting to do it all at once and simply getting the indices wrong in the process.

What you seek as a one-liner

shift_from = 6
shift_by = -3
df[&#39;price_new&#39;] = df.loc[df.i&gt;(shift_from+shift_by),&#39;price&#39;].shift(shift_by)

This produces exactly your expected output.

Decomposed in 2 steps for clarity

With dispensable intermediate column.

1) Apply shift

df[&#39;price_shift&#39;] = df[&#39;price&#39;].shift(shift_by)
df
	i	price	price_shift
0	1	10.99	8.49
1	2	19.99	12.99
2	3	5.99	15.99
3	4	8.49	9.99
4	5	12.99	14.99
5	6	15.99	6.99
6	7	9.99	11.99
7	8	14.99	NaN
8	9	6.99	NaN
9	10	11.99	NaN

2) Select cells

df[&#39;price_new&#39;] = df.loc[df.i&gt;(shift_from+shift_by), &#39;price_shift&#39;]
df
    i  price  price_shift  price_new
0   1  10.99         8.49        NaN
1   2  19.99        12.99        NaN
2   3   5.99        15.99        NaN
3   4   8.49         9.99       9.99
4   5  12.99        14.99      14.99
5   6  15.99         6.99       6.99
6   7   9.99        11.99      11.99
7   8  14.99          NaN        NaN
8   9   6.99          NaN        NaN
9  10  11.99          NaN        NaN

答案2

得分: 0

这是一种方法：

df['new_price'] = df['price'].where(df.index >= 6, np.NaN).shift(-3)

使用df.loc[df.i>6, 'price'].shift(-3)的问题在于它选择了最后四行（其中索引大于6的行）：

>>> df.loc[df.i>6, 'price']
6     9.99
7    14.99
8     6.99
9    11.99

然后对它们进行了向前平移：

>>> df.loc[df.i>6, 'price'].shift(-3)
6    11.99
7      NaN
8      NaN
9      NaN

英文:

Here's one approach:

df[&#39;new_price&#39;] = df[&#39;price&#39;].where(df.index &gt;= 6, np.NaN).shift(-3)

The problem with df.loc[df.i>6, 'price'].shift(-3) is that it's selecting the last four rows (the ones where index is greater than 6:

&gt;&gt;&gt; df.loc[df.i&gt;6, &#39;price&#39;]
6     9.99
7    14.99
8     6.99
9    11.99

and then it's shifting those:

&gt;&gt;&gt; df.loc[df.i&gt;6, &#39;price&#39;].shift(-3)
6    11.99
7      NaN
8      NaN
9      NaN

答案3

得分: 0

以下是翻译好的内容：

这是另一种方法。

import pandas as pd
import numpy as np
# 为'i'和'price'列创建数据
n = 10  # 条目数
i_values = list(range(1, n+1))
price_values = [10.99, 19.99, 5.99, 8.49, 12.99, 15.99, 9.99, 14.99, 6.99, 11.99]
# 创建DataFrame
data = {'i': i_values,
        'price': price_values}
df = pd.DataFrame(data)
df['price_new'] = df.loc[df.i > 6, 'price']
df['price_new'] = df['price_new'].shift(-3)

所以，首先创建新列（price_new），然后应用移位。

英文:

Here is another approach.

import pandas as pd
import numpy as np
# Creating data for &#39;i&#39; and &#39;price&#39; columns
n = 10  # Number of entries
i_values = list(range(1, n+1))
price_values = [10.99, 19.99, 5.99, 8.49, 12.99, 15.99, 9.99, 14.99, 6.99, 11.99]
# Creating DataFrame
data = {&#39;i&#39;: i_values,
        &#39;price&#39;: price_values}
df = pd.DataFrame(data)
df[&#39;price_new&#39;] = df.loc[df.i&gt;6, &#39;price&#39;]
df[&#39;price_new&#39;] = df[&#39;price_new&#39;].shift(-3)

So, first create new column (price_new), and then apply shift.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

条件移动 pandas 列

问题

答案1

答案2

答案3

可以使用 Playwright Python 进行网页抓取后按下另一个按钮吗？

解决二次阻力耦合微分方程。

在使用pyautogui和keyboard库进行Python循环时出现问题。

如何使用Python请求在SynchroTeam API中POST customFieldValues？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。