February 8, 2023 17:11:03

Set values in dataframe A by iterating from values on dataframe B

Question

DataFrame A is similar to this:

import pandas as pd

info2 = {'speed': [None]*80}
dfA = pd.DataFrame(info2)
dfA

DataFrame B is similar to this:

info={"IndexSpeed":[7,16,44,56,80],"speed":[25,50,25,50,90]}
dfB = pd.DataFrame(info)
dfB

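I need to set the values in dfA['speed'] using the values in dfB. For instance, every row of dfA with index <= 7 should get a speed of 25, every row with index between 8 and 16 should get 50, and so on until all 80 rows are set.

What would be the best way to do this?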

Answer 1

Score: 1

You can use merge_asof:

dfA['speed'] = pd.merge_asof(dfA.drop(columns='speed'), dfB,
                             left_index=True, right_on='IndexSpeed',
                             direction='forward',
                             )['speed']

Note: dfA must be sorted on its index, and dfB must be sorted on IndexSpeed.

Output:

    speed
0      25
1      25
2      25
3      25
4      25
..    ...
75     90
76     90
77     90
78     90
79     90
[80 rows x 1 columns]

Output as an array:

array([25, 25, 25, 25, 25, 25, 25, 25, 50, 50, 50, 50, 50, 50, 50, 50, 50,
       25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25,
       25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 25, 50, 50, 50, 50, 50, 50,
       50, 50, 50, 50, 50, 50, 90, 90, 90, 90, 90, 90, 90, 90, 90, 90, 90,
       90, 90, 90, 90, 90, 90, 90, 90, 90, 90, 90, 90])
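
As a cross-check on the merge_asof result, here is a minimal sketch of the same lookup done with NumPy's searchsorted. This is an alternative technique, not part of the original answer. It assumes dfB is sorted on IndexSpeed and that every index in dfA is no larger than the largest IndexSpeed value. The names bounds, speeds and pos are illustrative only.

```python
import numpy as np
import pandas as pd

dfA = pd.DataFrame({'speed': [None] * 80})
dfB = pd.DataFrame({'IndexSpeed': [7, 16, 44, 56, 80],
                    'speed': [25, 50, 25, 50, 90]})

# Assumption: dfB is sorted on IndexSpeed and covers every index of dfA.
bounds = dfB['IndexSpeed'].to_numpy()
speeds = dfB['speed'].to_numpy()

# side='left' gives the position of the first bound >= each index,
# which mirrors direction='forward' (exact matches allowed) in merge_asof.
pos = np.searchsorted(bounds, dfA.index.to_numpy(), side='left')
dfA['speed'] = speeds[pos]

print(dfA['speed'].value_counts().sort_index())
```

If dfA could contain an index beyond the last bound, pos would point past the end of speeds, so that case would need handling separately.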

Answer 2

Score: 0

Maybe use a dict whose keys are two-element (lower, upper) tuples:

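# Pair each interval's lower bound (0, then previous bound + 1) with its upper bound
zipped = zip(pd.concat([pd.Series(0), dfB.IndexSpeed + 1]), dfB.IndexSpeed)
# Map each (lower, upper) tuple to the position of the matching row in dfB
ind_mapper = {(i, j): k for (k, (i, j)) in enumerate(zipped)}
for lower, upper in ind_mapper:
    # iloc slices exclude the end position, so use upper + 1 to include the bound itself
    dfA.iloc[lower:upper + 1, 0] = dfB.iloc[ind_mapper[(lower, upper)], 1]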
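
The same interval idea can probably be written without an explicit loop. The sketch below uses pd.cut to find which interval each index falls in; it is a swapped-in technique rather than the answer's own method. It assumes dfA has a non-negative integer index and that IndexSpeed is strictly increasing. The names edges and codes are made up for illustration.

```python
import pandas as pd

dfA = pd.DataFrame({'speed': [None] * 80})
dfB = pd.DataFrame({'IndexSpeed': [7, 16, 44, 56, 80],
                    'speed': [25, 50, 25, 50, 90]})

# Bin edges: -1 opens the first interval so that index 0 is included.
# Assumption: IndexSpeed is strictly increasing and dfA's index is >= 0.
edges = [-1] + dfB['IndexSpeed'].tolist()

# labels=False returns the integer position of the interval for each index.
# Intervals are closed on the right by default, e.g. (7, 16] holds 8..16.
codes = pd.cut(dfA.index.to_numpy(), bins=edges, labels=False)

# Use the interval positions to pick the matching speed from dfB.
dfA['speed'] = dfB['speed'].to_numpy()[codes]
```

Because the intervals are right-closed, the boundary rows 7, 16, 44 and 56 take the speed of the interval they close.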
