2023年7月7日 01:04:48go评论70阅读模式

英文:

How to assign new columns based on chaining in pandas

问题

我正在尝试在pandas中使用链式操作创建一个新的数据框。

result = (
    df.drop(['Day_of_year', 'Month', 'Week_of_year'], axis='columns'),
    pd.to_datetime(df['timestamp']),
    .assign("random" = 0)
)
# 访问元组的第一个元素
updated_df = result[0]
updated_df
如果我注释掉最后一行，元组中的代码可以正常工作，但我想要分配新的列。
我该如何做到这一点？

英文:

I'm trying to create a new dataframe using chaining in pandas.

result = (
    df.drop([&#39;Day_of_year&#39;, &#39;Month&#39;, &#39;Week_of_year&#39;], axis=&#39;columns&#39;),
    pd.to_datetime(df[&#39;timestamp&#39;]),
    .assign(&quot;random&quot; = 0)
)
# Access the first element of the tuple
updated_df = result[0]
updated_df

If I comment out the last line the code in the tuple work but I want to assign new columns.

How do I do this?

答案1

得分: 1

尝试这个：

updated_df = (
    df.drop(columns=['Day_of_year', 'Month', 'Week_of_year'])
    .assign(**{"random": 0, 'timestamp': pd.to_datetime(df['timestamp'])})
)

测试

在一个虚拟数据框上评估上述代码：

import pandas as pd
# 创建一个日期范围
date_range = pd.date_range(start='2023-07-01', end='2023-07-10')
# 创建数据框
df = pd.DataFrame()
df['timestamp'] = date_range
df['Day_of_year'] = df['timestamp'].dt.dayofyear
df['Month'] = df['timestamp'].dt.month
df['Week_of_year'] = df['timestamp'].dt.isocalendar().week
updated_df = (
    df.drop(columns=['Day_of_year', 'Month', 'Week_of_year'])
    .assign(**{"random": 0, 'timestamp': pd.to_datetime(df['timestamp'])})
)
print(updated_df)
# 打印：
#
#    timestamp  random
# 0 2023-07-01       0
# 1 2023-07-02       0
# 2 2023-07-03       0
# 3 2023-07-04       0
# 4 2023-07-05       0
# 5 2023-07-06       0
# 6 2023-07-07       0
# 7 2023-07-08       0
# 8 2023-07-09       0
# 9 2023-07-10       0

英文:

Try this:

updated_df = (
    df.drop(columns=[&#39;Day_of_year&#39;, &#39;Month&#39;, &#39;Week_of_year&#39;])
    .assign(**{&quot;random&quot;: 0, &#39;timestamp&#39;: pd.to_datetime(df[&#39;timestamp&#39;])})
)

Testing

Evaluating the above code on a dummy dataframe:

import pandas as pd
# Create a date range
date_range = pd.date_range(start=&#39;2023-07-01&#39;, end=&#39;2023-07-10&#39;)
# Create the DataFrame
df = pd.DataFrame()
df[&#39;timestamp&#39;] = date_range
df[&#39;Day_of_year&#39;] = df[&#39;timestamp&#39;].dt.dayofyear
df[&#39;Month&#39;] = df[&#39;timestamp&#39;].dt.month
df[&#39;Week_of_year&#39;] = df[&#39;timestamp&#39;].dt.isocalendar().week
updated_df = (
    df.drop(columns=[&#39;Day_of_year&#39;, &#39;Month&#39;, &#39;Week_of_year&#39;])
    .assign(**{&quot;random&quot;: 0, &#39;timestamp&#39;: pd.to_datetime(df[&#39;timestamp&#39;])})
)
print(updated_df)
# Prints:
#
#    timestamp  random
# 0 2023-07-01       0
# 1 2023-07-02       0
# 2 2023-07-03       0
# 3 2023-07-04       0
# 4 2023-07-05       0
# 5 2023-07-06       0
# 6 2023-07-07       0
# 7 2023-07-08       0
# 8 2023-07-09       0
# 9 2023-07-10       0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在pandas中基于链式操作分配新列

问题

答案1

测试

Testing

Dataframe通过在另一个Dataframe中查找另一个列的出现来填充一列的值。

Pandas: Shape of passed values is (10, 1), indices imply (10, 5) error when trying to append a dict to an existing Dataframe

基于匹配观察时间计算差异。

筛选 pandas 数据框，使用多个不同列的等值检查。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。