2020年1月7日 02:25:01go评论90阅读模式

英文:

Apply function to list of columns from a dataframe

问题

I'm creating a function that accepts 3 inputs: a dataframe, a column, and a list of columns.

import numpy as np
df = pd.DataFrame([[1, 2, 3, 4], [1, 3, 5, 6], [4, 6, 7, 8], [5, 4, 3, 6]], columns=['A', 'B', 'C', 'D'])
def pre_process(dataframe, y_col_name, x_col_names):
    return new_dataframe

The calculation to be applied to y_col_name's rows is each value of y_col_name divided by the mean of y_col_name.

The calculation to be applied to each of the list of columns in x_col_name is each value of each column, divided by the column's standard deviation.

I would like some help to write the function. I think I need to use an "apply" or a "lambda" function but I'm unsure.

This is what calling the command would look like:

pre_process_data = pre_process(df, 'A', ['B', 'D'])

Thanks

英文:

I'm creating a function that accepts 3 inputs: a dataframe, a column and a list of columns.
The function should apply a short calculation to the single column, and a different short calculation to the list of other columns. It should return a dataframe containing just the amended columns (and their amended rows) from the original dataframe.

import numpy as np
df = pd.DataFrame([[1, 2, 3, 4], [1, 3, 5, 6], [4, 6, 7, 8], [5, 4, 3, 6], columns=[&#39;A&#39;, &#39;B&#39;, &#39;C&#39;, &#39;D&#39;])
def pre_process(dataframe, y_col_name, x_col_names):
    return = new_dataframe

The calculation to be applied to y_col_name's rows is each value of y_col_name divided by the mean of y_col_name.

The calculation to be applied to each of the list of columns in x_col_name is each value of each column, divided by the column's standard deviation.

I would like some help to write the function. I think I need to use an "apply" or a "lambda" function but I'm unsure.

This is what calling the command would look like:

pre_process_data = preprocess(df,&#39;A&#39;, [&#39;B&#39;,&#39;D&#39;])

Thanks

答案1

得分: 0

def pre_process(dataframe, y_col_name, x_col_names):
    new_dataframe = dataframe.copy()
    new_dataframe[y_col_name] = new_dataframe[y_col_name] / new_dataframe[y_col_name].mean()
    new_dataframe[x_col_names] = new_dataframe[x_col_names] / new_dataframe[x_col_names].std()
    return new_dataframe

英文:

def pre_process(dataframe, y_col_name, x_col_names):
    new_dataframe = dataframe.copy()
    new_dataframe[y_col_name] =  new_dataframe[y_col_name]/new_dataframe[y_col_name].mean()
    new_dataframe[x_col_names] = new_dataframe[x_col_names]/new_dataframe[x_col_names].std()
    return new_dataframe

Is this what you mean?

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

将函数应用于数据框中的列列表

问题

答案1

如何在 pandas 数据框中删除包含 NaN 数组的行。

如何在Golang中找到列表对象中的重叠值？

当在数据框构造函数中使用’squeeze’关键字时为什么会出错？

如何根据要求，在SPARK AZURE-DATABRICKS中使用SCALA将JSON对象转换为列的值

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。