2023年3月7日 23:06:45go评论107阅读模式

英文:

How to Inverse Transform a Predicted Output of a loaded pickle XGBoost model?

问题

以下是翻译好的内容：

I am trying to run a program that could produce a predicted output using a loaded model (pickle file). The saved model (XGBoost) was trained to have its dataset to undergo transformation via StandardScaler before fitting it, and the predicted value needs to be inverse transformed to get the actual predicted value. The data consists of 2 input values, and 1 output value.

我正在尝试运行一个程序，使用加载的模型（pickle文件）来产生预测输出。保存的模型（XGBoost）在拟合之前经过StandardScaler的数据集转换，需要将预测值逆向转换以获得实际预测值。数据包括2个输入值和1个输出值。

I already have done prediction using the pickle file. However, when I try to inverse transform the output, I get an error saying "sklearn.exceptions.NotFittedError: This StandardScaler instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator."

我已经使用pickle文件进行了预测。然而，当我尝试逆向转换输出时，出现错误，提示："sklearn.exceptions.NotFittedError: This StandardScaler instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator."

What could fix this error?

如何修复这个错误？

I also tried StandardScaler transform on the input variables of raw_data. Yet, I receive another error saying "ValueError: non-broadcastable output operand" with shape (1,1) doesn't match the broadcast shape (1,2).

我还尝试对raw_data的输入变量进行了StandardScaler转换。然而，我收到另一个错误，提示："ValueError: non-broadcastable output operand"，形状为（1,1），与广播形状（1,2）不匹配。

英文:

raw_data = pd.DataFrame(data, columns=columns)
raw_data[&#39;X&#39;] = raw_data[&#39;X&#39;].astype(float)
raw_data[&#39;Y&#39;] = raw_data[&#39;Y&#39;].astype(float)
print(raw_data)
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
xgb_model_loaded = pickle.load(open(&#39;model_1.pkl&#39;, &#39;rb&#39;))
output = xgb_model_loaded.predict(raw_data)
output = sc.inverse_transform((output.reshape(-1,1)), copy=None)
print(output)

What could fix this error?

I also tried StandardScaler transform on the input variables of raw_data. Yet, I receive another error saying
"ValueError: non-broadcastable output operand" with shape (1,1) doesn't match the broadcast shape (1,2)"

答案1

得分: 1

将StandardScaler导出为pickle文件，然后与模型一起加载以供使用。
只需保存用于训练模型的StandardScaler参数（sc.mean_ 和 sc.var_），这就足够用于进行变换和逆变换。
使用管道（首选方法）：在训练模型时，使用管道将StandardScaler和XGBoost组合成一个单一的模型：

from sklearn.pipeline import Pipeline
# 创建一个管道
model = Pipeline([
    ('StandardScaler', sc),
    ('XGBoost', xgb_model),
])
# 然后训练并保存模型。

英文:

There are three ways to do this:

1. Export StandardScaler as pickle as well
You can export the StandardScaler you used to train the model as a pickle as well and then load it with the model to use it.

2. Just save StandardScaler parameters
Save the parameters of the StandardScaler you used to train the model (sc.mean_ and sc.var_), it's all you need to transform and inverse transform.

3. Using a pipeline (preferred method)
When training the model, use a pipeline to group StandardScaler and XGBoost in a single model:

from sklearn.pipeline import Pipeline
# Create a pipeline
model = Pipeline([
    (&#39;StandardScaler&#39;, sc),
    (&#39;XGBoost&#39;, xgb_model),
])
#Then train and save the model.

答案2

得分: 0

以下是翻译好的部分：

"我通过保存StandardScaler参数并在我的程序中加载它来解决了这个问题。现在，预测数据可以以其逆转换的形式显示。

raw_data = pd.DataFrame(data, columns=columns)
raw_data['X'] = raw_data['X'].astype(float)
raw_data['Y'] = raw_data['Y'].astype(float)
print(raw_data)
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
xgb_model_loaded = pickle.load(open('model_1.pkl', 'rb'))
output = xgb_model_loaded.predict(raw_data)
sc = load('Std_Scaler_1.bin')
output = sc.inverse_transform((bgl_output.reshape(-1,1)), copy=None)
print(output)

希望这对你有帮助。

英文:

I was able to solve this problem by saving the StandardScaler parameters, and loading it in my program. Now, the predicted data can be displayed in its inverse transform figure.

raw_data = pd.DataFrame(data, columns=columns)
raw_data[&#39;X&#39;] = raw_data[&#39;X&#39;].astype(float)
raw_data[&#39;Y&#39;] = raw_data[&#39;Y&#39;].astype(float)
print(raw_data)
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
xgb_model_loaded = pickle.load(open(&#39;model_1.pkl&#39;, &#39;rb&#39;))
output = xgb_model_loaded.predict(raw_data)
sc = load(&#39;Std_Scaler_1.bin&#39;)
output = sc.inverse_transform((bgl_output.reshape(-1,1)), copy=None)
print(output)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何反向转换加载的 pickle XGBoost 模型的预测输出？

问题

答案1

答案2

Discord.py 2.0.0：discord.errors.ClientException：此客户端已有关联的命令树

如何将多个参数参数作为单个变量传递给Python函数

ModuleNotFoundError: 未找到模块名称 states

在一列中计算连续NaN值的快速方法

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。