2023-08-10 23:32:53

MLPRegressor result

Question


This is an example from a book that I can't figure out or get to work. I could solve the other cases, but this one has become a challenge.
When I run it, I get the following traceback:

Traceback (most recent call last):
  File "c:\BAULO\PYTHON\ESTRUCTURAS\P_ML\UNIDAD6\Book16.py", line 14, in <module>
    data_y = msu_df[w]
  File "C:\Users\Mauri\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\frame.py", line 3810, in __getitem__
    indexer = self.columns._get_indexer_strict(key, "columns")[1]
  File "C:\Users\Mauri\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\indexes\base.py", line 6111, in _get_indexer_strict
    self._raise_if_missing(keyarr, indexer, axis_name)
  File "C:\Users\Mauri\AppData\Local\Programs\Python\Python310\lib\site-packages\pandas\core\indexes\base.py", line 6171, in _raise_if_missing
    raise KeyError(f"None of [{key}] are in the [{axis_name}]")
KeyError: "None of [Index([('N_Applications',)], dtype='object')] are in the [columns]"

The csv can be downloaded from: https://github.com/PacktPublishing/Hands-On-Data-Preprocessing-in-Python/blob/main/Chapter06/MSU%20applications.csv

I tried axis=1 and reshape, but I can't figure out the error. I know this topic has already been discussed but what I found doesn't work for me either.

import pandas as pd
import numpy as np
from sklearn.neural_network import MLPRegressor
msu_df = pd.read_csv('MSU applications.csv')
msu_df.set_index('Year', drop=True, inplace=True)
X = ['P_Football_Performance','SMAn2']
y = 'N_Applications'
w = np.reshape(y, (1,-1))
data_X = msu_df[X]
data_y = msu_df[w]
mlp = MLPRegressor(hidden_layer_sizes=6, max_iter=100000)
print(mlp.predict(mlp.fit(data_X, data_y)))

Answer 1

Score: 0


You are doing an unnecessary step here:

w = np.reshape(y, (1,-1))

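I'm not sure why you are doing this, but that step turns y from a plain string into a 2-D NumPy array of shape (1, 1). pandas then treats the array as a list of column keys, each one the tuple ('N_Applications',), finds no such column, and raises the KeyError shown in your traceback. Pass y directly to select the label column from the dataframe msu_df: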

y = 'N_Applications'
data_y = msu_df[y]
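
To see the difference between the two lookups without the book's CSV, here is a minimal sketch on a made-up frame. Only the column names mirror the question; the values are invented for illustration, and the exact exception type and message may vary between pandas versions.

```python
import numpy as np
import pandas as pd

# Made-up data: only the column names come from the question.
df = pd.DataFrame(
    {
        "P_Football_Performance": [0.50, 0.62, 0.41],
        "SMAn2": [15000.0, 15800.0, 16500.0],
        "N_Applications": [20000, 21500, 22000],
    },
    index=pd.Index([2019, 2020, 2021], name="Year"),
)

y = "N_Applications"

# Reshaping a string wraps it in a 2-D array with a single element.
w = np.reshape(y, (1, -1))
print(type(w).__name__, w.shape)

# Indexing with that array makes pandas look for a list of keys,
# none of which is the plain string 'N_Applications'.
lookup_failed = False
try:
    df[w]
except (KeyError, ValueError) as exc:
    lookup_failed = True
    print(type(exc).__name__, exc)

# Indexing with the plain string returns the column as a 1-D Series,
# which is a suitable target for fit().
data_y = df[y]
print(type(data_y).__name__, data_y.shape)
```

Selecting with the string keeps the target one-dimensional, so no reshape is needed before training.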

Additional: you are calling predict() on the return value of fit(), which is not the correct way to make a prediction. fit() is the training stage: it takes the training data (here data_X and data_y) and returns the fitted estimator itself, not data, so passing its result to predict() fails. You would also want to predict on new, unseen data rather than on the rows the model was trained on. Replace this line:

print(mlp.predict(mlp.fit(data_X, data_y)))

with this:

mlp.fit(data_X, data_y)

Sample code for prediction from the tutorial notebook you are following:

newData = pd.DataFrame({'P_Football_Performance': 0.364, 'SMAn2': 17198},
                       index=[2022])
mlp.predict(newData)
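
Putting the two fixes together, the whole flow might look roughly like the sketch below. It uses randomly generated stand-in data instead of 'MSU applications.csv', so the numbers mean nothing. The formula for the target, the smaller max_iter, and random_state=0 are assumptions made only to keep the example quick and repeatable.

```python
import numpy as np
import pandas as pd
from sklearn.neural_network import MLPRegressor

# Stand-in training data (assumption): random values under the same
# column names as the question, indexed by year.
rng = np.random.default_rng(0)
years = np.arange(2006, 2022)
train = pd.DataFrame(
    {
        "P_Football_Performance": rng.uniform(0.2, 0.9, size=years.size),
        "SMAn2": rng.uniform(14000, 18000, size=years.size),
    },
    index=pd.Index(years, name="Year"),
)
# Invented relationship, used only so there is something to fit.
train["N_Applications"] = (
    5000 * train["P_Football_Performance"] + 1.1 * train["SMAn2"]
)

data_X = train[["P_Football_Performance", "SMAn2"]]  # 2-D features
data_y = train["N_Applications"]                     # 1-D target

mlp = MLPRegressor(hidden_layer_sizes=(6,), max_iter=2000, random_state=0)

# Train first. fit() hands back the estimator, not predictions.
fitted = mlp.fit(data_X, data_y)
print(fitted is mlp)

# Then predict on a new row with the same feature columns.
new_data = pd.DataFrame(
    {"P_Football_Performance": 0.364, "SMAn2": 17198}, index=[2022]
)
prediction = mlp.predict(new_data)
print(prediction.shape)
```

With features this small and unscaled, scikit-learn may emit a ConvergenceWarning and the prediction may be poor. The point here is only the order of the calls: fit on the training frame, then predict on a separate frame.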
