2020年1月7日 01:39:56go评论111阅读模式

英文:

Keras treat Image as array of arrays not a single picture

问题

问题可能出在你的输入图像形状重塑上。在神经网络中，输入的形状必须与模型的输入层匹配。你的模型的输入层期望形状是(1, 240, 256, 3)，但你的 picture 数组似乎已经具有正确的形状(1, 240, 256, 3)。

问题可能出在你对 picture 的重塑方式上。请确保在调用self.model.predict(picture, verbose=1)之前，picture 的形状是(1, 240, 256, 3)，而不是更多的嵌套维度。

如果问题仍然存在，还请检查模型的架构，确保模型的输出层是正确的，并且适合你的问题。如果输出仍然不正确，可能需要进一步调查模型和数据的问题。

英文:

So I have an NN inside of a class

      self.model = Sequential()
      self.model.add(Conv2D(50, (3, 3), activation=&#39;relu&#39;, input_shape=(240,256,3)))
      self.model.add(Dense(264,activation=&#39;relu&#39;))
      self.model.add(Dense(7,activation=&#39;relu&#39;))
      self.model.compile(optimizer=Adam(lr=0.001),loss=&#39;categorical_crossentropy&#39;,metrics=[&#39;accuracy&#39;])

and I have an array that is in shape (240, 256, 3)

print(picture.shape) #(240, 256, 3)
picture = np.reshape(picture,(1,240,256,3))

and then try to

self.model.predict(picture,verbose=1)

but instead of output like this [ 0. 25.21973 0. 0. 0. 1.8569145 0.] I got something like

[[[[ 0.         25.21973     0.         ...  0.          1.8569145
     0.        ]
   [ 0.         25.21973     0.         ...  0.          1.8569145
     0.        ]
   [ 0.         25.21973     0.         ...  0.          1.8569145
     0.        ]
   ...
  [[ 0.         14.3257885   0.         ...  1.7455587   0.
     0.        ]
   [ 0.         25.417042    0.         ...  0.          7.501096
     0.        ]
   [ 0.         24.028965    0.         ... 14.10364     0.
     0.        ]
   ...
   [ 0.         17.480661    0.         ...  3.4586341   0.
     0.        ]]
  [[ 0.         21.477276    0.         ...  0.          0.
     0.        ]
   [ 0.         15.683931    0.         ...  0.          0.
     0.        ]
   [ 0.         10.419488    0.         ...  0.          0.29006004
     0.        ]
   ...
   [ 0.          7.038389    0.         ...  0.          0.
     0.        ]]
  [[ 0.         18.099554    0.         ...  0.          0.
     0.        ]
   [ 0.          8.225699    0.         ...  0.751534    0.
     0.        ]
   [ 0.         13.256775    0.         ...  0.          2.1235647
     0.        ]]]]

can you tell me what is the problem?

答案1

得分: 2

默认情况下，Keras的Dense层在输入的最后一个维度上操作，因此当您输入一张图像时，会得到另一张图像作为输出。问题出在您的模型上。如果您使用model.summary()，您会看到模型的输出形状实际上与通过predict看到的形状相同。

解决方法很简单，在最后一个Conv2D层之后添加一个Flatten层：

self.model = Sequential()
self.model.add(Conv2D(50, (3, 3), activation='relu', input_shape=(240, 256, 3)))
self.model.add(Flatten())
self.model.add(Dense(264, activation='relu'))
self.model.add(Dense(7, activation='relu'))

然后您的模型将按预期工作。

英文:

By default, Keras' Dense layers operate on the last dimension of the input, so when you input an image, you get another image as output. The problem is with your model. If you use model.summary() you will see that the output shape of your model is actually the one you are seeing through predict.

The solution is simple, add a Flatten layer after the last Conv2D:

self.model = Sequential()
self.model.add(Conv2D(50, (3, 3), activation=&#39;relu&#39;, input_shape=(240,256,3)))
self.model.add(Flatten())
self.model.add(Dense(264,activation=&#39;relu&#39;))
self.model.add(Dense(7,activation=&#39;relu&#39;))

Then your model will behave as expected.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Keras将图像视为数组的数组，而不是单个图片。

问题

答案1

<图大小 800×800，无坐标轴> 在使用 plt.show() 时，Jupyter Notebook

Spark任务数量不等于分区数量。

如何将多个Python模块导入一个Tkinter窗口

合并某些行的时间序列数据。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。