2023年3月7日 17:22:19go评论174阅读模式

英文:

proper input shape for video classification ,use folder picture

问题

以下是要翻译的内容：

这里是问题。我有多个文件夹，每个文件夹有37张照片，每张照片有126个关键点，我该如何调整输入形状，因为我得到了一个错误，错误信息为ValueError: Shapes (None, 1) and (None, 5) are incompatible。

输入形状

如果需要模型结构如下：

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.callbacks import TensorBoard
import tensorflow as tf
log_dir = os.path.join('logs')
callback = TensorBoard(log_dir=log_dir)
model = Sequential()
model.add(LSTM(64, input_shape=(37, 126,), return_sequences=True, activation='relu'))
model add(LSTM(128, return_sequences=True, activation='relu'))
model.add(LSTM(64, return_sequences=False, activation='relu'))
model.add(Dense(64, activation='relu'))
model.add(Dense(32, activation='relu'))
model.add(Dense(class_label.shape[0], activation='softmax'))
model.summary()

我想知道我应该检查哪里，感谢所有的回答。

英文:

Here are the problem issues.I have multiples folders ,each folder have 37 photos ,one photos have 126 keypoints,how can I reshape the input shape since I got error saying
ValueError: Shapes (None, 1) and (None, 5) are incompatible

input shape

the model structe if needed

from tensorflow.keras.models import  Sequential
from  tensorflow.keras.layers import LSTM,Dense
from  tensorflow.keras.callbacks import TensorBoard
import  tensorflow as tf
log_dir = os.path.join(&#39;logs&#39;)
callback = TensorBoard(log_dir=log_dir)
model = Sequential()
model.add(LSTM(64,input_shape=(37,126,),return_sequences=True,activation=&#39;relu&#39;))
model.add(LSTM(128,return_sequences=True,activation=&#39;relu&#39;))
model.add(LSTM(64,return_sequences=False,activation=&#39;relu&#39;))
model.add(Dense(64,activation=&#39;relu&#39;))
model.add(Dense(32, activation=&#39;relu&#39;))
model.add(Dense(class_label.shape[0],activation=&#39;softmax&#39;))
model.summary()

I want to know where i should check ,thank you for all the answer

答案1

得分: 1

这是我尝试复制您的代码，假设有5个输出类（为了保持清晰，我添加了额外的Input层: https://stackoverflow.com/questions/45217973/what-is-the-advantage-of-using-an-inputlayer-or-an-input-in-a-keras-model-with）：

from tensorflow.keras import Input
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.callbacks import TensorBoard
import tensorflow as tf
model = Sequential()
model.add(Input(shape=(37, 126,)))
model.add(LSTM(64, input_shape=(37, 126,), return_sequences=True, activation='relu', name='lstm1'))
model add(LSTM(128, return_sequences=True, activation='relu', name='lstm2'))
model.add(LSTM(64, return_sequences=False, activation='relu', name='lstm3'))
model.add(Dense(64, activation='relu', name='d1'))
model.add(Dense(32, activation='relu', name='d2'))
model.add(Dense(5, activation='softmax', name='out'))
model.summary()

结果如下所示：

Model: "sequential_2"
_________________________________________________________________
 Layer (type)           Output Shape         Param #   
================================================================
 lstm1 (LSTM)           (None, 37, 64)       48896     
                                                         
 lstm2 (LSTM)           (None, 37, 128)      98816     
                                                         
 lstm3 (LSTM)           (None, 64)           49408     
                                                         
 d1 (Dense)             (None, 64)           4160      
                                                         
 d2 (Dense)             (None, 32)           2080      
                                                         
 out (Dense)            (None, 5)            528       
                                                         
================================================================
Total params: 203,888
Trainable params: 203,888
Non-trainable params: 0
_________________________________________________________________

也许问题不在于您的输入形状，而是输出形状？如果您的图像是正确的，模型预测图像到？个类别，但只验证到一个类别，因此不能直接进行比较。首先考虑填充真实标签和预测结果以使它们具有相同的大小。

英文:

This is my attempt at replicating your code, assuming 5 output classes (with an extra Input layer for my sanity: https://stackoverflow.com/questions/45217973/what-is-the-advantage-of-using-an-inputlayer-or-an-input-in-a-keras-model-with):

from tensorflow.keras import Input
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dense
from tensorflow.keras.callbacks import TensorBoard
import tensorflow as tf
#log_dir = os.path.join(&#39;logs&#39;)
#callback = TensorBoard(log_dir=log_dir)
model = Sequential()
model.add(Input(shape = (37, 126,)))
model.add(LSTM(64, input_shape = (37, 126,), return_sequences = True, activation = &#39;relu&#39;, name = &#39;lstm1&#39;))
model.add(LSTM(128, return_sequences = True, activation = &#39;relu&#39;, name = &#39;lstm2&#39;))
model.add(LSTM(64, return_sequences = False, activation = &#39;relu&#39;, name = &#39;lstm3&#39;))
model.add(Dense(64, activation = &#39;relu&#39;, name = &#39;d1&#39;))
model.add(Dense(32, activation = &#39;relu&#39;, name = &#39;d2&#39;))
model.add(Dense(5, activation = &#39;softmax&#39;, name = &#39;out&#39;))
model.summary()

The result is as shown below:

Model: &quot;sequential_2&quot;
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
=================================================================
 lstm1 (LSTM)                (None, 37, 64)            48896     
                                                                 
 lstm2 (LSTM)                (None, 37, 128)           98816     
                                                                 
 lstm3 (LSTM)                (None, 64)                49408     
                                                                 
 d1 (Dense)                  (None, 64)                4160      
                                                                 
 d2 (Dense)                  (None, 32)                2080      
                                                                 
 out (Dense)                 (None, 5)                528       
                                                                 
=================================================================
Total params: 203,888
Trainable params: 203,888
Non-trainable params: 0
_________________________________________________________________

Perhaps the problem is not with your input shape, but output? If your image is correct, the model predicts images into ? classes, but only validate to a single class, and thus can not be compared directly. Consider padding the ground truth and prediction result to be the same size first.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

适用于视频分类的正确输入形状，使用图像文件夹。

问题

答案1

运行数千个相同爬虫实例

在PyCharm上使用Jupyter包导入Python模块时出错。

根据字符串和条件来拆分Python Pandas数据框。

Python 电报机器人发送带有按钮的消息

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。