Keras model.predict is nearly always incorrect on training dataset, even when training it to near 100% accuracy
Question
I am trying to do multiclass classification with Keras for videogame characters. The problem is that even when the training and validation accuracy reach roughly 100% and 70% respectively, when I actually run model.predict() on an image that I literally just trained on, it is classified completely incorrectly. I have around 300 images per class.
I have tried everything I can think of: many ways to load the data initially, changing the model architecture a million times, and changing the preprocessing function that prepares the image model.predict() will be run on.
Here is my code:
```python
# imports used by the code below
import os
import cv2 as cv
import numpy as np
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, BatchNormalization, Flatten, Dense
from sklearn.utils import shuffle
from sklearn.metrics import classification_report

class_names = ['ana', 'ashe', 'baby', 'ball', 'bap', 'bastion', 'brig', 'cass', 'doom', 'dva',
               'echo', 'genji', 'hanzo', 'hog', 'jq', 'junkrat', 'kiriko', 'lucio', 'mei', 'mercy', 'moira', 'orisa',
               'pharah', 'ram', 'reaper', 'rein', 'sigma', 'sojourn', 'soldier', 'sombra', 'sym', 'torb', 'tracer', 'widow',
               'winston', 'zarya', 'zen']
class_names_label = {class_name: i for i, class_name in enumerate(class_names)}
nb_classes = len(class_names)
print(class_names_label)

IMAGE_SIZE = (128, 128)

def load_data():
    DIRECTORY = r'D:\crop-id\enemy-crop'
    CATEGORY = ['train', 'test']
    output = []
    for category in CATEGORY:
        path = os.path.join(DIRECTORY, category)
        print(path)
        images = []
        labels = []
        print('Loading {}'.format(category))

        for folder in os.listdir(path):
            label = class_names_label[folder]
            # iterate through each img in folder
            for file in os.listdir(os.path.join(path, folder)):
                # get path name of img
                img_path = os.path.join(os.path.join(path, folder), file)
                # open and resize img
                image = cv.imread(img_path)
                image = cv.cvtColor(image, cv.COLOR_BGR2RGB)
                image = cv.resize(image, IMAGE_SIZE)
                # append the image and its corresponding label to output
                images.append(image)
                labels.append(label)

        images = np.array(images, dtype='float32')
        labels = np.array(labels, dtype='int32')
        output.append((images, labels))
    return output

(train_images, train_labels), (test_images, test_labels) = load_data()
train_images, train_labels = shuffle(train_images, train_labels, random_state=25)

model = Sequential()
model.add(Conv2D(32, (3, 3), activation='relu', input_shape=(128, 128, 3)))
model.add(MaxPooling2D())
model.add(BatchNormalization())
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D())
model.add(BatchNormalization())
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D())
model.add(BatchNormalization())
model.add(Flatten())
model.add(Dense(256, activation='relu'))
model.add(Dense(37, activation='softmax'))

model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.0001),
              loss='sparse_categorical_crossentropy',
              metrics=['sparse_categorical_accuracy'])
model.summary()

model.fit(train_images, train_labels, batch_size=32, epochs=10, validation_split=.2)
model.save(os.path.join('models', 'plzwork14.h5'))

predictions = model.predict(test_images)
pred_labels = np.argmax(predictions, axis=1)
print(classification_report(test_labels, pred_labels))
```
That is the model; this is the preprocessing I run on the image for model.predict():
```python
def prepare(img):
    img = cv.resize(img, (128, 128))
    img = np.reshape(img, (128, 128, 3))
    img = np.expand_dims(img / 255, 0)
    prediction = model.predict(img)
    prediction = prediction[0]
    print(prediction)
    print(class_names[np.argmax(prediction)])

img1 = cv.imread(r'C:\Users\andrew\Desktop\sombra.jpg')
prepare(img1)
```
Thanks for your help!
Answer 1
Score: 1
In the code that generates your training and test images you have the line `image = cv.cvtColor(image, cv.COLOR_BGR2RGB)`, so your train and test data are RGB images. However, when you make your predictions you do not have this line of code, so the image you are trying to predict on is a BGR image. Just add the conversion from BGR to RGB for the images you want to predict on.
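For illustration, here is a minimal sketch of the question's `prepare()` helper with that conversion added. It assumes `model` and `class_names` are defined as in the question; the imports and the `cv.cvtColor` call are the only additions, the rest is copied from the original function.

```python
import cv2 as cv
import numpy as np

def prepare(img):
    # cv.imread returns BGR; convert to RGB so the input matches the training data
    img = cv.cvtColor(img, cv.COLOR_BGR2RGB)
    img = cv.resize(img, (128, 128))
    img = np.reshape(img, (128, 128, 3))
    img = np.expand_dims(img / 255, 0)
    prediction = model.predict(img)
    prediction = prediction[0]
    print(prediction)
    print(class_names[np.argmax(prediction)])

prepare(cv.imread(r'C:\Users\andrew\Desktop\sombra.jpg'))
```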