Question

I have a ResNet50 model that outputs a class prediction (1, 2 or 3). Based on the classifier's output, I want to make a second prediction with a model that is selected by the predicted class.

This is what I have so far.

import torch
from torch import nn

class SimpleModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.model1 = torch.nn.Linear(1, 1, bias=False)
        torch.nn.init.ones_(self.model1.weight)

        self.model2 = torch.nn.Linear(1, 1, bias=False)
        torch.nn.init.ones_(self.model2.weight)

        self.model3 = torch.nn.Linear(1, 1, bias=False)
        torch.nn.init.ones_(self.model3.weight)

    def forward(self, x):
        
        # Get batch_size
        batch_size = x.size(1)
        output = torch.zeros(batch_size, 1, device=x.device)
        
        # Loop over every value in batch
        for i in range(batch_size):
            value = x[:, i]
            if value == 1:
                output[i] = self.model1(value)
            elif value == 2:
                output[i] = self.model2(value)
            else:
                output[i] = self.model3(value)

        return output

model = SimpleModel()

output = model(torch.tensor([[1,2,3]], dtype=torch.float32))
output

My concern is that I am computing only one forward pass on each iteration of the loop, which seems very inefficient. What happens if I increase the batch size to 64? Will the forward passes be computed in parallel?

Any thoughts/ideas would be appreciated.

Answer 1

Score: 1

You can do it as follows. The Python loop in your version handles the samples one after another, so a batch of 64 means 64 small forward calls. Instead of looping over the batch, build one boolean mask per class and call each of the three models once on the samples its mask selects:

def forward(self, x):
    # x has shape (1, batch_size); reshape it to one sample per row
    inputs = x.reshape(-1, 1)
    output = torch.zeros_like(inputs)

    # Compute one 1-D boolean mask per condition, shape (batch_size,)
    value_mask_1 = (x == 1).reshape(-1)
    value_mask_2 = (x == 2).reshape(-1)
    value_mask_3 = (x == 3).reshape(-1)

    # Run each model once on the rows selected by its mask, then write the
    # model's outputs to the matching rows of the output tensor.
    # Rows whose value is not 1, 2 or 3 stay at zero.
    output[value_mask_1] = self.model1(inputs[value_mask_1])
    output[value_mask_2] = self.model2(inputs[value_mask_2])
    output[value_mask_3] = self.model3(inputs[value_mask_3])

    return output
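
For reference, here is a minimal self-contained sketch of the same masking idea that can be run as is. The class name `MaskedDispatch`, the `experts` list and the weights 10, 20 and 30 are hypothetical and not part of the original code. The weights are chosen only so that the routing is visible in the output.

```python
import torch
from torch import nn


class MaskedDispatch(nn.Module):
    """Routes each sample to one sub-model, chosen by the sample's class value."""

    def __init__(self, num_classes: int = 3):
        super().__init__()
        # Hypothetical stand-ins for the real per-class models.
        self.experts = nn.ModuleList(
            [nn.Linear(1, 1, bias=False) for _ in range(num_classes)]
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        inputs = x.reshape(-1, 1)      # (batch_size, 1), one sample per row
        classes = inputs.squeeze(1)    # (batch_size,), class value per sample
        output = torch.zeros_like(inputs)
        # The loop runs once per class (3 iterations), not once per sample.
        for class_id, expert in enumerate(self.experts, start=1):
            mask = classes == class_id
            # One batched forward call over all samples of this class.
            output[mask] = expert(inputs[mask])
        return output


model = MaskedDispatch()
with torch.no_grad():
    # Give each sub-model a distinct weight so the routing can be checked.
    for scale, expert in zip((10.0, 20.0, 30.0), model.experts):
        expert.weight.fill_(scale)

batch = torch.tensor([[1, 2, 3, 1]], dtype=torch.float32)
result = model(batch)
print(result.detach().squeeze(1).tolist())  # [10.0, 40.0, 90.0, 10.0]
```

The number of forward calls here depends on the number of classes rather than on the batch size, so moving to a batch of 64 still means three calls. Samples whose value matches no class keep a zero output, which differs from the `else` branch in the question.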
