2023年4月4日 11:14:02go评论83阅读模式

英文:

How can I update parameters of my model with optimiser in pytorch?

问题

感谢您的关注。我只会提供代码的翻译，以下是您要翻译的内容：

for k in range(100):
  c_train = cluster.forward(context)
  loss_Cluster = cluster_L(train_num, args['lambda_c'], scalar_f, c_train)
  optimiser_c.zero_grad()
  loss_Cluster.backward()
  grad_norm = th.nn.utils.clip_grad_norm_(c_param, 10)
  optimiser_c.step()

一些配置和cluster模型：

args = {'c_emb': 16, 'use_cuda': False, 'ally_type': [1, 0, 0, 0, 0, 0, 0, 0, 0],
        'enemy_type': [1, 0, 0, 0, 0, 0, 0, 0, 0], 'n_classes': 5, 'lambda_c': 2}
cluster = CLUSTER_L(args)
context = th.tensor(args['ally_type']+args['enemy_type'], dtype=th.float)
train_num = args['n_classes']*args['lambda_c']
scalar_f = (args['n_classes']*(train_num-1))/(args['lambda_c']*args['lambda_c'])
c_param = list(cluster.parameters())
optimiser_c = RMSprop(c_param, lr=0.1)

import torch.nn as nn
import torch.nn.functional as F
import torch
import numpy as np
class CLUSTER_L(nn.Module):
    def __init__(self, args):
        super(CLUSTER_L, self).__init__()
        self.args = args
        self.embed_dim = self.args['c_emb']
        if self.args['use_cuda']:
            torch.cuda.set_device(torch.device('cuda:0'))
        self.input_size = len(self.args['ally_type']) + len(self.args['enemy_type'])
        self.n = self.args['n_classes']
        self.lbd = self.args['lambda_c']
        self.cl = nn.Sequential(nn.Linear(self.input_size, self.embed_dim),
                               nn.ReLU(),
                               nn.Linear(self.embed_dim, self.embed_dim),
                               nn.ReLU(),
                               nn.Linear(self.embed_dim, self.n*self.lbd*self.input_size)
                               )
        self.af = nn.ReLU()
    def forward(self, inputs):
        d_outputs = self.cl(inputs).view(self.n*self.lbd, self.input_size)
        outputs = ((self.af(d_outputs)+0.1)*10).round()
        outputs2 = outputs*inputs
        return outputs2

我尝试将损失函数更改为nn.functional中的一些标准函数，例如cross_entropy，但问题仍然存在。能否有人告诉我如何更新模型(cluster)的参数？我感激您的帮助。

英文:

Thanks for your attention. I just can't update the parameters of my model. It stays the same although I backward my loss and step the optimiser.
The codes are below.

for k in range(100):
  c_train = cluster.forward(context)
  # pdb.set_trace() # debug here
  loss_Cluster = cluster_L(train_num, args[&#39;lambda_c&#39;], scalar_f, c_train)
  #loss_Cluster = F.cross_entropy(c_train, test) # just for testing, proving that loss_Cluster is right
  optimiser_c.zero_grad()
  loss_Cluster.backward()
  grad_norm = th.nn.utils.clip_grad_norm_(c_param, 10)
  # for para in cluster.parameters():
  #  print(para)
  optimiser_c.step()

some configs and the cluster model:

args = {&#39;c_emb&#39;:16, &#39;use_cuda&#39;:False, &#39;ally_type&#39;:[1,0,0,0,0,0,0,0,0],
        &#39;enemy_type&#39;:[1,0,0,0,0,0,0,0,0], &#39;n_classes&#39;:5, &#39;lambda_c&#39;:2}
cluster = CLUSTER_L(args)
context = th.tensor(args[&#39;ally_type&#39;]+args[&#39;enemy_type&#39;], dtype=th.float)
train_num = args[&#39;n_classes&#39;]*args[&#39;lambda_c&#39;]
scalar_f = (args[&#39;n_classes&#39;]*(train_num-1))/(args[&#39;lambda_c&#39;]*args[&#39;lambda_c&#39;]) 
c_param = list(cluster.parameters())
optimiser_c = RMSprop(c_param, lr = 0.1)

import torch.nn as nn
import torch.nn.functional as F
import torch
import numpy as np
class CLUSTER_L(nn.Module):
    def __init__(self, args):
        super(CLUSTER_L, self).__init__()
        self.args = args
        self.embed_dim = self.args[&#39;c_emb&#39;]
        if self.args[&#39;use_cuda&#39;]:
            torch.cuda.set_device(torch.device(&#39;cuda:0&#39;))
        self.input_size = len(self.args[&#39;ally_type&#39;])+len(self.args[&#39;enemy_type&#39;])
        self.n = self.args[&#39;n_classes&#39;]
        self.lbd = self.args[&#39;lambda_c&#39;]
        # the input of cl should be 1*2T , output should be 1*(2T*self.n*self.lbd)
        
        
        self.cl = nn.Sequential(nn.Linear(self.input_size, self.embed_dim),
                               nn.ReLU(),
                               nn.Linear(self.embed_dim, self.embed_dim),
                               nn.ReLU(),
                               nn.Linear(self.embed_dim, self.n*self.lbd*self.input_size)
                               )
        self.af = nn.ReLU()
        
        
        
        
    # inputs should be 1*2T size    
    def forward(self, inputs):
        d_outputs = self.cl(inputs).view(self.n*self.lbd, self.input_size)
        outputs = ((self.af(d_outputs)+0.1)*10).round() # make sure the outputs are positive and &gt;=1
        print(outputs)
        outputs2 = outputs*inputs # mask and output
        return outputs2

I've tried changing the loss function to some standard functions in nn.functional such as cross_entropy but the issue remains the same.

Could anyone tell me how can I update the parameters of the model(cluster)? I appreciate your help.

答案1

得分: 0

只移除 round() 并修改视图以保留批量维度。

英文:

Just remove the round() and modify the view to keep the batch dim.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

你可以如何在PyTorch中使用优化器来更新模型的参数？

问题

答案1

Pytorch 2.0.1+cu118 不兼容 Cuda 12.1。

释放PyTorch中的CPU内存（用于大规模推理）。

为什么我的神经网络无法学习 XOR 问题？

Golang的…语法真的只是可变参数吗？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。