如何在PyTorch中为CNN模型添加额外的层?

huangapple go评论91阅读模式
英文:

How to add additional layers to CNN model in PyTorch?

问题

我有一个与神经网络相关的问题。在指定模型参数方面,我是一个初学者。我发现了这个关于在PyTorch中构建的DNA序列模型的示例,我想进行改进。在示例中,部署了一个基本的CNN模型,现在我想部署一个更深的模型,具有更多的层次。

基本的CNN模型

这些不是优化过的模型,只是一些初始尝试,用于测试PyTorch在DNA上下文中的使用

class DNA_CNN(nn.Module):
def init(self,
seq_len,
num_filters=32,
kernel_size=3):
super().init()
self.seq_len = seq_len

  1. self.conv_net = nn.Sequential(
  2. # 4代表4个核苷酸
  3. nn.Conv1d(4, num_filters, kernel_size=kernel_size),
  4. nn.ReLU(inplace=True),
  5. nn.Flatten(),
  6. nn.Linear(num_filters*(seq_len-kernel_size+1), 1)
  7. )
  8. def forward(self, xb):
  9. # 重新整形视图为batch_size x 4通道 x seq_len
  10. # 转置以正确排序通道
  11. xb = xb.permute(0,2,1)
  12. #print(xb.shape)
  13. out = self.conv_net(xb)
  14. return out
英文:

I have a question related to neural networks. I am a beginner in terms of specifying model parameters. I found this amazing example about DNA seq model built in PyTorch, which I want to improve. In the example, a basic CNN model was deployed and now I want to deploy a deeper model with more layers.

  1. # basic CNN model
  2. # These aren't optimized models, just something to start with, just testing pytorch with context of DNA
  3. class DNA_CNN(nn.Module):
  4. def __init__(self,
  5. seq_len,
  6. num_filters=32,
  7. kernel_size=3):
  8. super().__init__()
  9. self.seq_len = seq_len
  10. self.conv_net = nn.Sequential(
  11. # 4 is for the 4 nucleotides
  12. nn.Conv1d(4, num_filters, kernel_size=kernel_size),
  13. nn.ReLU(inplace=True),
  14. nn.Flatten(),
  15. nn.Linear(num_filters*(seq_len-kernel_size+1), 1)
  16. )
  17. def forward(self, xb):
  18. # reshape view to batch_size x 4channel x seq_len
  19. # permute to put channel in correct order
  20. xb = xb.permute(0,2,1)
  21. #print(xb.shape)
  22. out = self.conv_net(xb)
  23. return out

答案1

得分: 1

  1. 使用相同的填充来实现模块化代码以保持序列的长度在应用卷积之前在边界添加零):
  2. from typing import List
  3. class DNA_CNN(nn.Module):
  4. def __init__(self,
  5. seq_len: int,
  6. num_filters: List[int] = [32, 64],
  7. kernel_size: int = 3):
  8. super().__init__()
  9. self.seq_len = seq_len
  10. # CNN 模块
  11. self.conv_net = nn.Sequential()
  12. num_filters = [4] + num_filters
  13. for idx in range(len(num_filters) - 1):
  14. self.conv_net.add_module(
  15. f"conv_{idx}",
  16. nn.Conv1d(num_filters[idx], num_filters[idx + 1],
  17. kernel_size=kernel_size, padding='same')
  18. )
  19. self.conv_net.add_module(f"relu_{idx}", nn.ReLU(inplace=True))
  20. self.conv_net.add_module("flatten", nn.Flatten())
  21. self.conv_net.add_module(
  22. "linear",
  23. nn.Linear(num_filters[-1]*seq_len, 1)
  24. )
  25. def forward(self, xb: torch.Tensor):
  26. """前向传播."""
  27. xb = xb.permute(0, 2, 1)
  28. out = self.conv_net(xb)
  29. return out

要更改内核大小,您可以传递一个列表给 kernel_size,然后在卷积中使用 kernel_size=kernel_size[idx]

如果出于某种原因您想要移除填充,您可以在卷积中移除 padding='same',并更改 Linear 定义以匹配新的形状:

  1. nn.Linear(num_filters[-1] * (seq_len - (len(num_filters)-1) * (kernel_size-1), 1)
英文:

Modular code to do so using padding same to keep the length of the sequence (by adding zeros in the borders before applying convolutions):

  1. from typing import List
  2. class DNA_CNN(nn.Module):
  3. def __init__(self,
  4. seq_len: int,
  5. num_filters: List[int] = [32, 64],
  6. kernel_size: int = 3):
  7. super().__init__()
  8. self.seq_len = seq_len
  9. # CNN module
  10. self.conv_net = nn.Sequential()
  11. num_filters = [4] + num_filters
  12. for idx in range(len(num_filters) - 1):
  13. self.conv_net.add_module(
  14. f"conv_{idx}",
  15. nn.Conv1d(num_filters[idx], num_filters[idx + 1],
  16. kernel_size=kernel_size, padding='same')
  17. )
  18. self.conv_net.add_module(f"relu_{idx}", nn.ReLU(inplace=True))
  19. self.conv_net.add_module("flatten", nn.Flatten())
  20. self.conv_net.add_module(
  21. "linear",
  22. nn.Linear(num_filters[-1]*seq_len, 1)
  23. )
  24. def forward(self, xb: torch.Tensor):
  25. """Forward pass."""
  26. xb = xb.permute(0, 2, 1)
  27. out = self.conv_net(xb)
  28. return out

To change the kernel size, you can pass a list to kernel_size and simply use kernel_size=kernel_size[idx] in the convolution.

If for some reasons you want to remove the padding you can remove padding='same' in convolution and change the Linear definition to match the new shape:

  1. nn.Linear(num_filters[-1] * (seq_len - (len(num_filters)-1) * (kernel_size-1), 1)

huangapple
  • 本文由 发表于 2023年7月17日 22:46:39
  • 转载请务必保留本文链接:https://go.coder-hub.com/76705628.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定