问题

PyTorch 提供了懒加载层，例如 torch.nn.LazyLinear。我想要冻结其中的一些层，即确保它们不进行梯度更新。当我尝试使用 .requires_grad_(False) 时，我收到以下错误信息：

ValueError: 尝试在未初始化的参数上使用 <method 'requires_grad_' of 'torch._C._TensorBase' 对象>。当您使用 LazyModule 或明确操作 torch.nn.parameter.UninitializedParameter 对象时，会出现此错误。在使用 LazyModules 时，请先使用虚拟批次调用 forward 以初始化参数，然后再调用 torch 函数

如何冻结懒加载层？

英文:

PyTorch offers lazy layers e.g. torch.nn.LazyLinear. I want to freeze some of these layers in my network i.e. ensure they take no gradient steps. When I try .requires_grad_(False), I receive the error:

ValueError: Attempted to use an uninitialized parameter in <method 'requires_grad_' of 'torch._C._TensorBase' objects>. This error happens when you are using a LazyModuleor explicitly manipulatingtorch.nn.parameter.UninitializedParameterobjects. When using LazyModules Callforward with a dummy batch to initialize the parameters before calling torch functions

How do I freeze lazy layers?

答案1

得分: 1

根据错误信息的建议，你需要对懒惰模块执行一个虚拟推断，以完全初始化其所有参数。

换句话说，考虑到你的正确输入形状 shape：

>>> model(torch.rand(shape))

然后你才能使用 requires_grad_(False) 冻结所需的层。

英文:

As the error message suggests you need to perform a dummy inference on a lazy module in order to fully initialize all of its parameters.

In other words, considering your correct input shape shape:

&gt;&gt;&gt; model(torch.rand(shape))

And only then can you freeze the desired layer with requires_grad_(False).

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何将 requires_grad_ 设置为 false（冻结）PyTorch 惰性层？

问题

答案1

skorch超参数网格搜索在将概率向量用作标签的分类任务中失败（pytorch）

使用Pytorch的nn.Sequential时出现错误：运行时错误：mat1和mat2的形状无法相乘。

PyTorch版本适用于CUDA 12.2。

好的，以下是翻译好的内容：在VScode中查看矩阵和更高维数组的好方法。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论