问题

I'm running some python code that uses the pytorch Lightning framework. I get the error
> File "/Home/LightningVersion.py", line 45, in init
super().init()
File "/Home/.local/lib/python3.9/site-packages/pytorch_lightning/core/module.py", line 128, in init
self._register_sharded_tensor_state_dict_hooks_if_available()
File "/Home/.local/lib/python3.9/site-packages/pytorch_lightning/core/module.py", line 1570, in _register_sharded_tensor_state_dict_hooks_if_available
from torch.distributed._shard.sharded_tensor import pre_load_state_dict_hook, state_dict_hook
ModuleNotFoundError: No module named 'torch.distributed._shard'

我正在运行一些使用pytorch Lightning框架的Python代码。我遇到了以下错误：

> File "/Home/LightningVersion.py", 第45行，在 init 中
super().init()
File "/Home/.local/lib/python3.9/site-packages/pytorch_lightning/core/module.py", 第128行，在 init 中
self._register_sharded_tensor_state_dict_hooks_if_available()
File "/Home/.local/lib/python3.9/site-packages/pytorch_lightning/core/module.py", 第1570行，在 _register_sharded_tensor_state_dict_hooks_if_available 中
from torch.distributed._shard.sharded_tensor import pre_load_state_dict_hook, state_dict_hook
ModuleNotFoundError: 没有名为 'torch.distributed._shard' 的模块。

I am using CUDA 11.4 and python 3.9.10.

Does anyone know how to fix this?

我正在使用CUDA 11.4和Python 3.9.10。

Does anyone know how to fix this?

有人知道如何修复这个问题吗？

I cannot find anything online that helps, despite searching.

尽管搜索了很多，但我在网上找不到任何有用的信息。

英文:

I am using CUDA 11.4 and python 3.9.10.

Does anyone know how to fix this?

I cannot find anything online that helps, despite searching.

答案1

得分: 3

此模块负责在多个GPU上分片张量，并在PyTorch版本1.8及更高版本中可用。通过运行以下命令来升级您的PyTorch版本到1.8或更高版本：

!pip install torch==1.8.0

英文:

This module is responsible for sharding tensors across multiple GPUs, and it is available in PyTorch versions 1.8 and higher. upgrade your PyTorch version to 1.8 or higher by running

!pip install torch==1.8.0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Getting ModuleNotFoundError: No module named ‘torch.distributed._shard’

问题

答案1

如何在不知道维度的情况下拼接张量？

PyTorch在进行简单的乘法运算时为什么会内存不足？

如何使用PIL将四维张量转换为图像？

如何在PyTorch的ResNET50中更改conv2d.layer4.conv3中的out_channels数量？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论