问题

I have a numpy ndarray with a shape (16699, 128, 128), where each element is an image of 128 by 128 pixels, each image normalized to a range of 0 to 1.
现在，要将图像放入神经网络模型中，我必须取数组的每个元素，将其转换为张量，并使用.unsqueeze(0)添加一个额外维度，将其格式变为(C, W, H)。
因此，我想使用PyTorch提供的dataloader和dataset方法来简化所有这些，以便使用批处理等。我该如何做？

This is the method I have now:
这是我目前的方法:

epochs = 3

for epoch in range(epochs):
    for i in range(X):
        y = torch.from_numpy(y[i])
        x = torch.from_numpy(X[i]).unsqueeze(0)
        ...

英文:

I have a numpy ndarray with a shape (16699, 128, 128), where each element is an image of 128 by 128 pixels, each image normalized to a range of 0 to 1.
Now, to put the image into a neural network model, I have to take each element of the array, convert it to a tensor, and add one extra-dimension with .unsqueeze(0) to it to bring it to the format (C, W, H).
So I'd like to simplify all this with the dataloader and dataset methods that PyTorch has to use batches and etc. How I can do it?

This is the method I have now:

epochs = 3

for epoch in range(epochs):
    for i in range(X):
        y = torch.from_numpy(y[i])
        x = torch.from_numpy(X[i]).unsqueeze(0)
        ...

答案1

得分: 1

One way is to convert X and y to two tensors (both with the same length), then wrap them in a torch.utils.data.TensorDataset.

from torch.utils.data import TensorDataset, DataLoader

batch_size = 128
dataset = TensorDataset(torch.from_numpy(X).unsqueeze(1), torch.from_numpy(y))
loader = DataLoader(dataset, shuffle=True, batch_size=batch_size)

...

# training loop
for epoch in range(epochs):
    for x, y in loader:
        # x is a tensor batch of images with shape (batch_size, 1, H, W)
        # y is a tensor with the corresponding labels
        ...

英文:

One way is to convert X and y to two tensors (both with the same length), then wrap them in a torch.utils.data.TensorDataset.

from torch.utils.data import TensorDataset, DataLoader

batch_size = 128
dataset = TensorDataset(torch.from_numpy(X).unsqueeze(1), torch.from_numpy(y))
loader = DataLoader(dataset, shuffle=True, batch_size=batch_size)

...

# training loop
for epoch in range(epochs):
    for x, y in loader:
        # x is a tensor batch of images with shape (batch_size, 1, H, W)
        # y is a tensor with the corresponding labels
        ...


</details>

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何将一个NumPy的ndarray转换成PyTorch的数据集？

问题

答案1

python3.11的StrEnum的MRO在str和repr方面有何不同？

Pandas列值排列

如何按特定索引检查，删除列表中的重复列表？

如何在重力粒子模拟中考虑静止在地面上的粒子？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论