July 17, 2023, 15:41:02

IndexError: Index out of range in self while implementing transformer model for translation

Question

I am trying to implement a Transformer model for a translation task, following some YouTube tutorials, but I am getting an index-out-of-range error. The problem seems to be with the input dimensions, but I can't figure out what is wrong.

Here is the code (Google Colab link).

You can find the datasets here.

I tried changing the dimensions, but it didn't help, or I didn't do it correctly. I hope someone can help solve this problem. Thanks!

Answer 1

Score: 0

Here is the code with the suggested modifications:

import torch

# Define unique tags for the special tokens
START_TOKEN = "START"
PADDING_TOKEN = "PAD"
END_TOKEN = "END"

# Modify english_vocabulary to use the unique tags
english_vocabulary = {
    START_TOKEN: 0,
    PADDING_TOKEN: 1,
    END_TOKEN: 2,
    # Add the other entries from your vocabulary here
}

# Modify the forward method in the SentenceEmbedding class
def forward(self, x, start_token, end_token):
    x = self.batch_tokenize(x, start_token, end_token)
    print(torch.max(x))  # Debugging aid: print the largest id in x
    x = self.embedding(x)
    pos = self.position_encoder().to(get_device())
    x = self.dropout(x + pos)
    return x

# Add the backslash character (written '\\' in Python source) to english_vocabulary
english_vocabulary['\\'] = len(english_vocabulary)

# The rest of your code remains the same

Please note that these modifications are intended to address the issue you mentioned in your message. Make sure to integrate them into your existing code as needed.

I went through your code. The error trace points to the forward call of SentenceEmbedding, in the encoder stage:

     69 def forward(self, x, start_token, end_token): # sentence
     70     x = self.batch_tokenize(x, start_token, end_token)
---> 71     x = self.embedding(x)
     72     pos = self.position_encoder().to(get_device())
     73     x = self.dropout(x + pos)

If you add print(torch.max(x)) just before the line x = self.embedding(x), you can see that x contains an id that is >= 68. The embedding appears to be sized from len(english_to_index), which is 68, so the valid ids run from 0 to 67, and PyTorch raises the error shown in the stack trace for any id of 68 or more.

This means that an id of 68 or more is being assigned when the tokens are converted to ids.
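The range check behind that debugging print can be sketched without PyTorch. This is only an illustration: check_token_ids and the two sample batches are hypothetical, and the size 68 is taken from the answer. On a real tensor, the same comparison would be between x.max() and the embedding layer's num_embeddings attribute.

```python
def check_token_ids(batch_ids, num_embeddings):
    """Raise ValueError if any id lies outside the range [0, num_embeddings)."""
    flat = [token_id for row in batch_ids for token_id in row]
    smallest, largest = min(flat), max(flat)
    if smallest < 0 or largest >= num_embeddings:
        raise ValueError(
            f"token ids span [{smallest}, {largest}], "
            f"but only 0..{num_embeddings - 1} are valid"
        )
    return largest


# Hypothetical batches of token ids (two sentences each).
ok_batch = [[0, 5, 12, 67], [0, 3, 67, 67]]
bad_batch = [[0, 5, 12, 69], [0, 3, 67, 67]]

print(check_token_ids(ok_batch, num_embeddings=68))

try:
    check_token_ids(bad_batch, num_embeddings=68)
except ValueError as error:
    print(error)
```

Running a check like this right after tokenization gives a readable message instead of the bare IndexError from the embedding lookup.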

To prove my point:

When you create english_to_index, english_vocabulary contains three "" entries (START_TOKEN, PADDING_TOKEN, and END_TOKEN are all ""). They collapse into a single key, so you end up with { "": 69 }, and 69 is larger than any valid index because len(english_to_index) is only 68.
Hence, you get IndexError: index out of range in self.
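The collapse described above can be reproduced on a tiny vocabulary. This is a minimal sketch, assuming the mapping is built by enumerating a list of tokens; the six-entry vocabulary here is a made-up stand-in for the notebook's real one.

```python
# Hypothetical stand-ins: all three special tokens are the empty string.
START_TOKEN = ""
PADDING_TOKEN = ""
END_TOKEN = ""

english_vocabulary = [START_TOKEN, "a", "b", "c", PADDING_TOKEN, END_TOKEN]

# Assumed construction of the mapping: later duplicates overwrite earlier ones.
english_to_index = {token: index for index, token in enumerate(english_vocabulary)}

vocab_size = len(english_to_index)            # number of distinct keys
largest_id = max(english_to_index.values())   # largest id the tokenizer can emit

# 4 distinct keys, but the largest id is 5, so an embedding sized by
# vocab_size cannot look it up.
print(vocab_size, largest_id)
```

The dictionary loses two entries, but the surviving "" key keeps the index of its last occurrence, which is why the largest id ends up beyond the dictionary's length.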

Solution

As a solution, give each of these tokens a unique tag (which is generally recommended):

START_TOKEN = "START"
PADDING_TOKEN = "PAD"
END_TOKEN = "END"

This ensures that the generated dictionaries have the correct sizes.
Please find the working Google Colaboratory file here, with a solution section.

I also added '\\' to english_vocabulary, since otherwise a KeyError: '\\' is raised after a few iterations.
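Both fixes (unique tags and the extra '\\' entry) can be checked up front rather than discovered during training. The sketch below is one possible way to do that, assuming character-level tokenization; build_index, missing_characters, the small vocabulary, and the sample sentences are all hypothetical and not taken from the notebook.

```python
START_TOKEN = "START"
PADDING_TOKEN = "PAD"
END_TOKEN = "END"


def build_index(vocabulary):
    """Map each token to its position, refusing duplicate entries."""
    duplicates = {token for token in vocabulary if vocabulary.count(token) > 1}
    if duplicates:
        raise ValueError(f"duplicate vocabulary entries: {sorted(duplicates)}")
    return {token: index for index, token in enumerate(vocabulary)}


def missing_characters(sentences, token_to_index):
    """Return the characters used in the sentences that have no id yet."""
    return sorted({ch for s in sentences for ch in s if ch not in token_to_index})


# Hypothetical character-level vocabulary and training sentences.
english_vocabulary = [START_TOKEN, " ", "a", "b", "c", PADDING_TOKEN, END_TOKEN]
english_to_index = build_index(english_vocabulary)

sentences = ["a b", "c\\a"]  # the second sentence contains a backslash
missing = missing_characters(sentences, english_to_index)
print(missing)

# Extend the vocabulary with whatever was missing, then rebuild the mapping.
english_vocabulary.extend(missing)
english_to_index = build_index(english_vocabulary)

# With unique entries, the largest id is always len(mapping) - 1.
print(len(english_to_index), max(english_to_index.values()))
```

Scanning the dataset once for unknown characters surfaces every missing entry at the start, instead of one KeyError at a time.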

Hope it helps.
