2023年7月31日 23:52:27go评论1022阅读模式

英文:

OpenAI gym with Grayscling Wrapper return ValueError: too many values to unpack (expected 2)

问题

由于我从gym.wrappers导入了GrayScaleObservation，所以出现了与gym包相关的问题。
我知道问题出在哪里，因为在日志中提到了，但我不知道如何修复它...

这里是导入部分：

# 导入Frame Stacker Wrapper和GrayScale Wrapper
from gym.wrappers import FrameStack, GrayScaleObservation

这是错误信息：

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
/Users/adhuy/code/led8/AI tools/Nicholas Renotte/Reinforcement Learning/mario_tutotial.ipynb Cell 19 in 5
      2 env = JoypadSpace(env, SIMPLE_MOVEMENT)
      3 env = GrayScaleObservation(env, keep_dim=True) 
----> 5 env.reset()

File ~/.pyenv/versions/3.10.6/envs/lewagon/lib/python3.10/site-packages/gym/core.py:379, in ObservationWrapper.reset(self, **kwargs)
    377 def reset(self, **kwargs):
    378     &quot;&quot;&quot;Resets the environment, returning a modified observation using :meth:`self.observation`.&quot;&quot;&quot;
--&gt; 379     obs, info = self.env.reset(**kwargs)
    380     return self.observation(obs), info

ValueError: too many values to unpack (expected 2)

通常情况下，下面的代码应该返回(240, 256, 1)，即数组的形状。实际上，当我在没有GrayScaleObservation(env, keep_dim=True)的情况下运行此代码时，一切都正常工作。

这是我的代码和我尝试满足的要求：

env = gym_super_mario_bros.make('SuperMarioBros-v0')
env = JoypadSpace(env, SIMPLE_MOVEMENT)
env = GrayScaleObservation(env, keep_dim=True) 

state = env.reset()
state.shape

英文:

Got issue with the gym package since I imported GrayScaleObservation from gym.wrappers.
I know where the issue is located as it is mentioned in the logs but I don't know how the fix it ...

Here the imports:

# Import Frame Stacker Wrapper and Grayscling Wrapper
from gym.wrappers import FrameStack, GrayScaleObservation

Here is the Error:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
/Users/adhuy/code/led8/AI tools/Nicholas Renotte/Reinforcement Learning/mario_tutotial.ipynb Cell 19 in 5
      2 env = JoypadSpace(env, SIMPLE_MOVEMENT)
      3 env = GrayScaleObservation(env, keep_dim=True) 
----&gt; 5 env.reset()

File ~/.pyenv/versions/3.10.6/envs/lewagon/lib/python3.10/site-packages/gym/core.py:379, in ObservationWrapper.reset(self, **kwargs)
    377 def reset(self, **kwargs):
    378     &quot;&quot;&quot;Resets the environment, returning a modified observation using :meth:`self.observation`.&quot;&quot;&quot;
--&gt; 379     obs, info = self.env.reset(**kwargs)
    380     return self.observation(obs), info

ValueError: too many values to unpack (expected 2)

Normally the code below should return (240, 256, 1), the shape of the array. Actually when I ran this code without GrayScaleObservation(env, keep_dim=True) , everything works well.

Here is my code and what I try to meet:

env = gym_super_mario_bros.make(&#39;SuperMarioBros-v0&#39;)
env = JoypadSpace(env, SIMPLE_MOVEMENT)
env = GrayScaleObservation(env, keep_dim=True) 

state = env.reset()
state.shape

答案1

得分: 1

替换 core.py 中引起错误的行：

def reset(self, **kwargs):
    obs = self.env.reset(**kwargs)
    return self.observation(obs)

在使用更多包装器的情况下，删除信息可能会导致后续包装器中出现类似的错误。我还使用了 DummyVecEnv 进行进一步处理，这导致我还需要替换 dummy_vec_env.py 中的 reset 方法：

def reset(self) -> VecEnvObs:
    for env_idx in range(self.num_envs):
        obs = self.envs[env_idx].reset(seed=self._seeds[env_idx])
        self._save_obs(env_idx, obs)
    # 种子只使用一次
    self._reset_seeds()
    return self._obs_from_buf()

... 替换这原始的行：

obs, self.reset_infos[env_idx] = self.envs[env_idx].reset(seed=self._seeds[env_idx])

按照相似的方式从一个错误到另一个错误来修复它们应该能解决问题。我不确定删除提供的信息是否会在其他项目中引发问题，但是为了使 gym_super_mario_bros 运行起来，似乎可以工作。

英文:

Working through a similar project myself right now and encountered the same error.

Even though I cannot give more insight to why exactly things are causing errors (hopefully someone else will eventually enlighten us) I can give you the fix that seems to have worked for me and produces the expected results:

Replace the lines causing errors within core.py with:

def reset(self, **kwargs):
        obs = self.env.reset(**kwargs)
        return self.observation(obs)

Deleting info might cause similar errors in following wrappers in case you´re using more. I also used DummyVecEnv for further processing, leading me to replace the reset method within dummy_vec_env.py aswell:

def reset(self) -&gt; VecEnvObs:
        for env_idx in range(self.num_envs):
            obs = self.envs[env_idx].reset(seed=self._seeds[env_idx])
            self._save_obs(env_idx, obs)
        # Seeds are only used once
        self._reset_seeds()
        return self._obs_from_buf()

...replacing this original line:

obs, self.reset_infos[env_idx] = self.envs[env_idx].reset(seed=self._seeds[env_idx])

Going from error to error fixing them in similar ways should do the trick. I´m unsure weather removing the info provided might cause problems in other projects, but for getting gym_super_mario_bros to run it seems to work.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

OpenAI Gym 使用 Grayscale 包装器返回 ValueError：要拆包的值过多（期望 2 个）。

问题

答案1

我想分割一个字符串，然后统计数据框中列的值。

如何在Django中使用具有ManyToManyFields的模型对象填充测试数据库？

如何使用给定的数据框形状创建堆叠条形图？

我的代码为什么出现了TLE，而类似的实现通过了（BFS）？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论