Posted January 9, 2023, 18:59:54

Explode raises ValueError: columns must have matching element counts

Question

I have the following dataframe:

import pandas as pd

list1 = [1, 6, 7, [46, 56, 49], 45, [15, 10, 12]]
list2 = [[49, 57, 45], 3, 7, 8, [16, 19, 12], 41]
data = {'A': list1,
        'B': list2}
data = pd.DataFrame(data)

I can explode the dataframe using this piece of code:

data.explode('A').explode('B')

But when I run the following, which I expected to perform the same operation, a ValueError is raised:

data.explode(['A', 'B'])

ValueError                                Traceback (most recent call last)
<ipython-input-97-efafc6c7cbfa> in <module>
      5         'B': list2}
      6 data = pd.DataFrame(data)
----> 7 data.explode(['A', 'B'])
~\AppData\Roaming\Python\Python38\site-packages\pandas\core\frame.py in explode(self, column, ignore_index)
   9033             for c in columns[1:]:
   9034                 if not all(counts0 == self[c].apply(mylen)):
-> 9035                     raise ValueError("columns must have matching element counts")
   9036             result = DataFrame({c: df[c].explode() for c in columns})
   9037         result = df.drop(columns, axis=1).join(result)
ValueError: columns must have matching element counts

Can anyone explain why?

Answer 1

Score: 1

df.explode(["A", "B"]) and df.explode("A").explode("B") do not do the same thing. It seems you are aiming to get all the combinations, whereas the multi-column explode addresses a different scenario, one where your columns hold paired lists of matching length. You can see the rationale in the original GitHub feature request. This behavior seems to have been chosen to avoid duplicating values in one of the columns.
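To make the difference concrete, here is a minimal sketch on a small made-up frame (the name `paired` and its values are illustrative, not from the question). It assumes pandas 1.3 or later, where `explode` accepts a list of columns.

```python
import pandas as pd

# Illustrative frame: each row holds lists of equal length in both columns.
paired = pd.DataFrame({
    "A": [[1, 2], [3]],
    "B": [["x", "y"], ["z"]],
})

# Multi-column explode walks the lists in parallel, element by element.
zipped = paired.explode(["A", "B"])

# Chained explode expands A first, then B, giving every A/B combination per row.
crossed = paired.explode("A").explode("B")

print(zipped)   # 3 rows: (1, x), (2, y), (3, z)
print(crossed)  # 5 rows: (1, x), (1, y), (2, x), (2, y), (3, z)
```

The parallel form only makes sense when each row's lists line up, which is why it refuses rows whose element counts differ.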

In the feature request there is a link to a GitHub gist/notebook that explores how explode could be implemented, but its authors do not seem to have found a way to explode lists of mismatched lengths in parallel.
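If it helps to see which rows trip the check, the following sketch counts elements per cell on the question's data. The helper `element_count` is hypothetical and only approximates the idea behind the check in the traceback. It is not pandas' internal function, and it assumes a scalar counts as one element.

```python
import pandas as pd
from pandas.api.types import is_list_like

list1 = [1, 6, 7, [46, 56, 49], 45, [15, 10, 12]]
list2 = [[49, 57, 45], 3, 7, 8, [16, 19, 12], 41]
data = pd.DataFrame({"A": list1, "B": list2})

def element_count(value):
    # Hypothetical helper: list-likes count their items, scalars count as 1.
    return len(value) if is_list_like(value) else 1

# Per-cell element counts for both columns.
counts = data[["A", "B"]].apply(lambda col: col.map(element_count))

# Rows where A and B disagree cannot be exploded in parallel.
mismatched = counts[counts["A"] != counts["B"]]
print(mismatched)
```

On this data, rows 0, 3, 4 and 5 pair a scalar with a three-element list, so the counts differ and the multi-column explode raises.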

Answer 2

Score: 1

Try this if it works in your case.

import numpy as np
data = pd.DataFrame({'A': np.hstack(list1), 'B': np.hstack(list2)})
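A self-contained version of this suggestion, as a sketch, shows what it actually produces. Note the assumption it relies on: both lists must flatten to the same total length, which happens to hold for the question's data (10 values each). The names `flat_a`, `flat_b` and `flat` are introduced here for illustration.

```python
import numpy as np
import pandas as pd

list1 = [1, 6, 7, [46, 56, 49], 45, [15, 10, 12]]
list2 = [[49, 57, 45], 3, 7, 8, [16, 19, 12], 41]

# np.hstack flattens the mix of scalars and lists into one 1-D array each.
flat_a = np.hstack(list1)
flat_b = np.hstack(list2)

# The DataFrame constructor needs equal lengths; here both arrays have 10 values.
flat = pd.DataFrame({"A": flat_a, "B": flat_b})
print(flat)
```

This is not equivalent to either form of `explode`: values are paired by their position after flattening, so the original row relationships and index are lost, and the result has 10 rows instead of one row per A/B combination.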
