May 11, 2023, 03:16:27

MultiIndex names when using pd.concat disappeared

Question

Consider the following DataFrames `df1` and `df2`:

    df1:
    sim_names       Model 1          
    signal_names     my_y1     my_y2
    units               °C       kPa
    (Time, s)                       
    0.0           0.738280  1.478617
    0.1           1.078653  0.486527
    0.2           0.794123  0.604792
    0.3           0.392690  1.072772

    df2:
    Empty DataFrame
    Columns: []
    Index: [0.0, 0.1, 0.2, 0.3]

As you can see, the columns of `df1` have three levels, named `"sim_names"`, `"signal_names"` and `"units"`.

Next, I want to concatenate the two DataFrames, so I run the following command:

    df2 = pd.concat(
        [df1, df2],
        axis="columns",
    )

but what I get is the following:

    df2:
                 Model 1          
                  my_y1     my_y2
                     °C       kPa
    (Time, s)                    
    0.0        0.738280  1.478617
    0.1        1.078653  0.486527
    0.2        0.794123  0.604792
    0.3        0.392690  1.072772

As you can see, the level names are gone.

What should I do to keep the level names of `df1` in the resulting `df2`?

The `df2` I want should look like this:

    df2:
    sim_names       Model 1          
    signal_names     my_y1     my_y2
    units               °C       kPa
    (Time, s)                       
    0.0           0.738280  1.478617
    0.1           1.078653  0.486527
    0.2           0.794123  0.604792
    0.3           0.392690  1.072772

I tried passing `names=["sim_names", "signal_names", "units"]` as an argument to `pd.concat`, but I got the same wrong result as above.

Answer 1

Score: 1

I'm not sure, but this seems to be the normal behaviour (see GH13475).

As a workaround, you can restore the names with `rename_axis` (or by assigning to `columns.names`):

    out = pd.concat(
        [df1, df2],
        axis="columns",
    ).rename_axis(df1.columns.names, axis=1)  # <- added chain

Output:

    print(out)
    sim_names    Model 1      
    signal_names   my_y1 my_y2
    units             °C   kPa
    (Time, s)                 
    0.00            0.74  1.48
    0.10            1.08  0.49
    0.20            0.79  0.60
    0.30            0.39  1.07
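
For anyone who wants to try this end to end, here is a minimal, self-contained sketch of the workaround. The way `df1` and `df2` are constructed is an assumption reconstructed from the printed frames in the question (in particular, the index name is taken to be the plain string `"(Time, s)"`), and `out_alt` is just an illustrative name for the second variant. Whether the names are actually dropped by `pd.concat` may depend on your pandas version, so the sketch prints them rather than assuming a result.

```python
import pandas as pd

# Assumed reconstruction of df1 from the question: three named column levels.
columns = pd.MultiIndex.from_tuples(
    [("Model 1", "my_y1", "°C"), ("Model 1", "my_y2", "kPa")],
    names=["sim_names", "signal_names", "units"],
)
# Assumption: the index name is a plain string.
index = pd.Index([0.0, 0.1, 0.2, 0.3], name="(Time, s)")

df1 = pd.DataFrame(
    [
        [0.738280, 1.478617],
        [1.078653, 0.486527],
        [0.794123, 0.604792],
        [0.392690, 1.072772],
    ],
    index=index,
    columns=columns,
)

# Empty frame sharing the same index, as in the question.
df2 = pd.DataFrame(index=index)

# Inspect what a plain concat does to the column level names.
plain = pd.concat([df1, df2], axis="columns")
print(list(plain.columns.names))

# Workaround from the answer: put the names back with rename_axis.
out = pd.concat([df1, df2], axis="columns").rename_axis(
    df1.columns.names, axis=1
)

# Equivalent variant: assign the names on the resulting columns directly.
out_alt = pd.concat([df1, df2], axis="columns")
out_alt.columns.names = df1.columns.names

print(list(out.columns.names))
print(list(out_alt.columns.names))
```

Both variants only relabel the column levels; the data and the row index are left as `pd.concat` produced them.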
