July 3, 2023, 16:57:07

I'm trying to append the repeated columns in a DataFrame to a new DataFrame

Question

Input

a  b   c    a   b   c    a  b   c
1  12  123  4   23  567  7  12  456
2  23  456  45  56  876  4  34  234
3  34  789  47  67  345  6  76  769

Desired output

a   b   c
1   12  123
2   23  456
3   34  789
4   23  567
45  56  876
47  67  345
7   12  456
4   34  234
6   76  769

I'm trying to solve this in Python. My attempt so far:

for i in range(0,len(supplier.columns),len(col_list)):
    dfnew = supplier.iloc[i : i + len(col_list)]

Answer 1

Score: 2

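You can number each occurrence of a repeated column name with groupby.cumcount, use that counter as the second level of a MultiIndex on the columns, then stack that level into the rows:

import pandas as pd

# 0 for the first a/b/c, 1 for the second, and so on
n = df.columns.to_series().groupby(level=0).cumcount()

out = (df.set_axis(pd.MultiIndex.from_arrays([df.columns, n]), axis=1)
         .stack()               # move the counter level from columns to rows
         .sort_index(level=1)   # order rows by counter, then by original row
      )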

Output:

      a   b    c
0 0   1  12  123
1 0   2  23  456
2 0   3  34  789
0 1   4  23  567
1 1  45  56  876
2 1  47  67  345
0 2   7  12  456
1 2   4  34  234
2 2   6  76  769
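
For anyone who wants to run the stack approach end to end, here is a minimal self-contained sketch. The frame is rebuilt from the sample input in the question; the names `occurrence` and `flat` are only illustrative, and the final `reset_index(drop=True)` is an optional extra step (not part of the answer) for when a plain 0..n-1 index is preferred.

```python
import pandas as pd

# Sample input from the question: the columns a, b, c repeat three times.
df = pd.DataFrame(
    [[1, 12, 123, 4, 23, 567, 7, 12, 456],
     [2, 23, 456, 45, 56, 876, 4, 34, 234],
     [3, 34, 789, 47, 67, 345, 6, 76, 769]],
    columns=list("abc") * 3,
)

# 0 for the first occurrence of each name, 1 for the second, and so on.
occurrence = df.columns.to_series().groupby(level=0).cumcount().to_numpy()

# Columns become (name, occurrence) pairs, which are unique.
columns = pd.MultiIndex.from_arrays([df.columns, occurrence])

out = (
    df.set_axis(columns, axis=1)
      .stack()               # move the occurrence level into the row index
      .sort_index(level=1)   # order by occurrence, then by original row
)

# Optional: drop the two-level index in favour of a simple range.
flat = out.reset_index(drop=True)
print(flat)
```

On pandas 2.1 and later, `stack()` may emit a FutureWarning about its new implementation; the values produced here should be the same either way.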

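Or using groupby and concat (groupby(..., axis=1) is deprecated since pandas 2.1, so expect a FutureWarning on recent versions):

# occurrence counter per column: 0 for the first a/b/c, 1 for the second, ...
group = df.columns.to_series().groupby(level=0).cumcount()
# split the columns by counter and stack the pieces on top of each other
out = pd.concat([g for k,g in df.groupby(group, axis=1)], axis=0)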

Output:

    a   b    c
0   1  12  123
1   2  23  456
2   3  34  789
0   4  23  567
1  45  56  876
2  47  67  345
0   7  12  456
1   4  34  234
2   6  76  769
