2023年6月16日 06:02:05go评论161阅读模式

英文:

Group rows and add columns (delete repeated lines)

问题

如何分组行并添加新列。

看例子：

import pandas as pd

df = pd.DataFrame({
    'name': ['Andy', 'Bob', 'Chad', 'Andy', 'Chad', 'Bob', 'George', 'Hank'],
    'col_1': ['A1', 'A2', 'A3', 'A4', 'B1', 'B2', 'B3', 'B4'],
    'col_2': [1, 1, 2, 2, 1, 1, 2, 2]
    })

df.groupby(by="name")
df

这生成以下结果：

    name	col_1	col_2

0 Andy A1 1
1 Bob A2 1
2 Chad A3 2
3 Andy A4 2
4 Chad B1 1
5 Bob B2 1
6 George B3 2
7 Hank B4 2

但我需要它看起来像这样：

name col_1 col_2 col_1 col_2
0 Andy A1 1 A4 2
1 Bob A2 1 B2 1
2 Chad A3 2 B1 1
3 George B3 2
4 Hank B4 2

谢谢

英文:

How to group rows and add new columns.

See the example:

import pandas as pd

df = pd.DataFrame({
    &#39;name&#39;: [&#39;Andy&#39;, &#39;Bob&#39;, &#39;Chad&#39;, &#39;Andy&#39;, &#39;Chad&#39;, &#39;Bob&#39;, &#39;George&#39;, &#39;Hank&#39;],
    &#39;col_1&#39;: [&#39;A1&#39;, &#39;A2&#39;, &#39;A3&#39;, &#39;A4&#39;, &#39;B1&#39;, &#39;B2&#39;, &#39;B3&#39;, &#39;B4&#39;],
    &#39;col_2&#39;: [1, 1, 2, 2, 1, 1, 2, 2]
    })

df.groupby(by=&quot;name&quot;)
df

This generates the following result:

        name	col_1	col_2
   0	Andy	A1	    1
   1	Bob	    A2	    1
   2	Chad	A3	    2
   3	Andy	A4	    2
   4	Chad	B1	    1
   5	Bob	    B2	    1
   6	George	B3	    2
   7	Hank	B4	    2

But I need it to look like this:

  name    col_1   col_2    col_1   col_2
0 Andy    A1      1        A4      2
1 Bob     A2      1        B2      1
2 Chad    A3      2        B1      1 
3 George  B3      2        
4 Hank    B4      2

Thanks

答案1

得分: 1

尝试：

df['col'] = df.groupby('name').cumcount()
out = df.pivot(index='name', columns='col').swaplevel(axis=1).sort_index(axis=1).fillna('')
out.columns = (f'{b}_{a}' for a, b in out.columns)

print(out)

输出：

       col_0_1  col_0_2 col_1_1 col_1_2
name                                   
Andy        A1      1.0      A4     2.0
Bob         A2      1.0      B2     1.0
Chad        A3      2.0      B1     1.0
George      B3      2.0                
Hank        B4      2.0

英文:

Try:

df[&#39;col&#39;] = df.groupby(&#39;name&#39;).cumcount()
out = df.pivot(index=&#39;name&#39;, columns=&#39;col&#39;).swaplevel(axis=1).sort_index(axis=1).fillna(&#39;&#39;)
out.columns = (f&#39;{b}_{a}&#39; for a, b in out.columns)

print(out)

Prints:

       col_1_0  col_2_0 col_1_1 col_2_1
name                                   
Andy        A1      1.0      A4     2.0
Bob         A2      1.0      B2     1.0
Chad        A3      2.0      B1     1.0
George      B3      2.0                
Hank        B4      2.0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

分组行并添加列（删除重复行）

问题

答案1

converting the last word of a file into uppercase and writing the new content into a new file in Python

Create table instance not connected to a document with python-docx?

计算列表中第一个连续重复部分中有多少个 “1”。

如何在Python中进行动态计算

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论