March 4, 2023 03:04:19

Removing headers and indexes when iterating over groups in pandas

Question

I need to remove the header for each group while iterating.
I need to group by a few columns and get the DataFrame rows for each group.

import pandas as pd
df = pd.read_excel('D:\\Python-pandas-numpy-mat_learn\\panda-learn\\test.xlsx')
# Columns in the DataFrame are: ['col1', 'col2', 'col3', 'col4']
grp = df.groupby(by=['col1', 'col2'])
for each in grp.groups:
    print(df[(df['col1'] == each[0]) & (df['col2'] == each[1])])
# Output is:
   col1  col2  col3   col4
9    32   321    12  5mlds
   col1  col2  col3  col4
0   123    34    44  Row1
1   123    34    66  Row2
   col1  col2  col3 col4
6   214   321  3255  ere
# Desired output:
   col1  col2  col3   col4
    32   321    12  5mlds
   123    34    44  Row1
   123    34    66  Row2
   214   321  3255  ere

I don't want the header (['col1','col2','col3','col4']) repeated for each group, and I don't want the index printed.

Answer 1

Score: 0

Assuming you really need to use a loop, one option:

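grp = df.groupby(by=['col1', 'col2'])
for i, (each, g) in enumerate(grp):
    # maxsplit is 0 for the first group (header kept) and 1 afterwards,
    # so [-1] drops the header line from every later group.
    print(g.to_string(index=False).split('\n', maxsplit=min(i, 1))[-1])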

Output:

col1  col2  col3  col4
   32   321    12 5mlds
  123    34    44 Row1
  123    34    66 Row2
  214   321  3255  ere
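
For anyone who wants to try this without the Excel file, here is a minimal, self-contained sketch of the same idea. It is a variation on the loop above, not the exact code from the answer: it uses the `header` argument of `DataFrame.to_string` rather than splitting the string. The sample DataFrame and the helper name `render_groups` are assumptions made for illustration; the data contains only the four rows shown in the question.

```python
import pandas as pd

# Hypothetical sample data standing in for test.xlsx
# (only the rows visible in the question's output).
df = pd.DataFrame(
    {
        "col1": [123, 123, 214, 32],
        "col2": [34, 34, 321, 321],
        "col3": [44, 66, 3255, 12],
        "col4": ["Row1", "Row2", "ere", "5mlds"],
    },
    index=[0, 1, 6, 9],
)


def render_groups(frame, keys):
    """Return one text block per group; only the first block has a header."""
    blocks = []
    for i, (_, group) in enumerate(frame.groupby(by=keys)):
        # header=(i == 0) prints the column names for the first group only;
        # index=False leaves out the row labels.
        blocks.append(group.to_string(index=False, header=(i == 0)))
    return blocks


blocks = render_groups(df, ["col1", "col2"])
print("\n".join(blocks))
```

Note that `to_string` works out column widths separately for each group, so the columns may not line up perfectly from one group to the next.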
