April 6, 2023 21:18:58

How can I remove all duplicated elements from a list, but update the first entry's value in another column if it is not the same?

Question

I have a made-up DataFrame like this:

import pandas as pd

df = pd.DataFrame({
    'brand': ['Yum Yum', 'ByRice', 'LuxSoba', 'Indomie', 'Indomie'],
    'style': ['cup', 'cup', 'cup', 'pack', 'pack'],
    'flavour': ['chili', 'chicken', 'chili', 'beef', 'cheese']
})
df
     brand style  flavour
0  Yum Yum   cup    chili
1   ByRice   cup  chicken
2  LuxSoba   cup    chili
3  Indomie  pack     beef
4  Indomie  pack   cheese

My goal is to change the DataFrame so that all duplicate brand entries are removed, but if a brand has several flavours, they are all combined into the flavour column of the first entry. So the DataFrame should look like this:

     brand style       flavour
0  Yum Yum   cup         chili
1   ByRice   cup       chicken
2  LuxSoba   cup         chili
3  Indomie  pack  beef, cheese

I'm not sure how to approach this problem.

Answer 1

Score: 1

You can do this:

df = pd.DataFrame({
    'brand': ['Yum Yum', 'ByRice', 'LuxSoba', 'Indomie', 'Indomie'],
    'style': ['cup', 'cup', 'cup', 'pack', 'pack'],
    'flavour': ['chili', 'chicken', 'chili', 'beef', 'cheese']
})
df2 = df.groupby(['brand', 'style'])['flavour'].agg(lambda x: ', '.join(x)).reset_index()

Result:
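
A minimal, self-contained sketch of what this call produces, assuming pandas is installed. Because groupby sorts the group keys by default (sort=True), the rows come out in alphabetical brand order rather than in their original order, which differs slightly from the layout requested in the question:

```python
import pandas as pd

df = pd.DataFrame({
    'brand': ['Yum Yum', 'ByRice', 'LuxSoba', 'Indomie', 'Indomie'],
    'style': ['cup', 'cup', 'cup', 'pack', 'pack'],
    'flavour': ['chili', 'chicken', 'chili', 'beef', 'cheese'],
})

# Join the flavours of each (brand, style) group into one string.
# groupby sorts the group keys by default, so brands appear alphabetically.
df2 = df.groupby(['brand', 'style'])['flavour'].agg(', '.join).reset_index()
print(df2)
#      brand style       flavour
# 0   ByRice   cup       chicken
# 1  Indomie  pack  beef, cheese
# 2  LuxSoba   cup         chili
# 3  Yum Yum   cup         chili
```

Passing sort=False to groupby keeps the groups in order of first appearance instead.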

Answer 2

Score: 0

You can use groupby with agg:

>>> df.groupby(['brand', 'style'], sort=False, as_index=False)['flavour'].agg(', '.join)
     brand style       flavour
0  Yum Yum   cup         chili
1   ByRice   cup       chicken
2  LuxSoba   cup         chili
3  Indomie  pack  beef, cheese
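
The question's title asks to merge values only when they differ. A plain ', '.join would repeat a flavour if the same brand listed it twice. One possible variation, sketched here with an extra repeated row that is hypothetical and not part of the original data, drops repeats within each group before joining. The helper name join_unique is made up for illustration:

```python
import pandas as pd

# Data from the question plus one hypothetical repeated row
# ('Indomie', 'pack', 'beef') to show the de-duplication.
df = pd.DataFrame({
    'brand': ['Yum Yum', 'ByRice', 'LuxSoba', 'Indomie', 'Indomie', 'Indomie'],
    'style': ['cup', 'cup', 'cup', 'pack', 'pack', 'pack'],
    'flavour': ['chili', 'chicken', 'chili', 'beef', 'cheese', 'beef'],
})

def join_unique(values):
    # dict.fromkeys drops repeats while keeping first-seen order.
    return ', '.join(dict.fromkeys(values))

out = (
    df.groupby(['brand', 'style'], sort=False, as_index=False)['flavour']
      .agg(join_unique)
)
print(out)
# The Indomie row's flavour is 'beef, cheese', not 'beef, cheese, beef'.
```

This assumes every flavour value is a string; missing values would need to be dropped or converted first.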
