2023年5月17日 19:32:10go评论103阅读模式

英文:

combine rows with the same IDs into the same row

问题

I understand your request. Here's the translated code part without any additional content:

我明白你的请求。以下是翻译好的代码部分，没有任何额外内容：
我有一个包含600行的数据集。数据有一个主要的ID=版本(Version)和第二个ID=任务(Task)。数据如下所示：
我想要更改格式，使得属于同一个版本(Version)的“任务”(Task)在同一行中，如下所示：

请注意，这只是代码的翻译部分，不包含任何其他内容。如果需要更多帮助，请告诉我。

英文:

I have a dataset with 600 rows. the data has one main ID= Version and second ID= Task. Data looks like this:

 Version  Task  Concept  Att 1 -  Att 2 -
       1     1        1        3        2
       1     1        2        1        1
       1     2        1        2        3
       1     2        2        1        2
       1     3        1        2        3
       1     3        2        3        1
       2     1        1        2        1
       2     1        2        3        2
       2     2        1        2        2
       2     2        2        1        3
       2     3        1        3        1
       2     3        2        1        3

I would like to change the format, so to have "Task" which belongs to the same "Version" in the same row like this:

 Version  Task  Concept  Att 1 -  Att 2 -  Version  Task  Concept  Att 1 -  Att 2 -
       1     1        1        3        2        1     1        2        1        1
       1     2        1        2        3        1     2        2        1        2
       1     3        1        2        3        1     3        2        3        1
       2     1        1        2        1        2     1        2        3        2
       2     2        1        2        2        2     2        2        1        3
       2     3        1        3        1        2     3        2        1        3

I have tried different things like groupby, pivot but I cannot find the right solution

答案1

得分: 0

I think a pivot is the clean way to reshape (df.pivot(index=['Version', 'Task'], columns='Concept'), optionally with flattening the columns MultiIndex).

That said if you really want to duplicate the columns, you could combine a groupby and concat:

out = (pd.concat([g.set_index(['Version', 'Task'], drop=False)
                 for k, g in df.groupby('Concept')], axis=1)
         .reset_index(drop=True)
      )

Output:

   Version  Task  Concept  Att 1 -  Att 2 -  Version  Task  Concept  Att 1 -  Att 2 -
0        1     1        1        3        2        1     1        2        1        1
1        1     2        1        2        3        1     2        2        1        2
2        1     3        1        2        3        1     3        2        3        1
3        2     1        1        2        1        2     1        2        3        2
4        2     2        1        2        2        2     2        2        1        3
5        2     3        1        3        1        2     3        2        1        3

英文:

I think a pivot is the clean way to reshape (df.pivot(index=['Version', 'Task'], columns='Concept'), optionally with flattening the columns MultiIndex).

That said if you really want to duplicate the columns, you could combine a groupby and concat:

out = (pd.concat([g.set_index([&#39;Version&#39;, &#39;Task&#39;], drop=False)
                 for k, g in df.groupby(&#39;Concept&#39;)], axis=1)
         .reset_index(drop=True)
      )

Output:

   Version  Task  Concept  Att 1 -  Att 2 -  Version  Task  Concept  Att 1 -  Att 2 -
0        1     1        1        3        2        1     1        2        1        1
1        1     2        1        2        3        1     2        2        1        2
2        1     3        1        2        3        1     3        2        3        1
3        2     1        1        2        1        2     1        2        3        2
4        2     2        1        2        2        2     2        2        1        3
5        2     3        1        3        1        2     3        2        1        3

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

将具有相同ID的行合并为同一行。

问题

答案1

如何在使用NetworkX可视化图时固定节点位置。

如何捕获从Chem.MolFromSmiles(‘Formula’)中的错误消息。

TDD 修改我的测试以使我的代码通过

继承的类在APScheduler作业中使用抽象父类的方法。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。