2023年2月7日 05:02:52go评论83阅读模式

英文:

Is there an R function for collapsing characters into one cell if they have a matching character in another cell?

问题

我有一个包含两列字符的数据框，如下所示：

name	gene
GO:00001	Gene_1
GO:00001	Gene_2
GO:00002	Gene_3
GO:00002	Gene_4
GO:00002	Gene_5

但我需要合并列，使“name”列不重复，并且“gene”列包含与相同“name”匹配的每个基因，用逗号和空格分隔，如下所示：

name	gene
GO:00001	Gene_1, Gene_2
GO:00002	Gene_3, Gene_4, Gene_5

我已经查阅了有关melt、collapse和summarize的文档，但无法弄清楚如何使用字符执行此操作。非常感谢任何帮助！

英文:

I have a dataframe with two columns of characters that looks like this:

name	gene
GO:00001	Gene_1
GO:00001	Gene_2
GO:00002	Gene_3
GO:00002	Gene_4
GO:00002	Gene_5

But I need to collapse the columns so that the "name" column isn't repetitive and the "gene" column contains each gene that matches to the same "name", separated by a comma and a space, like so:

name	gene
GO:00001	Gene_1, Gene_2
GO:00002	Gene_3, Gene_4, Gene_5

I have looked into the documentation for melt, collapse, and summarize, but I can't figure out how to do this with characters. Any help is much appreciated!!

答案1

得分: 0

Using dplyr:

> df %>%
    group_by(name) %>%
    summarise(gene = paste0(gene, collapse = ","))
# A tibble: 2 × 2
  name     gene                
  <chr>    <chr>               
1 GO:00001 Gene_1,Gene_2       
2 GO:00002 Gene_3,Gene_4,Gene_5

Using R base:

aggregate(gene ~ name, FUN = paste0, data = df)

英文:

Using dplyr:

&gt; df %&gt;% 
    group_by(name) %&gt;% 
    summarise(gene = paste0(gene, collapse = &quot;,&quot;))
# A tibble: 2 &#215; 2
  name     gene                
  &lt;chr&gt;    &lt;chr&gt;               
1 GO:00001 Gene_1,Gene_2       
2 GO:00002 Gene_3,Gene_4,Gene_5

Using R base

aggregate(gene ~ name, FUN= paste0, data=df)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Is there an R function for collapsing characters into one cell if they have a matching character in another cell?

问题

答案1

选择行和列

在R中，我可以将参数的参数从一个变量传递（仅在该变量存在时）吗？

predict.lme 无法解释由变量定义的公式

如何从磁盘加载reactiveValues而不破坏观察者？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。