2023年6月8日 15:51:07go评论103阅读模式

英文:

How can I add a column with values from a list, iterating in a data frame over factor levels?

问题

I can help you translate the code-related parts into Chinese. Here it is:

给定数据框架 df：

df <- data.frame(1,
                 1:12,
                 groups = factor(rep(LETTERS[1:4], each = 3)))
df

以及列表 list_group：

list_group <- list(
  A = c("Jack","John","Joe"),
  B = c("Ed","Edd","Eddy"),
  C = c("Gianni","Franco","Ugo"),
  D = c("Bob","Rob","Frank"))
list_group

我想要得到以下结果：

如何在数据框架中迭代因子水平并添加来自列表的值的列？

因此，添加列 person，其中包含与 df$groups 水平对应的名称。这需要将行多次复制到因子水平，但要与 list_group 中相应的名称相乘（即 df$groups 水平 "A" = "Jack","John","Joe"; df$groups 水平 "B" = "Ed","Edd","Eddy"; 等等）。

请注意，我只提供了代码的翻译部分，没有包含其他内容。

英文:

I am struggling in performing and iterating the following operation in the R environment.

Given a data frame df:

df &lt;- data.frame(1,
                 1:12,
                 groups = factor(rep(LETTERS[1:4], each = 3)))
df

and the list list_group:

list_group &lt;- list(
  A = c(&quot;Jack&quot;,&quot;John&quot;,&quot;Joe&quot;),
  B = c(&quot;Ed&quot;,&quot;Edd&quot;,&quot;Eddy&quot;),
  C = c(&quot;Gianni&quot;,&quot;Franco&quot;,&quot;Ugo&quot;),
  D = c(&quot;Bob&quot;,&quot;Rob&quot;,&quot;Frank&quot;))
list_group

I would like to get the following result:

如何在数据框架中迭代因子水平并添加来自列表的值的列？

Thus, add the column person with the names in correspondence of the df$groups level. This requires to multiple the rows over the factor levels, but specifically with the corresponding names in the list_group (i.e., df$groups level "A" = "Jack","John","Joe"; df$groups level "B" = "Ed","Edd","Eddy"; etc.)

I went through the function of the Tidyverse, in particular dplyr,tidyr and purrr, but I did not find specific functions for my problem.

Also, I tried to dig into the theory behind Iteration in R, mainly following the guide 21 Iteration, but I did not succeed in that.

答案1

得分: 2

你可以使用 stack + merge：

merge(df, stack(list_group), by.x = "groups", by.y = "ind")

或者使用 dplyr 的等效方法：

library(dplyr)
full_join(df, stack(list_group), by = c("groups" = "ind"), relationship = "many-to-many")

输出：

#    groups X1 X1.12 values
# 1       A  1     1   Jack
# 2       A  1     1   John
# 3       A  1     1    Joe
# 4       A  1     2   Jack
# 5       A  1     2   John
# 6       A  1     2    Joe
# 7       A  1     3   Jack
# 8       A  1     3   John
# 9       A  1     3    Joe
# 10      B  1     4     Ed
# 11      B  1     4    Edd
# 12      B  1     4   Eddy
# 13      B  1     5     Ed
# 14      B  1     5    Edd
# 15      B  1     5   Eddy
# 16      B  1     6     Ed
# 17      B  1     6    Edd
# 18      B  1     6   Eddy
# 19      C  1     7 Gianni
# 20      C  1     7 Franco
# 21      C  1     7    Ugo
# 22      C  1     8 Gianni
# 23      C  1     8 Franco
# 24      C  1     8    Ugo
# 25      C  1     9 Gianni
# 26      C  1     9 Franco
# 27      C  1     9    Ugo
# 28      D  1    10    Bob
# 29      D  1    10    Rob
# 30      D  1    10  Frank
# 31      D  1    11    Bob
# 32      D  1    11    Rob
# 33      D  1    11  Frank
# 34      D  1    12    Bob
# 35      D  1    12    Rob
# 36      D  1    12  Frank

英文:

You can use stack + merge:

merge(df, stack(list_group), by.x = &quot;groups&quot;, by.y = &quot;ind&quot;)

Or the dplyr equivalent:

library(dplyr)
full_join(df, stack(list_group), by = c(&quot;groups&quot; = &quot;ind&quot;), relationship = &quot;many-to-many&quot;)

output

#    groups X1 X1.12 values
# 1       A  1     1   Jack
# 2       A  1     1   John
# 3       A  1     1    Joe
# 4       A  1     2   Jack
# 5       A  1     2   John
# 6       A  1     2    Joe
# 7       A  1     3   Jack
# 8       A  1     3   John
# 9       A  1     3    Joe
# 10      B  1     4     Ed
# 11      B  1     4    Edd
# 12      B  1     4   Eddy
# 13      B  1     5     Ed
# 14      B  1     5    Edd
# 15      B  1     5   Eddy
# 16      B  1     6     Ed
# 17      B  1     6    Edd
# 18      B  1     6   Eddy
# 19      C  1     7 Gianni
# 20      C  1     7 Franco
# 21      C  1     7    Ugo
# 22      C  1     8 Gianni
# 23      C  1     8 Franco
# 24      C  1     8    Ugo
# 25      C  1     9 Gianni
# 26      C  1     9 Franco
# 27      C  1     9    Ugo
# 28      D  1    10    Bob
# 29      D  1    10    Rob
# 30      D  1    10  Frank
# 31      D  1    11    Bob
# 32      D  1    11    Rob
# 33      D  1    11  Frank
# 34      D  1    12    Bob
# 35      D  1    12    Rob
# 36      D  1    12  Frank

答案2

得分: 0

你可以使用基础的stack()函数，然后使用整洁的full_join(multiple = "all")函数：

df %>% full_join(
  y = list_group %>% stack(),
  by = join_by(groups == ind),
  multiple = "all"
)

结果：

   X1 X1.12 groups values
1   1     1      A   Jack
2   1     1      A   John
3   1     1      A    Joe
4   1     2      A   Jack
5   1     2      A   John
6   1     2      A    Joe
7   1     3      A   Jack
8   1     3      A   John
9   1     3      A    Joe
10  1     4      B     Ed
11  1     4      B    Edd
12  1     4      B   Eddy
13  1     5      B     Ed
14  1     5      B    Edd
15  1     5      B   Eddy
16  1     6      B     Ed
17  1     6      B    Edd
18  1     6      B   Eddy
19  1     7      C Gianni
20  1     7      C Franco
21  1     7      C    Ugo
22  1     8      C Gianni
23  1     8      C Franco
24  1     8      C    Ugo
25  1     9      C Gianni
26  1     9      C Franco
27  1     9      C    Ugo
28  1    10      D    Bob
29  1    10      D    Rob
30  1    10      D  Frank
31  1    11      D    Bob
32  1    11      D    Rob
33  1    11      D  Frank
34  1    12      D    Bob
35  1    12      D    Rob
36  1    12      D  Frank

英文:

You can use base stack() and then a tidy full_join(multiple = "all"):

df %&gt;% full_join(
  y = list_group %&gt;% stack(),
  by = join_by(groups == ind),
  multiple = &quot;all&quot;
)

Result:

   X1 X1.12 groups values
1   1     1      A   Jack
2   1     1      A   John
3   1     1      A    Joe
4   1     2      A   Jack
5   1     2      A   John
6   1     2      A    Joe
7   1     3      A   Jack
8   1     3      A   John
9   1     3      A    Joe
10  1     4      B     Ed
11  1     4      B    Edd
12  1     4      B   Eddy
13  1     5      B     Ed
14  1     5      B    Edd
15  1     5      B   Eddy
16  1     6      B     Ed
17  1     6      B    Edd
18  1     6      B   Eddy
19  1     7      C Gianni
20  1     7      C Franco
21  1     7      C    Ugo
22  1     8      C Gianni
23  1     8      C Franco
24  1     8      C    Ugo
25  1     9      C Gianni
26  1     9      C Franco
27  1     9      C    Ugo
28  1    10      D    Bob
29  1    10      D    Rob
30  1    10      D  Frank
31  1    11      D    Bob
32  1    11      D    Rob
33  1    11      D  Frank
34  1    12      D    Bob
35  1    12      D    Rob
36  1    12      D  Frank

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在数据框架中迭代因子水平并添加来自列表的值的列？

问题

答案1

答案2

在Java 8中添加两个列表？

如何在列表中的行中添加单词/字符串？

在数据框的底部添加一行，该行对除了最后一行之外的所有先前行求和。

有没有一个R函数可以找到一行中最高的值，而不与列中的其他名称匹配？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

发表评论