Create a data frame with pairwise comparisons of 2 variables where order doesn't matter in R

Question

I have a set of variables and want to build all pairwise comparisons, dropping rows that compare a variable with itself (e.g., "A" vs "A") and keeping only one row for each pair that differs only in order (i.e., either "A" vs "B" or "B" vs "A", not both).

I have this code that does it in R:

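sp.all.var = LETTERS[1:10]
length(sp.all.var)^2  # number of ordered pairs, self-pairs included

# All ordered pairs (Var1, Var2)
df.pairwise = expand.grid(sp.all.var, sp.all.var)
nrow(df.pairwise)

# Drop self-comparisons such as "A" vs "A"
df.pairwise.sub1 = df.pairwise[df.pairwise$Var1 != df.pairwise$Var2, ]

# Build an order-independent label, e.g. "A-B" for both A/B and B/A
df.pairwise.sub1$compare = apply(df.pairwise.sub1, 1, function(x) paste(sort(x), collapse = "-"))
nrow(df.pairwise.sub1)

# Keep only the first row for each label
df.pairwise.sub2 = df.pairwise.sub1[!duplicated(df.pairwise.sub1$compare), ]
nrow(df.pairwise.sub2)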

Is there a simpler way to do this, such as a built-in function or a package?

Answer 1

Score: 2

You probably want combn, which generates all combinations of the elements of a vector taken m at a time:

combn(LETTERS[1:10], 2, paste, collapse = "-")
#>  [1] "A-B" "A-C" "A-D" "A-E" "A-F" "A-G" "A-H" "A-I" "A-J" "B-C" "B-D" "B-E"
#> [13] "B-F" "B-G" "B-H" "B-I" "B-J" "C-D" "C-E" "C-F" "C-G" "C-H" "C-I" "C-J"
#> [25] "D-E" "D-F" "D-G" "D-H" "D-I" "D-J" "E-F" "E-G" "E-H" "E-I" "E-J" "F-G"
#> [37] "F-H" "F-I" "F-J" "G-H" "G-I" "G-J" "H-I" "H-J" "I-J"
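
As a quick sanity check, here is a minimal sketch comparing the combn labels with the ones produced by the expand.grid approach from the question. The variable names (pairs, n_pairs, labels_combn, labels_grid) are only illustrative, and stringsAsFactors = FALSE is my own choice to keep the columns as plain character vectors:

```r
x <- LETTERS[1:10]

# Without a function argument, combn() returns a matrix with one
# unordered pair per column: 2 rows, choose(n, 2) columns.
pairs <- combn(x, 2)
dim(pairs)

# Number of unordered pairs without self-comparisons: n * (n - 1) / 2
n_pairs <- choose(length(x), 2)

# Labels built directly by combn()
labels_combn <- combn(x, 2, paste, collapse = "-")

# Labels built the long way, as in the question
grid <- expand.grid(Var1 = x, Var2 = x, stringsAsFactors = FALSE)
grid <- grid[grid$Var1 != grid$Var2, ]
labels_grid <- unique(apply(grid, 1, function(p) paste(sort(p), collapse = "-")))

# Both approaches should yield the same set of labels
setequal(labels_combn, labels_grid)
```

Both routes should give the same 45 labels; combn simply never generates the self-pairs and mirrored pairs in the first place.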

Or as a data.frame:

as.data.frame(t(combn(LETTERS[1:10], 2, \(x) c(x, paste(x, collapse = "-")))))
#>    V1 V2  V3
#> 1   A  B A-B
#> 2   A  C A-C
#> 3   A  D A-D
#> 4   A  E A-E
#> 5   A  F A-F
#> 6   A  G A-G
#> 7   A  H A-H
#> 8   A  I A-I
#> 9   A  J A-J
#> 10  B  C B-C
#> 11  B  D B-D
#> 12  B  E B-E
#> 13  B  F B-F
#> 14  B  G B-G
#> 15  B  H B-H
#> 16  B  I B-I
#> 17  B  J B-J
#> 18  C  D C-D
#> 19  C  E C-E
#> 20  C  F C-F
#> 21  C  G C-G
#> 22  C  H C-H
#> 23  C  I C-I
#> 24  C  J C-J
#> 25  D  E D-E
#> 26  D  F D-F
#> 27  D  G D-G
#> 28  D  H D-H
#> 29  D  I D-I
#> 30  D  J D-J
#> 31  E  F E-F
#> 32  E  G E-G
#> 33  E  H E-H
#> 34  E  I E-I
#> 35  E  J E-J
#> 36  F  G F-G
#> 37  F  H F-H
#> 38  F  I F-I
#> 39  F  J F-J
#> 40  G  H G-H
#> 41  G  I G-I
#> 42  G  J G-J
#> 43  H  I H-I
#> 44  H  J H-J
#> 45  I  J I-J
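
If you would rather have named columns, or you are on an R version older than 4.1 where the \(x) shorthand is not available, a sketch along these lines should work too. The column names Var1, Var2 and compare are borrowed from the question, and the intermediate matrix m is just an illustrative name:

```r
x <- LETTERS[1:10]

# 2 x 45 matrix: row 1 holds the first member of each pair, row 2 the second
m <- combn(x, 2)

df.pairwise <- data.frame(
  Var1 = m[1, ],
  Var2 = m[2, ],
  compare = paste(m[1, ], m[2, ], sep = "-"),
  stringsAsFactors = FALSE
)

nrow(df.pairwise)
#> [1] 45
head(df.pairwise, 3)
```

This avoids the transpose and gives you column names that match the data frame in the question.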

Answer 2

Score: 1

You can also do it with rep + sequence:

x <- LETTERS[1:10]
paste0(
    rep(x, (length(x) - 1):0), "-",
    x[sequence((length(x) - 1):0, from = 2:length(x))]
)

which gives

 [1] "A-B" "A-C" "A-D" "A-E" "A-F" "A-G" "A-H" "A-I" "A-J" "B-C" "B-D" "B-E"
[13] "B-F" "B-G" "B-H" "B-I" "B-J" "C-D" "C-E" "C-F" "C-G" "C-H" "C-I" "C-J"
[25] "D-E" "D-F" "D-G" "D-H" "D-I" "D-J" "E-F" "E-G" "E-H" "E-I" "E-J" "F-G"
[37] "F-H" "F-I" "F-J" "G-H" "G-I" "G-J" "H-I" "H-J" "I-J"
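
To see why this works, here is a small sketch on four letters that exposes the intermediate vectors. It is a slight variation on the code above, not the exact same call: I drop the trailing zero count so that the counts and the from argument have the same length. The names counts, left, idx and right are only illustrative, and the from argument of sequence assumes R 4.0.0 or later:

```r
x <- LETTERS[1:4]
n <- length(x)

# Element i is paired with each of the (n - i) elements that follow it
counts <- (n - 1):1                   # 3 2 1

# Left side: repeat each element once per partner
left <- rep(x[-n], counts)            # "A" "A" "A" "B" "B" "C"

# Right side: for element i, the partner indices run from i + 1 up to n
idx <- sequence(counts, from = 2:n)   # 2 3 4 3 4 4
right <- x[idx]                       # "B" "C" "D" "C" "D" "D"

res <- paste0(left, "-", right)
res
#> [1] "A-B" "A-C" "A-D" "B-C" "B-D" "C-D"
```

The pairs come out in the same order as combn, since both walk through the left-hand element first and its partners second.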

huangapple
  • Published on August 5, 2023, 00:34:01
  • When reposting, please keep the link to this article: https://go.coder-hub.com/76837754.html