Merging two data frames in R that do not have common columns

Question

I have been trying to merge two data frames in R that do not share any columns. The output I get has an extra set of rows and duplicated values. My attempt is below; edits to the code are welcome.

  df1 <- data.frame(A = c("A", "B", "C", "D", "E"),
                    B = c(7, 9, 8, 3, 2),
                    C = c(3, 5, 2, 9, 9))
  df2 <- data.frame(D = c(1, 3, 3, 4, 5),
                    E = c(7, 7, 8, 3, 2))
  df3 <- merge(x = df1, y = df2, by = NULL)

This runs, but the output now has 25 observations of 5 variables: there is an extra set of rows that looks like duplication. (Disclaimer: this is not the entire dataset, just a subset of it.)

    A B C D E
  1 A 7 3 1 7
  2 B 9 5 1 7
  3 C 8 2 1 7
  4 D 3 9 1 7
  5 E 2 9 1 7
  6 A 7 3 3 7
  7 B 9 5 3 7
  8 C 8 2 3 7

My desired output is something like the one below (5 observations of 5 variables), where I do not get an additional set of rows or duplicated values.

    A B C D E
  1 A 7 3 1 7
  2 B 9 5 3 7
  3 C 8 2 3 8
  4 D 3 9 4 3
  5 E 2 9 5 2

Answer 1

Score: 1

As some of the comments already mentioned, you can use cbind() to accomplish that.

  df3 <- cbind(df1, df2)
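
A minimal sketch of why the two calls differ, assuming the df1 and df2 from the question. With by = NULL, merge() has no key columns to match on, so it returns the Cartesian product (every row of df1 paired with every row of df2), whereas cbind() pairs rows by position. The row-count guard is an addition for illustration and is not part of the original answer.

```r
df1 <- data.frame(A = c("A", "B", "C", "D", "E"),
                  B = c(7, 9, 8, 3, 2),
                  C = c(3, 5, 2, 9, 9))
df2 <- data.frame(D = c(1, 3, 3, 4, 5),
                  E = c(7, 7, 8, 3, 2))

# by = NULL means "no key columns", so merge() builds the Cartesian product.
crossed <- merge(x = df1, y = df2, by = NULL)
nrow(crossed)  # equals nrow(df1) * nrow(df2), here 25

# Binding by position only makes sense when both inputs have the same
# number of rows, so check that first (illustrative guard).
stopifnot(nrow(df1) == nrow(df2))
df3 <- cbind(df1, df2)
dim(df3)       # 5 rows, 5 columns
```

Note that cbind() assumes row i of df1 belongs with row i of df2; if the rows are not already in matching order, a key column and a regular merge() would be needed instead.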

Answer 2

Score: 1

As the cbind data frame method is just a wrapper for data.frame(..., check.names = FALSE), you can also use:

  df3 <- data.frame(df1, df2, check.names = FALSE)
  # df3 <- data.frame(df1, df2)  # Alternative
  df3
  #   A B C D E
  # 1 A 7 3 1 7
  # 2 B 9 5 3 7
  # 3 C 8 2 3 8
  # 4 D 3 9 4 3
  # 5 E 2 9 5 2
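
The check.names argument only matters when the two data frames share a column name. A small sketch with hypothetical data frames x and y (not from the question) that both contain a column called id:

```r
# Hypothetical inputs, chosen only to show a column-name clash.
x <- data.frame(id = 1:3, value = c(10, 20, 30))
y <- data.frame(id = 4:6, score = c(0.1, 0.2, 0.3))

# check.names = FALSE (what the cbind data frame method uses)
# keeps the duplicated name as-is.
names(data.frame(x, y, check.names = FALSE))
# "id" "value" "id" "score"

# The default, check.names = TRUE, makes the names unique.
names(data.frame(x, y))
# "id" "value" "id.1" "score"
```

With the question's df1 and df2 there is no name clash, so both forms give the same column names.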

huangapple
  • Published on March 7, 2023, 18:29:00
  • When reposting, please keep the link to this article: https://go.coder-hub.com/75660777.html