June 29, 2023, 23:42:59

How to "left_join" multiple columns into one column within a single dataset?

Question

I have a dataframe with three columns. I would like to populate the NAs that are in one column with values in another column, but I do not want to overwrite any data. How can I get the following results?

# Starting Dataframe:
DF$ST_1 <- c(100, NA, 100, 100, 200, 200, NA, NA, NA, NA, 200)
DF$ST_2 <- c(50,  NA,  50,  50,  12,  NA, NA, 50, 50, NA, 12)
DF$ST_3 <- c(5,   NA,   5,   2,   3,   1,  1,  3,  4,  2, 11)

Results I want:

DF$ST <- c(100, NA,  100, 100, 200, 200, 1, 50, 50, 2, 200)

As you can see, I want to keep all the values in ST_1, and when there is an NA, fill it in with ST_2. Then, I want to keep all of the values from that merge, and fill in the remaining NAs with ST_3. There will still be some leftover NAs after all these merges.

Answer 1

Score: 1

library(dplyr)

# coalesce() returns, for each row, the first non-NA value among its arguments
DF %>%
  mutate(ST = coalesce(ST_1, ST_2, ST_3))
   ST_1 ST_2 ST_3  ST
1   100   50    5 100
2    NA   NA   NA  NA
3   100   50    5 100
4   100   50    2 100
5   200   12    3 200
6   200   NA    1 200
7    NA   NA    1   1
8    NA   50    3  50
9    NA   50    4  50
10   NA   NA    2   2
11  200   12   11 200
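
For anyone who would rather not load dplyr, the same first-non-missing rule can be sketched in base R. This is only an illustration: it rebuilds the question's three columns in a fresh data frame (the question never shows how DF was created, so that part is an assumption), and the helper vector ST is a name chosen here.

```r
# Assumption: DF is built from scratch with the question's three columns.
DF <- data.frame(
  ST_1 = c(100, NA, 100, 100, 200, 200, NA, NA, NA, NA, 200),
  ST_2 = c(50,  NA,  50,  50,  12,  NA, NA, 50, 50, NA, 12),
  ST_3 = c(5,   NA,   5,   2,   3,   1,  1,  3,  4,  2, 11)
)

# Start from ST_1 and keep every value it already has.
ST <- DF$ST_1

# Fill only the positions that are still NA, first from ST_2, then from ST_3.
ST[is.na(ST)] <- DF$ST_2[is.na(ST)]
ST[is.na(ST)] <- DF$ST_3[is.na(ST)]

DF$ST <- ST
```

Each assignment touches only the positions that are still NA, so nothing already filled gets overwritten, and row 2 stays NA because all three columns are missing there.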

Answer 2

Score: 0

So you're wanting the max value from each row?

base R:

DF$ST <- apply(DF, 1, max, na.rm = TRUE)
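
Two things are worth checking before relying on a row-wise maximum here. In the question's data the first non-missing value happens to be the largest one in every row, which would not hold in general, and max() with na.rm = TRUE returns -Inf (with a warning) for a row where everything is NA, such as row 2. A hedged sketch of one way to handle both points, assuming DF holds only the question's three columns (the cols and row_max names are made up for this example):

```r
# Assumption: DF is built from scratch with the question's three columns.
DF <- data.frame(
  ST_1 = c(100, NA, 100, 100, 200, 200, NA, NA, NA, NA, 200),
  ST_2 = c(50,  NA,  50,  50,  12,  NA, NA, 50, 50, NA, 12),
  ST_3 = c(5,   NA,   5,   2,   3,   1,  1,  3,  4,  2, 11)
)

# Name the source columns explicitly so other columns in DF are not included.
cols <- c("ST_1", "ST_2", "ST_3")

# Row-wise maximum; an all-NA row produces -Inf plus a warning, silenced here.
row_max <- suppressWarnings(apply(DF[cols], 1, max, na.rm = TRUE))

# Turn the -Inf from all-NA rows back into NA.
row_max[is.infinite(row_max)] <- NA

DF$ST <- row_max
```

This reproduces the requested column for this particular data set only; if a later column could ever hold a larger value than an earlier one, the maximum and the fill order would disagree.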
