2023年3月4日 05:57:49go评论161阅读模式

英文:

Replacing column values that conditionally match a list of values in R

问题

我尝试在一个数据框中替换数值，当它与一个远小于其大小的第二个数据框中的标识符匹配时。下面是我尝试的一个示例：

df1 = data.frame(row = seq(1,6),
                   x = c("a","b","c","d","e","f"))
df2 = data.frame(row = c(5,3,1,15,10),
                 x2 = c("g","h","i","j","k"))
df3 = df1 %>% mutate(x = case_when(
  df1$row == df2$row ~ df2$x2,
  .default = df1$x
))

我试图实现这个操作，即当 df1$row 与 df2$row 匹配时，用 df2$x2 中的值替换 df1$x，否则保留 df1$x。预期输出如下：

df3
  row x
1   1 i
2   2 b
3   3 h
4   4 d
5   5 g
6   6 f

感谢任何帮助。

英文:

I am trying to replace values in one dataframe when it matches an identifier in a second dataframe of a much smaller size. A toy example of what I've tried:

df1 = data.frame(row = seq(1,6),
                   x = c(&quot;a&quot;,&quot;b&quot;,&quot;c&quot;,&quot;d&quot;,&quot;e&quot;,&quot;f&quot;))
df2 = data.frame(row = c(5,3,1,15,10),
                 x2 = c(&quot;g&quot;,&quot;h&quot;,&quot;i&quot;,&quot;j&quot;,&quot;k&quot;))
df3 = df1 %&gt;% mutate(x = case_when(
  df1$row == df2$row ~ df2$x2,
  .default = df1$x
))

I am attempting this to read, when df1$row matches df2$row, replace df1$x with the value from df2$x2 and otherwise leave df1$x. The expected output:

df3
  row x
1   1 i
2   2 b
3   3 h
4   4 d
5   5 g
6   6 f

Any help appreciated.

答案1

得分: 1

我们可以通过row进行join，然后使用coalesce：

library(dplyr)
df1 %>%
    left_join(df2, by = 'row') %>%
    mutate(x = coalesce(x2, x), .keep = 'unused')

row x
1 1 i
2 2 b
3 3 h
4 4 d
5 5 g
6 6 f

英文:

We can join by row, then use coalesce:

library(dplyr)
df1 %&gt;%
    left_join(df2, by = &#39;row&#39;) %&gt;%
    mutate(x = coalesce(x2, x), .keep = &#39;unused&#39;)
  row x
1   1 i
2   2 b
3   3 h
4   4 d
5   5 g
6   6 f
</details>
# 答案2
**得分**: 1
我们可以使用 {powerjoin}
``` r
df1 = data.frame(row = seq(1,6),
                 x = c("a","b","c","d","e","f"))
df2 = data.frame(row = c(5,3,1,15,10),
                 x2 = c("g","h","i","j","k"))
library(powerjoin)
power_left_join(df1, df2 |&gt; dplyr::rename(x = x2), by = "row", conflict = coalesce_yx)
#&gt;   row x
#&gt; 1   1 i
#&gt; 2   2 b
#&gt; 3   3 h
#&gt; 4   4 d
#&gt; 5   5 g
#&gt; 6   6 f

^{创建于2023年03月17日，使用 reprex v2.0.2}

英文:

We might use {powerjoin}

df1 = data.frame(row = seq(1,6),
                 x = c(&quot;a&quot;,&quot;b&quot;,&quot;c&quot;,&quot;d&quot;,&quot;e&quot;,&quot;f&quot;))
df2 = data.frame(row = c(5,3,1,15,10),
                 x2 = c(&quot;g&quot;,&quot;h&quot;,&quot;i&quot;,&quot;j&quot;,&quot;k&quot;))
library(powerjoin)
power_left_join(df1, df2 |&gt; dplyr::rename(x = x2), by = &quot;row&quot;, conflict = coalesce_yx)
#&gt;   row x
#&gt; 1   1 i
#&gt; 2   2 b
#&gt; 3   3 h
#&gt; 4   4 d
#&gt; 5   5 g
#&gt; 6   6 f

<sup>Created on 2023-03-17 with reprex v2.0.2</sup>

答案3

得分: 0

使用dplyr 1.1.0版本：

df1 %>% 
  rows_update(df2 %>% rename(x = x2), unmatched = "ignore")

结果：

匹配，按 = "row"
  行 x
1   1 i
2   2 b
3   3 h
4   4 d
5   5 g
6   6 f

如果两个表具有相同的行名称，会更简单：

df1 %>% 
  rows_update(df2, unmatched = "ignore")

英文:

With dplyr 1.1.0:

df1 %&gt;%
  rows_update(df2 %&gt;% rename(x = x2), unmatched = &quot;ignore&quot;)

Result

Matching, by = &quot;row&quot;
  row x
1   1 i
2   2 b
3   3 h
4   4 d
5   5 g
6   6 f

If both tables had the same rownames it would be simpler:

df1 %&gt;%
  rows_update(df2, unmatched = &quot;ignore&quot;)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在R中有条件地替换匹配值列表的列数值。

问题

答案1

答案3

在ggplot2中，确保矩形和直方图图形的宽度完全匹配。

我如何在每个交叉验证折叠中的每个训练部分上应用预处理，使用tidymodels？

寻找二进制列中的模式 r

How can I use the MatchIt package from R to match control and case patients on age and multiple diagnosis codes (ICD10)?

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。