February 19, 2023 07:30:17

Selecting Specific Data in a Data Frame to Replace Using the Row and Column Names

Question


I am attempting to replace specific NA values with 0 in my data table. I do not want all NAs replaced, only those under certain conditions. For example, "replace NA with zero when the row is Cole_1 and the column name includes the designation 'Fall1'". I have a huge data set, so I need as little manual designation as possible; numbering each column is not an option. Basically, I want to be able to target the cells like playing Battleship.

I have tried:

whentest <- count_order_site %>%
  when(select(contains("Fall1")) &
  count_order_site[count_order_site$Point_Name == "Cole_1", ],
  count_order_site[is.na(count_order_site)] <- 0 )

but get an error "contains() must be used within a selecting function."
I'm not even sure if this is the right path to get what I want.

The basic layout idea:

Point Name	ACWO_Fall1	ACWO_FAll2	HOSP_FAll1
Cole_1	NA	3	NA
Cole_2	3	NA	5

After the functions the data would look like:

Point Name	ACWO_Fall1	ACWO_FAll2	HOSP_FAll1
Cole_1	0	3	0
Cole_2	3	NA	5

Answer 1

Score: 0


If I understand correctly, you can use mutate() with across() to select the columns whose names contain a certain string, such as "Fall1". Then, with replace(), set to 0 those values that are missing (tested with is.na()) in rows where point_name has a specific value, such as "Cole_1".

The example below has a couple of extra columns to demonstrate that the logic is correct.

library(tidyverse)
df %>%
  mutate(across(contains("Fall1"), ~replace(., is.na(.) & point_name == "Cole_1", 0)))

Output

  point_name ACWO_Fall1 ACWO_Fall2 HOSP_Fall1 Other1 Other_Fall1
1     Cole_1          0          3          0     NA           6
2     Cole_2          3         NA          5     NA          NA
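
For anyone who wants to run this end to end, here is a minimal sketch. The answer never shows how df was built, so the data frame below is a hypothetical reconstruction from the output above (the NA positions in the Cole_1 row are assumed). The second half is a base R alternative that is not part of the answer as posted: it targets the cells "battleship style" with a row condition and a column-name pattern. Note that contains() ignores case by default, so a column spelled HOSP_FAll1 would still match "Fall1"; ignore.case = TRUE is passed to grep() to mirror that.

```r
library(dplyr)

# Hypothetical input, rebuilt from the output shown above (assumption:
# the Cole_1 row had NA in ACWO_Fall1 and HOSP_Fall1 before replacement).
df <- data.frame(
  point_name  = c("Cole_1", "Cole_2"),
  ACWO_Fall1  = c(NA, 3),
  ACWO_Fall2  = c(3, NA),
  HOSP_Fall1  = c(NA, 5),
  Other1      = c(NA, NA),
  Other_Fall1 = c(6, NA)
)

# dplyr version: within columns whose names contain "Fall1",
# replace NA with 0 only in rows where point_name is "Cole_1".
result <- df %>%
  mutate(across(contains("Fall1"),
                ~ replace(.x, is.na(.x) & point_name == "Cole_1", 0)))

# Base R version: pick the block of cells by row condition and
# column-name pattern, fill its NAs, and write the block back.
rows <- df$point_name == "Cole_1"
cols <- grep("Fall1", names(df), ignore.case = TRUE)
base_result <- df
block <- base_result[rows, cols]
block[is.na(block)] <- 0
base_result[rows, cols] <- block

print(result)
print(base_result)
```

Both versions leave the NA in Other_Fall1 for Cole_2 and the NA in ACWO_Fall2 alone, since those cells fail either the row condition or the column-name pattern.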
