2023年2月19日 20:30:14go评论92阅读模式

英文:

How to extend values down an r dataframe when row conditions are met?

问题

请注意，以下是您要翻译的部分：

"Note that there are solutions to other questions that may resolve this specific question, such as https://stackoverflow.com/questions/40040834/replace-na-with-previous-or-next-value-by-group-using dplyr. However, this question isn't about replacing NA's, NA's are OK in this question in certain circumstances. This question addresses replacing all cells in a group in a dataframe that fall after the first non-NA in that group, with that first non-NA value. When I researched this issue I didn't find solutions that fit because I only want to replace NA's in certain circumstances (NA's in a group that occur prior to the first non-NA in that group remain; and a group with all NA's and no non-NA in that group retain all their NA's).

Is there a method, with a preference for dplyr or data.table, for extending target values down an R dataframe range when specified row conditions are met in a row within a group? I vaguely remember an rleid function in data.table that may do the trick but I'm having trouble implementing. Either as a new column or by over-writing existing column "State" in my below example.

For example, if we start with the below example dataframe, I'd like to send the target value of 1 in each row to the end of each ID grouping, after the first occurrence of that target value of 1 in a group, and as better explained in the illustration underneath:

myDF <- data.frame(
ID = c(1,1,1,1,2,2,2,3,3,3),
State = c(NA,NA,1,NA,1,NA,NA,NA,NA,NA))"

英文:

Note that there are solutions to other questions that may resolve this specific question, such as https://stackoverflow.com/questions/40040834/replace-na-with-previous-or-next-value-by-group-using-dplyr. However, this question isn't about replacing NA's, NA's are OK in this question in certain circumstances. This question addresses replacing all cells in a group in a dataframe that fall after the the first non-NA in that group, with that first non-NA value. When I researched this issue I didn't find solutions that fit because I only want to replace NA's in certain circumstances (NA's in a group that occur prior to the first non-NA in that group remain; and a group with all NA's and no non-NA in that group retain all their NA's).

Is there a method, with a preference for dplyr or data.table, for extending target values down an R dataframe range when specified row conditions are met in a row within a group? I vaguely remember an rleid function in data.table that may do the trick but I'm having trouble implementing. Either as a new column or by over-writing existing column "State" in my below example.

myDF &lt;- data.frame(
  ID = c(1,1,1,1,2,2,2,3,3,3),
  State = c(NA,NA,1,NA,1,NA,NA,NA,NA,NA))

答案1

得分: 1

你可以使用 fill 函数：

library(tidyr)
myDF %>%
  group_by(ID) %>%
  fill(State, .direction = "down")
# A tibble: 10 × 2
# Groups:   ID [3]
      ID State
   <dbl> <dbl>
 1     1    NA
 2     1    NA
 3     1     1
 4     1     1
 5     2     1
 6     2     1
 7     2     1
 8     3    NA
 9     3    NA
10     3    NA

英文:

You can use fill:

library(tidyr)
myDF %&gt;%
  group_by(ID) %&gt;%
  fill(State, .direction = &quot;down&quot;)
# A tibble: 10 &#215; 2
# Groups:   ID [3]
      ID State
   &lt;dbl&gt; &lt;dbl&gt;
 1     1    NA
 2     1    NA
 3     1     1
 4     1     1
 5     2     1
 6     2     1
 7     2     1
 8     3    NA
 9     3    NA
10     3    NA

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

如何在满足行条件时扩展R数据框中的值？

问题

答案1

按组筛选具有两列NA值的情况。

如何安排重复的示例代码以在输出中按顺序排列

在 R 中更改 text_tokens 函数的输出

In R, how can I create a function that can take values from columns using dplyr::mutate, but also still take specific strings as values?

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。