2023-06-15 20:30:44

Forward fill first instances of NAs in R data.table

Question

I have a data.table with a column like:

c(58,NA,NA,NA,NA,13,NA,NA,NA,12,23,NA,12)

I would like to fill only the first two NAs following each non-NA value in the column, carrying the last non-NA value forward. The result should be:

c(58,58,58,NA,NA,13,13,13,NA,12,23,23,12)

Any suggestions?

Answer 1

Score: 1

One crude way in base R:
```r
x <- c(58, NA, NA, NA, NA, 13, NA, NA, NA, 12, 23, NA, 12)

na_locf_max <- function(x, nmax){
  # Split the vector into runs, each starting at a non-NA value
  s <- split(x, cumsum(!is.na(x)))
  # Copy the first value of each run into the next nmax positions,
  # then trim the run back to its original length
  l <- mapply(\(x, y) {
    x[seq_len(nmax) + 1] <- x[1]
    length(x) <- y
    x
  }, s, lengths(s), SIMPLIFY = FALSE)
  # Flatten the list of runs back into a vector
  unlist(l, use.names = FALSE)
}
na_locf_max(x, nmax = 2)
# [1] 58 58 58 NA NA 13 13 13 NA 12 23 23 12
```
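
For comparison, the same split-by-`cumsum` idea can probably be written without the `mapply` loop. This is only a sketch and not part of the original answer; `fill_limited` is a hypothetical name, and it assumes `x` is a plain atomic vector.

```r
x <- c(58, NA, NA, NA, NA, 13, NA, NA, NA, 12, 23, NA, 12)

# Hypothetical helper: carry each non-NA value forward over at most nmax NAs
fill_limited <- function(x, nmax) {
  grp <- cumsum(!is.na(x))                        # run id; 0 for leading NAs
  pos <- ave(seq_along(x), grp, FUN = seq_along)  # position inside each run
  vals <- c(NA, x[!is.na(x)])                     # slot 1 serves the leading-NA run
  out <- vals[grp + 1]                            # unlimited forward fill
  out[pos > nmax + 1] <- NA                       # blank everything past the limit
  out
}

res <- fill_limited(x, nmax = 2)
print(res)
```

Here `pos` is 1 for the non-NA value that starts a run, so positions 2 to `nmax + 1` are the NAs that get filled. Leading NAs fall into run 0 and stay NA.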

Answer 2

Score: 0

A data.table solution using rleid for grouping and shift to access the numbers.

library(data.table)
dt[, .(V1, grp = rleid(V1), V1shift = shift(V1, 1)),][
   , .(V1, ifelse(is.na(V1shift), shift(V1shift, 1), V1shift)), by = grp][
   , .(V1, res = ifelse(is.na(V1), V2, V1)),]
    V1 res
 1: 58  58
 2: NA  58
 3: NA  58
 4: NA  NA
 5: NA  NA
 6: 13  13
 7: NA  13
 8: NA  13
 9: NA  NA
10: 12  12
11: 23  23
12: NA  23
13: 12  12

Data

dt <- structure(list(V1 = c(58, NA, NA, NA, NA, 13, NA, NA, NA, 12, 
23, NA, 12)), row.names = c(NA, -13L), class = c("data.table", 
"data.frame"))
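
To make the chain easier to follow, here is a small sketch that stops after the first step and prints the intermediate columns. The name `step1` is only for illustration, and `dt` is rebuilt with `data.table()` rather than `structure()` to keep the snippet self-contained.

```r
library(data.table)

dt <- data.table(V1 = c(58, NA, NA, NA, NA, 13, NA, NA, NA, 12, 23, NA, 12))

# grp: run id from rleid(); consecutive NAs share one id
# V1shift: the value one row above (a lag of 1)
step1 <- dt[, .(V1, grp = rleid(V1), V1shift = shift(V1, 1))]
print(step1)
```

Within each run of NAs, only the first row has a non-NA `V1shift`. The second step of the chain copies that value one row further down inside the group, which is why this approach fills exactly two NAs and would need another `shift` to fill more.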


Answer 3

Score: 0

One possible way to solve your problem:

library(data.table)
dt = data.table(V1 = c(58,NA,NA,NA,NA,13,NA,NA,NA,12,23,NA,12))
dt[, V2 := fifelse(is.na(V1) & rowid(rleid(V1)) <= 2, nafill(V1, "locf"), V1)]
       V1    V2
 1:    58    58
 2:    NA    58
 3:    NA    58
 4:    NA    NA
 5:    NA    NA
 6:    13    13
 7:    NA    13
 8:    NA    13
 9:    NA    NA
10:    12    12
11:    23    23
12:    NA    23
13:    12    12
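
If the limit needs to vary, the one-liner above could be wrapped in a small function. This is a sketch under assumptions: `fill_first_n` and the column names `fill1` and `fill2` are hypothetical, and the input is assumed to be a numeric column, since `nafill` is used for the forward fill.

```r
library(data.table)

# Hypothetical helper: fill at most n NAs after each non-NA value
fill_first_n <- function(x, n) {
  run_pos <- rowid(rleid(x))  # position of each element within its run
  fifelse(is.na(x) & run_pos <= n, nafill(x, "locf"), x)
}

dt <- data.table(V1 = c(58, NA, NA, NA, NA, 13, NA, NA, NA, 12, 23, NA, 12))
dt[, fill1 := fill_first_n(V1, 1)]
dt[, fill2 := fill_first_n(V1, 2)]
print(dt)
```

`rleid(V1)` gives every run of consecutive NAs its own id and `rowid()` numbers the rows inside each run, so `run_pos <= n` selects only the first `n` NAs of a gap.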
