2023年3月7日 21:58:36go评论102阅读模式

英文:

Add results of box.test within a loop in a new column of dataframe

问题

我有一个包含许多时间序列的数据集。我想使用Box检验来检查每个序列是否具有平稳性。我的循环用于进行检验，但如何将结果（x²和p值）导出到现有的数据框（每个时间序列作为行）作为新列呢？

这是我的数据框示例：

a <- c(0.2569, 0.0145896, 0.0369, 0.025986, 0.12569, 0.3695)
b <- c(0.125, 0.04582, 0.2569, 0.256369, 0.25698, 0.1456)
c <- c(0.2584, 0.05698, 0.1258, 0.2569, 0.098563, 0.1569)
df <- data.frame(a, b, c)

以下是我的循环，它可以成功为每个时间序列提供x²和p值：

for(i in 1:ncol(df)) {       
  box <- Box.test(df[, i], type = "Ljung-Box")
  print(box)
}

现在结果应该被传递到一个数据框的空列中，如下所示：

d <- c("series1", "series2", "series3")
e <- c("green", "black", "red")
f <- c(18, 24, 12)
p_value <- NA  # 创建一个空列
x <- NA   
df2 <- data.frame(d, e, f, p_value, x)

我的第一个想法是这样的：

df2$p_value <- box$p.value

但在这里，每一行都会得到相同的p值。

我认为，我需要使用一个新的循环，但是我不知道如何实现"df2[i,] <- df2[i,]"：

for(i in 1:nrow(df2)) {       
    df2$p_value <- box$p.value(df2[i,] <- df2[i,])
}

这并不起作用。有人可以帮助我吗？也许使用另一个函数？

英文:

I have a data set with many time series. I would like to check each series for stationarity using the box test. My loop works fine for the test, but how can I export the results (x² and p-value) to an existing dataframe (each time series as rows) as new column?

Here is my dataframe example:

a &lt;- c(0.2569, 0.0145896, 0.0369, 0.025986, 0.12569, 0.3695)
b &lt;- c(0.125, 0.04582, 0.2569, 0.256369, 0.25698, 0.1456)
c &lt;- c(0.2584, 0.05698, 0.1258, 0.2569, 0.098563, 0.1569)
df &lt;- data.frame(a,b,c)

Here the loop, it works fine and give me for every time series x² und p-value:

for(i in 1:ncol(df)) {       
  box &lt;- Box.test(df[ , i] &lt;- df[ , i], type = &quot;Ljung-Box&quot;)
  print(box)
}

Now the results should be transferred to the empty columns in a dataframe like this:

d &lt;- c(&quot;series1&quot;, &quot;series2&quot;, &quot;series3&quot;)
e &lt;- c(&quot;green&quot;, &quot;black&quot;, &quot;red&quot;)
f &lt;- c(18, 24, 12)
p_value &lt;- NA  #to create an empty column
x &lt;- NA   
df2 &lt;- data.frame(d,e,f,p_value,x)

My first idea was this:

df2$p_value &lt;- box$p.value

But here I get in each row the same p_value.

I think, I have to do it with a new loop, but here I dont now how to implement "df2[i , ] <- df2[i ,]" it:

for(i in 1:nrow(df2)) {       
    df2$p_value &lt;- box$p.value(df2[i , ] &lt;- df2[i ,])
}

This doesn't work.
Can somebody help me? Maybe with another function?

答案1

得分: 1

你可以使用 sapply 和子集来获取 p 值。

bres <- sapply(df, function(x) Box.test(x, type="Ljung-Box")[['p.value']])

我强烈建议明确地创建一个字典 a 以避免错误。

a <- setNames(c("series1", "series2", "series3"), c('a', 'b', 'c'))

然后使用 cbind。

cbind(df2, p_value=bres[match(df2$d, a)])
#         d     e  f   p_value
# a series1 green 18 0.8206314
# b series2 black 24 0.6379121
# c series3   red 12 0.1574567

数据：

df <- structure(list(a = c(0.2569, 0.0145896, 0.0369, 0.025986, 0.12569, 
0.3695), b = c(0.125, 0.04582, 0.2569, 0.256369, 0.25698, 0.1456
), c = c(0.2584, 0.05698, 0.1258, 0.2569, 0.098563, 0.1569)), row.names = c(NA, 
-6L), class = "data.frame")
df2 <- data.frame(d=c("series1", "series2", "series3"),
                  e=c("green", "black", "red"),
                  f=c(18, 24, 12))

英文:

You can use sapply and subset for the p value.

bres &lt;- sapply(df, \(x) Box.test(x, type=&quot;Ljung-Box&quot;)[[&#39;p.value&#39;]])

I strongly recommend to explicitly formulate a dictionary a to avoid mistakes.

a &lt;- setNames(c(&quot;series1&quot;, &quot;series2&quot;, &quot;series3&quot;), c(&#39;a&#39;, &#39;b&#39;, &#39;c&#39;))

Then cbind.

cbind(df2, p_value=bres[match(df2$d, a)])
#         d     e  f   p_value
# a series1 green 18 0.8206314
# b series2 black 24 0.6379121
# c series3   red 12 0.1574567

Data:

df &lt;- structure(list(a = c(0.2569, 0.0145896, 0.0369, 0.025986, 0.12569, 
0.3695), b = c(0.125, 0.04582, 0.2569, 0.256369, 0.25698, 0.1456
), c = c(0.2584, 0.05698, 0.1258, 0.2569, 0.098563, 0.1569)), row.names = c(NA, 
-6L), class = &quot;data.frame&quot;)
df2 &lt;- data.frame(d=c(&quot;series1&quot;, &quot;series2&quot;, &quot;series3&quot;),
                  e=c(&quot;green&quot;, &quot;black&quot;, &quot;red&quot;),
                  f=c(18, 24, 12))

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在数据框中的循环中将box.test的结果添加到新列中。

问题

答案1

删除一列，保持单元格的格式。

dplyr解决方案以精确和部分字符串连接方式

Ghost CMS API 来自 R

使用R查询服务器，基于Python示例。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。