2023年2月6日 12:46:41go评论85阅读模式

英文:

Unnesting/rectangling/flattening a nested list using `tidyr::unnest_longer()`

问题

I've been trying to get my head around the unnesting functions in tidyr and tibblify. I believe you should be able to use unnest_longer() to replicate the more manual methods below of turning this kind of nested list into a tibble, but I've been struggling with the docs a little. A correct example of how to do this would help me immensely:

# Example nested list
nl &lt;- list(time = list(&quot;2023-02-06&quot;, &quot;2023-02-07&quot;, &quot;2023-02-08&quot;,
                       &quot;2023-02-09&quot;, &quot;2023-02-10&quot;, &quot;2023-02-11&quot;,
                       &quot;2023-02-12&quot;), 
           precipitation_sum = list(0.9, 0, 0, 0.3, 0, 0, 0))
# one way to do it (extract colnames and construct)
tibble(!!! setNames(map(nl, unlist),names(nl)))
# another way (collect &amp; reduce each sublist)
as_tibble(lapply(nl, function(x) Reduce(c, x)))
# how to use tidyr and unnest_longer? (below is incorrect)
unnest_longer(tibble(nl), col = everything())

英文:

# Example nested list
nl &lt;- list(time = list(&quot;2023-02-06&quot;, &quot;2023-02-07&quot;, &quot;2023-02-08&quot;,
                       &quot;2023-02-09&quot;, &quot;2023-02-10&quot;, &quot;2023-02-11&quot;,
                       &quot;2023-02-12&quot;), 
           precipitation_sum = list(0.9, 0, 0, 0.3, 0, 0, 0))
# one way to do it (extract colnames and construct)
tibble(!!! setNames(map(nl, unlist),names(nl)))
# another way (collect &amp; reduce each sublist)
as_tibble(lapply(nl, function(x) Reduce(c, x)))
# how to use tidyr and unnest_longer? (below is incorrect)
unnest_longer(tibble(nl), col = everything())

答案1

得分: 4

以下是翻译后的代码部分：

library(tibble)
library(tidyr)
as_tibble(nl) %>%
    unnest(cols = where(is.list))

-output

# A tibble: 7 × 2
  time       precipitation_sum
  <chr>                  <dbl>
1 2023-02-06               0.9
2 2023-02-07               0  
3 2023-02-08               0  
4 2023-02-09               0.3
5 2023-02-10               0  
6 2023-02-11               0  
7 2023-02-12               0

或者更紧凑的写法：

library(purrr)
map_dfc(nl, unlist)
# A tibble: 7 × 2
  time       precipitation_sum
  <chr>                  <dbl>
1 2023-02-06               0.9
2 2023-02-07               0  
3 2023-02-08               0  
4 2023-02-09               0.3
5 2023-02-10               0  
6 2023-02-11               0  
7 2023-02-12               0

请注意，上述代码中的R语言代码保持不变，只有注释部分进行了翻译。

英文:

We could use

library(tibble)
library(tidyr)
as_tibble(nl) %&gt;% 
    unnest(cols = where(is.list))

-output

# A tibble: 7 &#215; 2
  time       precipitation_sum
  &lt;chr&gt;                  &lt;dbl&gt;
1 2023-02-06               0.9
2 2023-02-07               0  
3 2023-02-08               0  
4 2023-02-09               0.3
5 2023-02-10               0  
6 2023-02-11               0  
7 2023-02-12               0

Or more compactly

library(purrr)
map_dfc(nl, unlist)
# A tibble: 7 &#215; 2
  time       precipitation_sum
  &lt;chr&gt;                  &lt;dbl&gt;
1 2023-02-06               0.9
2 2023-02-07               0  
3 2023-02-08               0  
4 2023-02-09               0.3
5 2023-02-10               0  
6 2023-02-11               0  
7 2023-02-12               0

答案2

得分: 1

另一个有趣的选项是使用 dmap（以及 dmap 背后的历史）：

'purrrlyr 包含一些位于 purrr 和 dplyr 交集处的函数。它们已从 purrr 中移除，以使包更轻量，并且因为它们已被 tidyverse 中的其他解决方案替代。' <https://github.com/hadley/purrrlyr/>

#install.packages("purrrlyr")
library(purrrlyr)
nl %>%
  dmap(unlist)

  time       precipitation_sum
  <chr>                  <dbl>
1 2023-02-06               0.9
2 2023-02-07               0  
3 2023-02-08               0  
4 2023-02-09               0.3
5 2023-02-10               0  
6 2023-02-11               0  
7 2023-02-12               0

英文:

Another intersting option is to use dmap (and the history behind dmap):

'purrrlyr contains some functions that lie at the intersection of purrr and dplyr. They have been removed from purrr in order to make the package lighter and because they have been replaced by other solutions in the tidyverse.' <https://github.com/hadley/purrrlyr/>

#install.packages(&quot;purrrlyr&quot;)
library(purrrlyr)
nl %&gt;% 
  dmap(unlist)

  time       precipitation_sum
  &lt;chr&gt;                  &lt;dbl&gt;
1 2023-02-06               0.9
2 2023-02-07               0  
3 2023-02-08               0  
4 2023-02-09               0.3
5 2023-02-10               0  
6 2023-02-11               0  
7 2023-02-12               0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Unnesting/rectangling/flattening a nested list using `tidyr::unnest_longer()`

问题

答案1

答案2

如何针对特定ID保留包含特定短语的字符串？

Nullmodel with presence absence data in vegan – R

将数据框（df）的第一列转换为标题，并保留原始标题作为子标题。

Boxplot with additional lines for 10th and 90th percentile in R

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。