April 4, 2023, 15:59:20

Is there a way to summarize values grouped by years while keeping the index?

Question

I tried to summarize values of different years which are assigned to specific IDs.

I used dplyr to summarize it but did not find a way to keep the index.

My data looks something like this:

year <- c(2015, 2015, 2015, 2016, 2016, 2017, 2017, 2018, 2018, 2018, 2018, 2019, 2019)
index <- c(1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2)
value <- c(5, 7, 3, NA, 9, 14, 15, 8, NA, 9, 10, 6, 4)
df1 <- data.frame(year, index, value)

And this is how I summarized the data:

library(dplyr)

sum1 <-
  df1 %>%
  group_by(year) %>%
  summarise(value = sum(value, na.rm = TRUE))

I'd like to get an outcome like:

year1 <- c(2015, 2016, 2017, 2018, 2019)
index1 <- c(1, 1, 1, 2, 2)
value1 <- c(15, 9, 29, 27, 10)
df2 <- data.frame(year1, index1, value1)

Thanks, I really appreciate your help!

Answer 1

Score: 3

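You can use aggregate. With the formula interface, rows where value is NA are dropped by default (na.action = na.omit), so sum needs no na.rm argument here: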

aggregate(value ~ ., df1, sum)
#  year index value
#1 2015     1    15
#2 2016     1     9
#3 2017     1    29
#4 2018     2    27
#5 2019     2    10

Or keep your dplyr code and add index to group_by():

library(dplyr)

df1 %>%
  group_by(year, index) %>%
  summarise(value = sum(value, na.rm = TRUE))
# A tibble: 5 × 3
# Groups:   year [5]
#   year index value
#  <dbl> <dbl> <dbl>
#1  2015     1    15
#2  2016     1     9
#3  2017     1    29
#4  2018     2    27
#5  2019     2    10
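
The "Groups: year [5]" line in that output means the result is still grouped by year. If an ungrouped result is preferred, one option (a sketch, assuming dplyr 1.0.0 or later, where summarise() accepts the .groups argument) is to drop the grouping explicitly. The name sum2 is only illustrative and does not come from the question:

```r
library(dplyr)

# Same data as in the question
df1 <- data.frame(
  year  = c(2015, 2015, 2015, 2016, 2016, 2017, 2017, 2018, 2018, 2018, 2018, 2019, 2019),
  index = c(1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2),
  value = c(5, 7, 3, NA, 9, 14, 15, 8, NA, 9, 10, 6, 4)
)

# Group by both columns so index is kept, then return an ungrouped tibble
sum2 <- df1 %>%
  group_by(year, index) %>%
  summarise(value = sum(value, na.rm = TRUE), .groups = "drop")

print(sum2)
```

This also silences the message summarise() prints about the remaining grouping. It relies on each year belonging to exactly one index, as in the sample data; a year that spans two index values would produce two rows.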
