2023年2月8日 23:59:02go评论84阅读模式

英文:

Obtaining a summary of grouped counts in R

问题

这应该很简单，但我一直被困扰住了：我试图找出获取分组计数的摘要统计信息的有效方法。以下是一个示例：

df = tibble(pid = c(1,2,2,3,3,3,4,4,4,4), y = rnorm(10))
df %>% group_by(pid) %>% count(pid)

这会输出期望的结果：

# A tibble: 4 × 2
# Groups:   pid [4]
    pid     n
  <dbl> <int>
1     1     1
2     2     2
3     3     3
4     4     4

然而，如果我想要这些分组计数的摘要，尝试创建新变量或使用add_count似乎不起作用，我猜测是因为变量的大小不同。例如：

df %>% group_by(pid) %>% count(pid) %>% mutate(count = summary(n))

会生成错误。生成分组计数的摘要统计信息（例如最小值、最大值、平均值等）的简单方法是什么？

英文:

This should be simple but I have been stumped by it: I am trying to figure out an efficient method for obtaining summary stats of a grouped count. Here's a toy example:

df = tibble(pid = c(1,2,2,3,3,3,4,4,4,4), y = rnorm(10))
df %&gt;% group_by(pid) %&gt;% count(pid)

which outputs the expected

# A tibble: 4 &#215; 2
# Groups:   pid [4]
    pid     n
  &lt;dbl&gt; &lt;int&gt;
1     1     1
2     2     2
3     3     3
4     4     4

However, what if I want a summary of those grouped counts? Attempting to mutate a new variable or add_count hasn't worked I assume because the variables are different sizes. For instance:

df %&gt;% group_by(pid) %&gt;% count(pid) %&gt;% mutate(count = summary(n))

generates an error. What would be a simple way to generate summary statistics of the grouped counts (e.g., min, max, mean, etc.)?

答案1

得分: 3

mutate用于向数据框添加列 - 在这里你不需要，你需要从数据框中提取列。

df %>%
  count(pid) %>%
  pull(n) %>%
  summary()

英文:

mutate is for adding columns to a data frame - you don't want that here, you need to pull the column out of the data frame.

df %&gt;% 
  count(pid) %&gt;% 
  pull(n) %&gt;% 
  summary()

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

获取R中分组计数的摘要

问题

答案1

如何从gtsummary表中隐藏特定单元格的信息

如何使用R编程进行代数除法？

颠倒若干列的内容顺序（最好在tidyverse中实现）。

R中时间序列的滚动均值，包括缺失日期。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。