2023年2月24日 12:19:58go评论128阅读模式

英文:

r table for many columns

问题

我的数据集如下。

   ID    Col_01    Col_02   Col_03    Col_04    Col_05    Col_06
   1     1         2        1         3         4         -9
   2     1         1        2         1         2          2
   3     2         4        1         1         1          1
   4     3         1        3         2        -9          4
   5     2         3        4         4         3          2

我想创建一个总结的数据集，其中每列（Col_01-Col_06）中的1、2、3、4和-9的数量如下所示。

    Values    Col_01    Col_02   Col_03    Col_04    Col_05    Col_06   
    1         2         2        2         2         1         1
    2         2         1        1         1         1         2
    3         1         1        1         1         1         0
    4         0         1        1         1         1         1
   -9         0         0        0         0         1         1

到目前为止，我尝试了以下代码

    df %>%
      select(matches(^Col_\\d+$")) %>%
       summarise_all(funs(table))

但我收到一个错误消息：Col_05 的大小必须为 4 或 1，而不是之前列的大小为 4。还有一堆其他警告。是否有任何建议，我如何可以为数据集中以 "Col_" 开头的所有列创建表格摘要？感谢。

英文:

My dataset is like this.

   ID    Col_01    Col_02   Col_03    Col_04    Col_05    Col_06
   1     1         2        1         3         4         -9
   2     1         1        2         1         2          2
   3     2         4        1         1         1          1
   4     3         1        3         2        -9          4
   5     2         3        4         4         3          2

I like to create a summarized dataset where the number of 1s,2s,3s,4s, -9s in each column (Col_01-Col_06) are counted like this.

    Values    Col_01    Col_02   Col_03    Col_04    Col_05    Col_06   
    1         2         2        2         2         1         1
    2         2         1        1         1         1         2
    3         1         1        1         1         1         0
    4         0         1        1         1         1         1
   -9         0         0        0         0         1         1

So far I tried

    df %&gt;%
      select(matches(^Col_\\d+$&quot;)) %&gt;%
       summarise_all(funs(table))

but I get an error Col_05 must be of size 4 or 1 , not 5 as earlier column had size 4. and bunch of other warnings. Any suggestions how I can create table summary for all columns starting with Col_ in my dataset is appreciated, Thanks.

答案1

得分: 2

在基本的R中，您可以执行以下操作：
```r
table(stack(df1,-1))

如果您需要一个数据框架：

as.data.frame(matrix(table(stack(df1,-1)))

英文:

In base R you could do

table(stack(df1,-1))

If you need a dataframe:

as.data.frame matrix(table(stack(df1,-1)))

答案2

得分: 1

以下是代码部分的翻译：

Pivoting longer, counting, then pivoting wider is one option.
library(dplyr)
library(tidyr)
df1 %>%
  pivot_longer(starts_with("Col_")) %>%
  count(name, value) %>%
  pivot_wider(names_from = name, 
              values_from = n, 
              values_fill = 0)

请注意，代码部分没有需要翻译的内容，所以只提供了原文的代码。如果您需要其他方面的帮助，请随时告诉我。

英文:

Pivoting longer, counting, then pivoting wider is one option.

library(dplyr)
library(tidyr)
df1 %&gt;% 
  pivot_longer(starts_with(&quot;Col_&quot;)) %&gt;% 
  count(name, value) %&gt;% 
  pivot_wider(names_from = name, 
              values_from = n, 
              values_fill = 0)

Result:

# A tibble: 5 &#215; 7
  value Col_01 Col_02 Col_03 Col_04 Col_05 Col_06
  &lt;int&gt;  &lt;int&gt;  &lt;int&gt;  &lt;int&gt;  &lt;int&gt;  &lt;int&gt;  &lt;int&gt;
1     1      2      2      2      2      1      1
2     2      2      1      1      1      1      2
3     3      1      1      1      1      1      0
4     4      0      1      1      1      1      1
5    -9      0      0      0      0      1      1

Data:

df1 &lt;- structure(list(ID = 1:5, Col_01 = c(1L, 1L, 2L, 3L, 2L), Col_02 = c(2L, 
1L, 4L, 1L, 3L), Col_03 = c(1L, 2L, 1L, 3L, 4L), Col_04 = c(3L, 
1L, 1L, 2L, 4L), Col_05 = c(4L, 2L, 1L, -9L, 3L), Col_06 = c(-9L, 
2L, 1L, 4L, 2L)), class = &quot;data.frame&quot;, row.names = c(NA, -5L
))

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

r table for many columns

问题

答案1

答案2

使用`st_centroid`返回点的质心。

使Plotly漏斗图中各个条之间的区域透明。

无法更改ggsurv()图中的图例顺序。

R中时间序列的滚动均值，包括缺失日期。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。