Specify the calculation by column using adorn_totals()

Question

I am trying to add a totals row using the janitor package in R. However, I need two columns to be totaled with the sum function and one column to be a percentage (not the sum of that column).

```R
library(tidyverse)
library(janitor)

# Example data: one customer, four periods
df.1 <- tribble(
  ~customer, ~period,  ~cost1, ~cost2,
  'cust1',   '202201', 5,      10,
  'cust1',   '202202', 5,      10,
  'cust1',   '202203', 5,      10,
  'cust1',   '202204', 5,      10
)

df.1 %>%
  group_by(customer, period) %>%
  summarise(cost1 = sum(cost1, na.rm = TRUE),
            cost2 = sum(cost2, na.rm = TRUE),
            total = cost1 + cost2,
            pct = cost1 / total) %>%  # cost1 as a share of total, matching the expected output
  adorn_totals(where = 'row')
```

The expected output would be:

    customer period cost1 cost2 total    pct
    cust1    202201     5    10    15 .33333
    cust1    202202     5    10    15 .33333
    cust1    202203     5    10    15 .33333
    cust1    202204     5    10    15 .33333
    Total              20    40    60 .33333

Thanks in advance for any advice.

# Answer 1

**Score**: 2

Compute the percentage per row after `adorn_totals()`. Because `mutate()` runs once the totals row exists, the `pct` value in the Total row is derived from the summed columns (20 / 60) instead of being a sum of the per-row percentages:

```R
df.1 %>%
  group_by(customer, period) %>%
  summarise(cost1 = sum(cost1, na.rm = TRUE),
            cost2 = sum(cost2, na.rm = TRUE),
            total = cost1 + cost2) %>%
  adorn_totals(where = 'row') %>%   # sums cost1, cost2 and total
  mutate(pct = cost1 / total)       # ratio computed for every row, Total included
# `summarise()` has grouped output by 'customer'. You can override using the
# `.groups` argument.
#  customer period cost1 cost2 total       pct
#     cust1 202201     5    10    15 0.3333333
#     cust1 202202     5    10    15 0.3333333
#     cust1 202203     5    10    15 0.3333333
#     cust1 202204     5    10    15 0.3333333
#     Total      -    20    40    60 0.3333333
```
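To illustrate why the order of the two steps matters, here is a minimal sketch that compares the Total row when `pct` already exists before `adorn_totals()` with the Total row after recomputing it. The object names `per_period`, `summed` and `recomputed` are made up for this illustration; the numbers are the ones from the question.

```r
library(dplyr)
library(janitor)

# Per-period table with the ratio already computed (same numbers as the question).
per_period <- tibble(
  customer = "cust1",
  period   = c("202201", "202202", "202203", "202204"),
  cost1    = 5,
  cost2    = 10
) %>%
  mutate(total = cost1 + cost2,
         pct   = cost1 / total)

# adorn_totals() sums every numeric column, pct included.
summed <- per_period %>%
  adorn_totals(where = "row")

# Recomputing pct afterwards overwrites the summed value in the Total row.
recomputed <- summed %>%
  mutate(pct = cost1 / total)

summed$pct[5]      # sum of the four per-row ratios (4 * 1/3)
recomputed$pct[5]  # ratio of the summed columns (20 / 60)
```

The first value is what the question wants to avoid; the second is the ratio the expected output asks for.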

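If each column of the totals row needs its own rule, one possible alternative (not part of the answer above, and not using janitor) is to build the row by hand and append it with `bind_rows()`. This is only a sketch under that assumption; `per_period`, `totals_row` and `result` are illustrative names.

```r
library(dplyr)
library(tibble)

df.1 <- tribble(
  ~customer, ~period,  ~cost1, ~cost2,
  "cust1",   "202201", 5,      10,
  "cust1",   "202202", 5,      10,
  "cust1",   "202203", 5,      10,
  "cust1",   "202204", 5,      10
)

# Per-period summary, with the ratio computed row by row.
per_period <- df.1 %>%
  group_by(customer, period) %>%
  summarise(cost1 = sum(cost1, na.rm = TRUE),
            cost2 = sum(cost2, na.rm = TRUE),
            .groups = "drop") %>%
  mutate(total = cost1 + cost2,
         pct   = cost1 / total)

# Totals row: cost columns are summed, pct is recomputed from those sums.
totals_row <- tibble(
  customer = "Total",
  period   = "-",
  cost1    = sum(per_period$cost1),
  cost2    = sum(per_period$cost2)
) %>%
  mutate(total = cost1 + cost2,
         pct   = cost1 / total)

result <- bind_rows(per_period, totals_row)
result
```

This is more verbose than `adorn_totals()` followed by `mutate()`, but it makes the rule for each column explicit, which may help when the ratio cannot simply be recomputed from columns that are present in the final table.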

huangapple
  • Posted on June 5, 2023, 23:13:09
  • When reposting, please keep the link to this article: https://go.coder-hub.com/76407817.html