How to add an incremental value based on a column?

Question

For example, how do I obtain the values for the column "rank"?
Here, the value increases every 2 records in the data frame, though it could be any number of records instead of 2.

I found a few similar posts, but they are not exactly what I need.

Months rank
1 1
2 1
3 2
4 2
5 3
6 3
7 4
8 4

Answer 1

Score: 1

Use rep() with its each= argument: build the group numbers 1 to nrow(dat)/n and repeat each one n times.

    # n is the group size; \(dat, n) is the R >= 4.1 shorthand for function(dat, n)
    f <- \(dat, n) {
      stopifnot(nrow(dat) %% n == 0)  ## for safety: n must divide the row count
      rep(seq_len(nrow(dat) / n), each = n)
    }
    f(dat, 1)
    # [1] 1 2 3 4 5 6 7 8
    f(dat, 2)
    # [1] 1 1 2 2 3 3 4 4
    f(dat, 3)
    # Error in f(dat, 3) : nrow(dat)%%n == 0 is not TRUE
    f(dat, 4)
    # [1] 1 1 1 1 2 2 2 2
    f(dat, nrow(dat))
    # [1] 1 1 1 1 1 1 1 1
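To attach the result to the data frame as the rank column, something like the following might work. It is only a sketch: add_rank and dat0 are hypothetical names introduced here, not part of the answer, and dat0 is built without a rank column so the block runs on its own.

```r
# Hypothetical helper: same logic as f() above, but returns the data frame
# with the computed values stored in a "rank" column.
add_rank <- function(dat, n) {
  stopifnot(nrow(dat) %% n == 0)  # n must divide the row count
  dat$rank <- rep(seq_len(nrow(dat) / n), each = n)
  dat
}

dat0 <- data.frame(Months = 1:8)  # example input without a rank column
res <- add_rank(dat0, 2)
res$rank
```

Returning the whole data frame keeps the call usable in a pipeline; returning only the vector, as f() does, is equally reasonable.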

To also allow unequal groups (a shorter last group when n does not divide the row count), you could implement an override= option.

    f2 <- \(dat, n, override=FALSE) {
      if (!override) {
        stopifnot(nrow(dat) %% n == 0)
        rep(seq_len(nrow(dat) / n), each = n)
      } else {
        # build more values than needed, then cut to the number of rows
        rep(seq_len(nrow(dat)), each = n)[seq_len(nrow(dat))]
      }
    }
    f2(dat, 3, override=TRUE)
    # [1] 1 1 1 2 2 2 3 3

Or, slightly more elegant (note that here n is the number of groups rather than the group size):

    f3 <- \(dat, n) {
      # recycle 1:n over all rows, then sort so equal values are adjacent
      sort.int(rep_len(1:n, nrow(dat)))
    }
    f3(dat, 0)
    # [1] 0 0 0 0 1 1 1 1
    f3(dat, 1)
    # [1] 1 1 1 1 1 1 1 1
    f3(dat, 2)
    # [1] 1 1 1 1 2 2 2 2
    f3(dat, 3)
    # [1] 1 1 1 2 2 2 3 3
    f3(dat, 8)
    # [1] 1 2 3 4 5 6 7 8
    f3(dat, 9)
    # [1] 1 2 3 4 5 6 7 8
    f3(dat, -1)
    # [1] -1 -1  0  0  0  1  1  1
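In f3 above, n works out to be the number of groups: f3(dat, 2) gives two groups of four, not groups of two. If n should stay the group size while still allowing a shorter last group, integer division on the row index is one possible alternative. This is a sketch, not part of the original answer; f4 is a hypothetical name.

```r
# Hypothetical helper: n is the group size; the last group may be shorter.
f4 <- function(dat, n) {
  stopifnot(n >= 1)
  # zero-based row index, integer-divided by n, shifted back to start at 1
  (seq_len(nrow(dat)) - 1L) %/% n + 1L
}

dat <- data.frame(Months = 1:8)  # example input
f4(dat, 2)
f4(dat, 3)
```

With the 8-row example, f4(dat, 2) reproduces the rank column from the question, and f4(dat, 3) yields groups of sizes 3, 3 and 2.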

Data:

    dat <- structure(list(Months = 1:8, rank = c(1L, 1L, 2L, 2L, 3L, 3L, 4L, 4L)),
                     class = "data.frame", row.names = c(NA, -8L))

Answer 2

Score: 1

Suppose we have data like the following:

      letters
    1       A
    2       B
    3       C
    4       D
    5       E
    6       F
    7       G
    8       H

Then we can do the following:

    library(dplyr)  # %>%, mutate(), select()
    library(tidyr)  # fill()
    # data.table must also be installed, for rleid()

    df <- data.frame(letters = LETTERS[1:8], Months = 1:8) %>%
      mutate(rank1 = Months %% 2,                           # 1 on odd months, where a group starts
             rank2 = ifelse(rank1 == 1, Months, NA_real_)) %>%
      fill(rank2) %>%                                       # carry each group start downward
      mutate(rank = data.table::rleid(rank2)) %>%           # number the runs 1, 2, 3, ...
      select(-c('rank1', 'rank2'))
    df
    # output
    #   letters Months rank
    # 1       A      1    1
    # 2       B      2    1
    # 3       C      3    2
    # 4       D      4    2
    # 5       E      5    3
    # 6       F      6    3
    # 7       G      7    4
    # 8       H      8    4
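The Months %% 2 step ties this approach to groups of two. For an arbitrary group size, one possible variation is to compute the rank directly from the row number. This is a sketch that swaps in integer division for the fill()/rleid() steps; the variable n is an assumption introduced here, and dplyr must be installed.

```r
library(dplyr)

n <- 2  # assumed group size; change as needed

df <- data.frame(letters = LETTERS[1:8], Months = 1:8) %>%
  # zero-based row index, integer-divided by n, shifted back to start at 1
  mutate(rank = (row_number() - 1) %/% n + 1)
df
```

This relies on the rows already being in the desired order; if they are not, sorting by Months first (for example with arrange(Months)) would be needed.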

Answer 3

Score: 0

Please try the code below:

    library(dplyr)  # %>%, mutate(), select()
    library(tidyr)  # fill()
    # data.table must also be installed, for rleid()

    df <- data.frame(Months = 1:8) %>%
      mutate(rank1 = Months %% 2,
             rank2 = ifelse(rank1 == 1, Months, NA_real_)) %>%
      fill(rank2) %>%
      mutate(rank = data.table::rleid(rank2)) %>%
      select(-c('rank1', 'rank2'))
    df
    # output
    #   Months rank
    # 1      1    1
    # 2      2    1
    # 3      3    2
    # 4      4    2
    # 5      5    3
    # 6      6    3
    # 7      7    4
    # 8      8    4

huangapple
  • Posted on June 26, 2023, 00:57:49
  • When reposting, please keep the link to this article: https://go.coder-hub.com/76551527.html