将包含多个非零值的数据行分开,使新行每行只包含一个非零值。

huangapple go评论99阅读模式
英文:

Separating rows of data containing multiple non-zeros, so new rows contain one non-zero value each

问题

I understand that you want a Chinese translation of the code and its description. Here it is:

我明白您想要代码和描述的中文翻译。以下是翻译:

我有一个类似这样的数据集:

  1. df1 <- cbind(c("a","b"),c("c","c"),c(0,8),c(4,0),c(5,0),c(0,12))
  2. colnames(df1) <- c("name1","name2","v1","v2","v3","v4")
  3. name1 name2 v1 v2 v3 v4
  4. a c 0 4 5 0
  5. b c 8 0 0 12

但我需要创建新的行,以便名称对的非零值有自己的行,就像这样:

  1. df2 <- cbind(c("a","a","b","b"),c("c","c","c","c"),c(0,0,8,0),c(4,0,0,0),c(0,5,0,0),c(0,0,0,12))
  2. colnames(df2) <- c("name1","name2","v1","v2","v3","v4")
  3. name1 name2 v1 v2 v3 v4
  4. a c 0 4 0 0
  5. a c 0 0 5 0
  6. b c 8 0 0 0
  7. b c 0 0 0 12

这是我的第一个问题,所以希望这足够详细。我尝试过separate_rowsmutate if_else,但我完全被难住了。任何帮助将不胜感激!

英文:

I have a dataset like this:

  1. df1 &lt;- cbind(c(&quot;a&quot;,&quot;b&quot;),c(&quot;c&quot;,&quot;c&quot;),c(0,8),c(4,0),c(5,0),c(0,12))
  2. colnames(df1) &lt;- c(&quot;name1&quot;,&quot;name2&quot;,&quot;v1&quot;,&quot;v2&quot;,&quot;v3&quot;,&quot;v4&quot;)
  3. name1 name2 v1 v2 v3 v4
  4. a c 0 4 5 0
  5. b c 8 0 0 12

But I need to create new rows so that the non-zero values of pairs of names have their own row like this:

  1. df2 &lt;- cbind(c(&quot;a&quot;,&quot;a&quot;,&quot;b&quot;,&quot;b&quot;),c(&quot;c&quot;,&quot;c&quot;,&quot;c&quot;,&quot;c&quot;),c(0,0,8,0),c(4,0,0,0),c(0,5,0,0),c(0,0,0,12))
  2. colnames(df2) &lt;- c(&quot;name1&quot;,&quot;name2&quot;,&quot;v1&quot;,&quot;v2&quot;,&quot;v3&quot;,&quot;v4&quot;)
  3. name1 name2 v1 v2 v3 v4
  4. a c 0 4 0 0
  5. a c 0 0 5 0
  6. b c 8 0 0 0
  7. b c 0 0 0 12

This is my first question so I hope that's enough detail. I've tried separate_rows, and mutate if_else, but I'm completely stumped. Any help would be greatly appreciated!

答案1

得分: 2

  1. library(tidyverse)
  2. df1 |&gt;
  3. as_tibble() |&gt;
  4. pivot_longer(
  5. cols = starts_with(&quot;v&quot;),
  6. names_to = &#39;var&#39;,
  7. values_to = &#39;value&#39;
  8. ) |&gt;
  9. filter(value != 0) |&gt;
  10. mutate(
  11. var2 = var # dummy variable to trick `pivot_wider` to do what you want
  12. ) |&gt;
  13. pivot_wider(
  14. names_from = var,
  15. values_from = value,
  16. values_fill = &quot;0&quot; # values are forced to be characters due to how data is entered/saved. Might need to change to 0 (without the quotes) in your real data.
  17. ) |&gt;
  18. select(-var2) # remove dummy variable
英文:
  1. library(tidyverse)
  2. df1 |&gt;
  3. as_tibble() |&gt;
  4. pivot_longer(
  5. cols = starts_with(&quot;v&quot;),
  6. names_to = &#39;var&#39;,
  7. values_to = &#39;value&#39;
  8. ) |&gt;
  9. filter(value != 0) |&gt;
  10. mutate(
  11. var2 = var # dummy variable to trick `pivot_wider` to do what you want
  12. ) |&gt;
  13. pivot_wider(
  14. names_from = var,
  15. values_from = value,
  16. values_fill = &quot;0&quot; # values are forced to be characters due to how data is entered/saved. Might need to change to 0 (without the quotes) in your real data.
  17. ) |&gt;
  18. select(-var2) # remove dummy variable

答案2

得分: 1

  1. # 使用 data.table 的 melt 和 dcast 函数:
  2. library(data.table)
  3. dcast(
  4. melt(setDT(df1), c("name1", "name2"))[value != 0],
  5. name1 + name2 + value ~ variable, fill = 0
  6. )[, value := NULL][]
  7. #> name1 name2 v1 v2 v3 v4
  8. #> 1: a c 0 4 0 0
  9. #> 2: a c 0 0 5 0
  10. #> 3: b c 8 0 0 0
  11. #> 4: b c 0 0 0 12

数据:

  1. df1 <- data.frame(
  2. name1 = c("a","b"),
  3. name2 = c("c","c"),
  4. v1 = c(0,8),
  5. v2 = c(4,0),
  6. v3 = c(5,0),
  7. v4 = c(0,12)
  8. )
英文:

A data.table melt and dcast:

  1. library(data.table)
  2. dcast(
  3. melt(setDT(df1), c(&quot;name1&quot;, &quot;name2&quot;))[value != 0],
  4. name1 + name2 + value ~ variable, fill = 0,
  5. )[, value := NULL][]
  6. #&gt; name1 name2 v1 v2 v3 v4
  7. #&gt; 1: a c 0 4 0 0
  8. #&gt; 2: a c 0 0 5 0
  9. #&gt; 3: b c 8 0 0 0
  10. #&gt; 4: b c 0 0 0 12

Data:

  1. df1 &lt;- data.frame(
  2. name1 = c(&quot;a&quot;,&quot;b&quot;),
  3. name2 = c(&quot;c&quot;,&quot;c&quot;),
  4. v1 = c(0,8),
  5. v2 = c(4,0),
  6. v3 = c(5,0),
  7. v4 = c(0,12)
  8. )

huangapple
  • 本文由 发表于 2023年6月8日 22:56:23
  • 转载请务必保留本文链接:https://go.coder-hub.com/76433153.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定