June 8, 2023, 22:56:23

Separating rows of data containing multiple non-zeros, so new rows contain one non-zero value each

Question

I have a dataset like this:

df1 <- cbind(c("a","b"),c("c","c"),c(0,8),c(4,0),c(5,0),c(0,12))
colnames(df1) <- c("name1","name2","v1","v2","v3","v4")
name1 name2 v1 v2 v3 v4
    a     c  0  4  5  0
    b     c  8  0  0 12

But I need to create new rows so that the non-zero values of pairs of names have their own row like this:

df2 <- cbind(c("a","a","b","b"),c("c","c","c","c"),c(0,0,8,0),c(4,0,0,0),c(0,5,0,0),c(0,0,0,12))
colnames(df2) <- c("name1","name2","v1","v2","v3","v4")
name1 name2 v1 v2 v3 v4
    a     c  0  4  0  0
    a     c  0  0  5  0
    b     c  8  0  0  0
    b     c  0  0  0 12

This is my first question so I hope that's enough detail. I've tried separate_rows, and mutate if_else, but I'm completely stumped. Any help would be greatly appreciated!

Answer 1

Score: 2

library(tidyverse)
df1 |>
  as_tibble() |>
  pivot_longer(
    cols = starts_with("v"),
    names_to = "var",
    values_to = "value"
  ) |>
  filter(value != 0) |>
  mutate(
    var2 = var  # dummy variable to trick `pivot_wider` into keeping one row per non-zero value
  ) |>
  pivot_wider(
    names_from = var,
    values_from = value,
    values_fill = "0"  # values are characters because of how the data was entered (cbind builds a character matrix); you might need 0 (without the quotes) in your real data
  ) |>
  select(-var2)  # remove dummy variable
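
As the comment on values_fill hints, the fill value has to match the column type. The following is a minimal sketch of the same pipeline on numeric data, assuming df1 is built with data.frame() as in the Data section of the second answer. The names_sort = TRUE argument is an addition that is not part of the original answer; it is there only so the columns come back in v1..v4 order rather than in order of first appearance.

```r
library(dplyr)
library(tidyr)

# Assumption: numeric columns, built with data.frame() instead of cbind()
df1 <- data.frame(
  name1 = c("a", "b"),
  name2 = c("c", "c"),
  v1 = c(0, 8),
  v2 = c(4, 0),
  v3 = c(5, 0),
  v4 = c(0, 12)
)

result <- df1 |>
  pivot_longer(
    cols = starts_with("v"),
    names_to = "var",
    values_to = "value"
  ) |>
  filter(value != 0) |>
  mutate(var2 = var) |>     # extra id column so each non-zero value keeps its own row
  pivot_wider(
    names_from = var,
    values_from = value,
    values_fill = 0,        # numeric fill, because the value columns are numeric here
    names_sort = TRUE       # addition: sort the new columns by name
  ) |>
  select(-var2)

result
```

With character data, as produced by the cbind() call in the question, values_fill would need to stay "0".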

Answer 2

Score: 1

A data.table melt and dcast:

library(data.table)
dcast(
  melt(setDT(df1), c("name1", "name2"))[value != 0],
  name1 + name2 + value ~ variable, fill = 0
)[, value := NULL][]
#>    name1 name2 v1 v2 v3 v4
#> 1:     a     c  0  4  0  0
#> 2:     a     c  0  0  5  0
#> 3:     b     c  8  0  0  0
#> 4:     b     c  0  0  0 12

Data:

df1 <- data.frame(
  name1 = c("a","b"),
  name2 = c("c","c"),
  v1 = c(0,8),
  v2 = c(4,0),
  v3 = c(5,0),
  v4 = c(0,12)
)
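
For comparison, here is a rough base R sketch of the same reshaping that needs no packages. It is not taken from either answer; the helper names value_cols, vals, hits and new_vals are made up for illustration, and it assumes the numeric data.frame shown in the Data block above.

```r
# Assumption: same numeric data as in the Data block
df1 <- data.frame(
  name1 = c("a", "b"),
  name2 = c("c", "c"),
  v1 = c(0, 8),
  v2 = c(4, 0),
  v3 = c(5, 0),
  v4 = c(0, 12)
)

value_cols <- c("v1", "v2", "v3", "v4")
vals <- as.matrix(df1[value_cols])

# (row, col) position of every non-zero cell, ordered by source row, then column
hits <- which(vals != 0, arr.ind = TRUE)
hits <- hits[order(hits[, "row"], hits[, "col"]), , drop = FALSE]

# One all-zero output row per non-zero cell, then put each value back in its column
new_vals <- matrix(
  0,
  nrow = nrow(hits),
  ncol = length(value_cols),
  dimnames = list(NULL, value_cols)
)
new_vals[cbind(seq_len(nrow(hits)), hits[, "col"])] <- vals[hits]

# Repeat the name columns once per non-zero cell and attach the values
result <- cbind(df1[hits[, "row"], c("name1", "name2")], new_vals)
rownames(result) <- NULL

result
```

The idea is the same as in both answers: find each non-zero cell, give it a row of its own, and fill everything else with 0.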
