2023年6月22日 19:21:56go评论126阅读模式

英文:

Create binary table where the number of rows are values from a variable

问题

我有一个包含两个计数变量（var1和var2）的数据集，它们代表两个时间点的存活个体数。
我需要创建一个包含新的二进制变量的表格，其中行数对应于var1的值，并在这个变量内，根据var2的值分配"1"（例如，如果var1中有10，var2中有3，那么我需要10行，其中3行为"1"，其余为"0"）。
是否有任何可以帮助的函数？
谢谢。

英文:

I have a dataset with a two variables with counts (var1 and var2), that represent alive individuals at two times.
I need to create a table with a new binary variable, where the number of rows correspond to the values of var1, and within this variable, I need to assign "1" depending on values of var2 (e.g. if there is a 10 in var1 and a 3 in var2, then I would need 10 rows, from which 3 rows have "1", and the rest have "0").
Any function that would help?
Thanks

答案1

得分: 1

我可能误解了你的需求，但以下是我做的：

exemple <- data.frame(id = 1:3, var1 = c(10, 5, 1), var2 = c(3, 3, 0))
apply(exemple, 1, \(x) data.frame(id = rep(x[["id"]], times = x[["var1"]]), 
                                  var_binaire = rep(c(1, 0), times = c(x[["var2"]], x[["var1"]] - x[["var2"]])))) |>
  do.call(what = "rbind")

它似乎可以达到效果，它使用了rep函数重复生成1和0。apply被用于在行方向上执行操作。因此apply在此处创建了一个列表，每个元素都是一个具有var1行和var_binaire = 1 var2次的数据框。然后，do.call("rbind")将它们合并成一个数据框。以下是结果：

   id var_binaire
1   1           1
2   1           1
3   1           1
4   1           0
5   1           0
6   1           0
7   1           0
8   1           0
9   1           0
10  1           0
11  2           1
12  2           1
13  2           1
14  2           0
15  2           0
16  3           0

我曾以为可以在不使用apply的情况下完成，但我无法想出方法。也许会有更好的解决方案出现。希望对你有所帮助。

英文:

I may have misunderstood what you wanted but here is what I did:

exemple &lt;- data.frame(id = 1:3, var1 = c(10, 5, 1), var2 = c(3, 3, 0))
apply(exemple, 1, \(x) data.frame(id = rep(x[[&quot;id&quot;]], times = x[[&quot;var1&quot;]]), 
                                  var_binaire = rep(c(1, 0), times = c(x[[&quot;var2&quot;]], x[[&quot;var1&quot;]] - x[[&quot;var2&quot;]])))) |&gt;
  do.call(what = &quot;rbind&quot;)

It seems to do the trick, it creates a data.frame by repeating 1's and 0's with function rep. Apply is used with margin = 1 to perform rowwise operations. Hence apply create here a list with each element a data.frame with var1 lines and var_binaire = 1 var2 times. Then, do.call("rbind") combines all in a data.frame. Here is the result:

exemple
  id var1 var2
1  1   10    3
2  2    5    3
3  3    1    0
output
   id var_binaire
1   1           1
2   1           1
3   1           1
4   1           0
5   1           0
6   1           0
7   1           0
8   1           0
9   1           0
10  1           0
11  2           1
12  2           1
13  2           1
14  2           0
15  2           0
16  3           0

I thought it would be possible without apply, but I couldn't figure out how. Maybe a better solution will come up. I hope this helped.

答案2

得分: 1

与@Guillaume Mulier非常相似的一种方法，但使用向量化：

x <- data.frame(id = 1:3, var1 = c(10, 5, 1), var2 = c(3, 3, 0))
with(x, data.frame(id=rep(id, var1),
              mort=rep(rep(1:0, nrow(x)),
                       c(matrix(c(var2, var1 - var2), 2, byrow=TRUE)))))
#   id mort
#1   1    1
#2   1    1
#3   1    1
#4   1    0
#5   1    0
#6   1    0
#7   1    0
#8   1    0
#9   1    0
#10  1    0
#11  2    1
#12  2    1
#13  2    1
#14  2    0
#15  2    0
#16  3    0

请注意，这段代码是R语言的一部分，主要用于数据框（data.frame）操作。

英文:

A way quite similar to @Guillaume Mulier but vectorizing;

x &lt;- data.frame(id = 1:3, var1 = c(10, 5, 1), var2 = c(3, 3, 0))
with(x, data.frame(id=rep(id, var1),
              mort=rep(rep(1:0, nrow(x)),
                       c(matrix(c(var2, var1 - var2), 2, byrow=TRUE)))))
#   id mort
#1   1    1
#2   1    1
#3   1    1
#4   1    0
#5   1    0
#6   1    0
#7   1    0
#8   1    0
#9   1    0
#10  1    0
#11  2    1
#12  2    1
#13  2    1
#14  2    0
#15  2    0
#16  3    0

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

创建二进制表，其中行数是来自变量的值。

问题

答案1

答案2

检查列中是否省略了前导零，如果省略则添加。

使用R中的ivreg工具变量三部分右侧符号表示法。

将绘制的线转换为模型对象

数据框中的新列不保留 POSIXct 类。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。