2023年6月6日 03:05:53go评论109阅读模式

英文:

how to do conditional summation in R

问题

I am trying to do a conditional summation based on a table that looks like this:

在R中如何进行条件求和 ]1

我正在尝试基于这样的表格进行条件求和：

I am trying to do a summation of the column "value" and group by Location. Normally I would just do this:

通常情况下，我只需这样做：

data <- file %>%
group_by(location, date) %>%
summarize(value = sum (value))

However, only for the Location "Central" I would like to exclude the Program "B". So I tried this way, but it did not work:

但是，仅对于“Central”位置，我想排除“B”程序。所以我尝试了这种方式，但没有成功：

data <- file %>%
group_by(location, date) %>%
summarize(value =
case_when(location == "Central" ~ filter(program != "B")),
TRUE ~ sum(value)
)

If someone could please help me with the code above, I would much appreciate that.
Thank you

如果有人能帮助我处理上面的代码，我将不胜感激。
谢谢

英文:

I am trying to do a conditional summation based on a table that looks like this:

在R中如何进行条件求和 ]1

I am trying to do a summation of the column "value" and group by Location. Normally I would just do this:

data &lt;- file %&gt;%
group_by(location, date) %&gt;%
summarize(value = sum (value))

However, only for the Location "Central" I would like to exclude the Program "B". So I tried this way, but it did not work:

data &lt;- file %&gt;%
group_by(location, date) %&gt;%
summarize(value = 
case_when(location == &quot;Central&quot; ~ filter(program != &quot;B&quot;)),
          TRUE ~ sum(value)
)

If someone oculd please help me with the code above, I would much appreciate that.
Thank you

EDIT:
Here is the reproducible data using dput:

structure(list(pid = c(123, 123, 123, 123, 123, 123, 123, 
123, 123, 123), program = c(&quot;A&quot;, &quot;A&quot;, 
&quot;A&quot;, &quot;A&quot;, &quot;A&quot;, 
&quot;A&quot;, &quot;A&quot;, &quot;A&quot;, 
&quot;A&quot;, &quot;A&quot;), location = c(&quot;Central&quot;, 
&quot;Central&quot;, &quot;Central&quot;, &quot;Central&quot;, &quot;Central&quot;, &quot;Central&quot;, &quot;Central&quot;, 
&quot;Central&quot;, &quot;Central&quot;, &quot;Central&quot;), locationid = c(&quot;123-Central&quot;, 
&quot;123-Central&quot;, &quot;123-Central&quot;, &quot;123-Central&quot;, &quot;123-Central&quot;, 
&quot;123-Central&quot;, &quot;123-Central&quot;, &quot;123-Central&quot;, &quot;123-Central&quot;, 
&quot;123-Central&quot;), date = structure(c(1302480000, 1305072000, 1307750400, 
1310342400, 1313020800, 1315699200, 1318291200, 1323561600, 1326240000, 
1328918400), tzone = &quot;UTC&quot;, class = c(&quot;POSIXct&quot;, &quot;POSIXt&quot;)), 
    value = c(37207.43, -56936.95, -52871, 6980.05, 10703.16, 
    4006.1, 6505.3, 9661.29, 6897.26, 7212.87)), row.names = c(NA, 
-10L), class = c(&quot;tbl_df&quot;, &quot;tbl&quot;, &quot;data.frame&quot;))

答案1

得分: 2

filter 作用于整个数据框。你可以首先进行筛选：

file |&gt;
  filter(!(program == &quot;B&quot; &amp; location == &quot;Central&quot;)) |&gt;
  group_by(location, date) |&gt;
  summarize(value = sum(value))

或者你可以在 sum 中使用向量子集函数，像这样：

data &lt;- file |&gt;
  group_by(location, date) |&gt;
  summarize(value = 
    case_when(
      location == &quot;Central&quot; ~ sum(value[program != &quot;B&quot;]),
      TRUE ~ sum(value)
    )
  )

但你不能在向量/列上调用 filter。也不能像这样将其用作结果，location == "Central" ~ filter(program != "B")，当你希望结果是一个总和时。

英文:

filter works on a whole data frame. You can filter first:

file |&gt;
  filter(!(program == &quot;B&quot; &amp; location == &quot;Central&quot;)) |&gt;
  group_by(location, date) |&gt;
  summarize(value = sum (value))

Or you can use vector subsetting functions like [ inside the sum like this:

data &lt;- file |&gt;
  group_by(location, date) |&gt;
  summarize(value = 
    case_when(
      location == &quot;Central&quot; ~ sum(value[program != &quot;B&quot;]),
      TRUE ~ sum(value)
    )
  )

But you can't call filter on a vector/column. Nor can you use it like a result, location == "Central" ~ filter(program != "B") when you want the result to be a sum.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

在R中如何进行条件求和

问题

答案1

在R中创建结果组，每个元素仅使用一次（不重复的组合）。

使用R中的Rayshader为3D地图的高度添加颜色。

在dplyr代码中添加一行总计，但只在特定列下方。

在R程序中，想要使用特定值从向量值填充矩阵的特定列的各行。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。