June 29, 2023 21:48:35

Conditional fractioning

Question

Suppose I have the dataset

ID	Country	Sales
1	Austria	6
1	Austria	6
1	Belgium	6
2	Belgium	10
2	Czech	10
3	Denmark	3
3	Germany	3

I want to add another variable: each country's share of the total sales within its ID, expressed as a fraction.

ID	Country	Sales	fraction
1	Austria	6	0.666
1	Austria	6	0.666
1	Belgium	6	0.333
2	Belgium	10	0.5
2	Czech	10	0.5
3	Denmark	3	1
3	Denmark	3	1

Any help would be appreciated!

Answer 1

Score: 1

library(dplyr)  # the .by argument requires dplyr >= 1.1.0
your_data |>
  # total sales per country within each ID
  mutate(country_total = sum(Sales), .by = c(ID, Country)) |>
  # divide by the total sales of the whole ID
  mutate(fraction = country_total / sum(Sales), .by = ID)
#   ID Country Sales country_total  fraction
# 1  1 Austria     6            12 0.6666667
# 2  1 Austria     6            12 0.6666667
# 3  1 Belgium     6             6 0.3333333
# 4  2 Belgium    10            10 0.5000000
# 5  2   Czech    10            10 0.5000000
# 6  3 Denmark     3             3 0.5000000
# 7  3 Germany     3             3 0.5000000

Using this sample data:

your_data = read.table(text = 'ID Country Sales
1 Austria 6
1 Austria 6
1 Belgium 6
2 Belgium 10
2 Czech 10
3 Denmark 3
3 Germany 3', header = TRUE)
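
The `.by` argument used above was introduced in dplyr 1.1.0. On an older installation, the same calculation can be sketched with `group_by()`. This is a minimal sketch that assumes the sample data is stored in `your_data`; the name `result` is only illustrative.

```r
library(dplyr)

# Sample data from the question
your_data <- read.table(text = "ID Country Sales
1 Austria 6
1 Austria 6
1 Belgium 6
2 Belgium 10
2 Czech 10
3 Denmark 3
3 Germany 3", header = TRUE)

result <- your_data %>%
  group_by(ID, Country) %>%
  mutate(country_total = sum(Sales)) %>%          # sales per country within an ID
  group_by(ID) %>%                                # regroup by ID only
  mutate(fraction = country_total / sum(Sales)) %>%
  ungroup()

result
```

The second `group_by(ID)` replaces the earlier grouping, so `sum(Sales)` in the last `mutate()` is the total for the whole ID.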

Answer 2

Score: 1

Alternatively, we could use `add_count` to get the same result:

```r
library(dplyr)
df %>%
  add_count(ID, name = 'n') %>%
  add_count(ID, Country, name = 'gn') %>%
  mutate(new = gn / n) %>%
  select(-c(n, gn))
# output
# A tibble: 7 × 4
#      ID Country Sales   new
#   <dbl> <chr>   <dbl> <dbl>
# 1     1 Austria     6 0.667
# 2     1 Austria     6 0.667
# 3     1 Belgium     6 0.333
# 4     2 Belgium    10 0.5
# 5     2 Czech      10 0.5
# 6     3 Denmark     3 0.5
# 7     3 Germany     3 0.5
```
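
The `add_count` calls in this answer count rows, so they match the sales-based fractions only because `Sales` is constant within each `ID` in the sample. If the values differed, a variant that weights the counts by `Sales` through the `wt` argument of `add_count` might be closer to what the question asks for. This is a sketch; the data frame `df` and the helper column names `id_total` and `country_total` are assumptions for illustration.

```r
library(dplyr)

# Sample data from the question
df <- read.table(text = "ID Country Sales
1 Austria 6
1 Austria 6
1 Belgium 6
2 Belgium 10
2 Czech 10
3 Denmark 3
3 Germany 3", header = TRUE)

result <- df %>%
  add_count(ID, wt = Sales, name = "id_total") %>%               # sum of Sales per ID
  add_count(ID, Country, wt = Sales, name = "country_total") %>% # sum of Sales per ID and Country
  mutate(fraction = country_total / id_total) %>%
  select(-c(id_total, country_total))

result
```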

Answer 3

Score: 0

Base R:

> df$fraction <- ave(df$Sales, list(df$ID, df$Country), FUN = sum) / ave(df$Sales, df$ID, FUN = sum)
> df
  ID Country Sales  fraction
1  1 Austria     6 0.6666667
2  1 Austria     6 0.6666667
3  1 Belgium     6 0.3333333
4  2 Belgium    10 0.5000000
5  2   Czech    10 0.5000000
6  3 Denmark     3 0.5000000
7  3 Germany     3 0.5000000
