June 12, 2023, 20:05:41

Modify a dataframe based on the unique levels of a column, then combine values from 2 other columns inside them

Question

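I have the dataframe below and I want to reshape it so that it has 3 columns: the election column, Party_1 and Party_2. Each Party column should hold the Candidate pasted together with the Voteshare.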

df <- structure(list(ELECtion = c("PC_2009", "AC_2011", "PC_2014", 
"AC_2016", "PC_2019", "AC_2021", "PC_2009", "AC_2011", "PC_2014", 
"AC_2016", "PC_2019", "AC_2021"), Party = c("Party_1", "Party_1", 
"Party_1", "Party_1", "Party_1", "Party_1", "Party_2", "Party_2", 
"Party_2", "Party_2", "Party_2", "Party_2"), Candidate = c("Mr.Sen", 
"Dr.Kar", "Mr. Sen", "Dr.Kar", "Dr.Reddy", "Dr.Kar", "Dr.Dutta", 
"Mrs.Dondopani", "Mr. Das", "Mrs.Dondopani", "Mr,Das", "Mrs.Dondopani"
), Voteshare = c(0.2, 0.32, 0.12, 0.36, 0.005, 0.26, 0.15, 0.23, 
0.45, 0.28, 0.54, 0.38)), row.names = c(NA, -12L), class = c("tbl_df", 
"tbl", "data.frame"))

Answer 1

Score: 1

With paste + pivot_wider:

library(tidyr)
library(dplyr)
df %>% 
  mutate(cand_vote = paste(Candidate, paste0(Voteshare * 100, "%")), .keep = "unused") %>% 
  pivot_wider(names_from = "Party", values_from = "cand_vote")
#   ELECtion Party_1       Party_2          
#   <chr>    <chr>         <chr>            
# 1 PC_2009  Mr.Sen 20%    Dr.Dutta 15%     
# 2 AC_2011  Dr.Kar 32%    Mrs.Dondopani 23%
# 3 PC_2014  Mr. Sen 12%   Mr. Das 45%      
# 4 AC_2016  Dr.Kar 36%    Mrs.Dondopani 28%
# 5 PC_2019  Dr.Reddy 0.5% Mr,Das 54%       
# 6 AC_2021  Dr.Kar 26%    Mrs.Dondopani 38%
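
For readers without the tidyverse installed, the same paste-then-widen idea can be sketched in base R. This is not the answer's method: it swaps pivot_wider for stats::reshape, rebuilds the question's data as a plain data.frame, and the name wide is only illustrative.

```r
# Same values as the question's df, rebuilt as a plain data.frame (assumption:
# a tibble is not required for this sketch).
df <- data.frame(
  ELECtion  = rep(c("PC_2009", "AC_2011", "PC_2014",
                    "AC_2016", "PC_2019", "AC_2021"), times = 2),
  Party     = rep(c("Party_1", "Party_2"), each = 6),
  Candidate = c("Mr.Sen", "Dr.Kar", "Mr. Sen", "Dr.Kar", "Dr.Reddy", "Dr.Kar",
                "Dr.Dutta", "Mrs.Dondopani", "Mr. Das", "Mrs.Dondopani",
                "Mr,Das", "Mrs.Dondopani"),
  Voteshare = c(0.2, 0.32, 0.12, 0.36, 0.005, 0.26,
                0.15, 0.23, 0.45, 0.28, 0.54, 0.38)
)

# Step 1: paste the candidate and the percentage into one label column.
df$cand_vote <- paste(df$Candidate, paste0(df$Voteshare * 100, "%"))

# Step 2: widen, one column per party, keyed by ELECtion.
wide <- reshape(df[c("ELECtion", "Party", "cand_vote")],
                idvar = "ELECtion", timevar = "Party", direction = "wide")

# reshape() prefixes the new columns with "cand_vote."; strip that prefix.
names(wide) <- sub("^cand_vote\\.", "", names(wide))

print(wide)
```

reshape() keeps the rows in order of first appearance of each ELECtion value, so the result lines up with the pivot_wider output above.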

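Answer 2

Score: 1

We can unite the Candidate and Voteshare columns into a single variable, then use pivot_wider to spread the data.
scales::percent makes it easy to convert the fractions to percentages.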

library(tidyr)
library(dplyr)
df |> 
    # format the fractions as percentage strings
    mutate(Voteshare = scales::percent(Voteshare)) |> 
    # merge Candidate and Voteshare into one column named "value"
    unite(c("Candidate", "Voteshare"),
          sep = " ",
          col = "value") |> 
    # one column per party, filled with the merged values
    pivot_wider(names_from = Party,
                values_from = value)
# A tibble: 6 × 3
  ELECtion Party_1       Party_2            
  <chr>    <chr>         <chr>              
1 PC_2009  Mr.Sen 20.0%  Dr.Dutta 15.0%     
2 AC_2011  Dr.Kar 32.0%  Mrs.Dondopani 23.0%
3 PC_2014  Mr. Sen 12.0% Mr. Das 45.0%      
4 AC_2016  Dr.Kar 36.0%  Mrs.Dondopani 28.0%
5 PC_2019  Dr.Reddy 0.5% Mr,Das 54.0%       
6 AC_2021  Dr.Kar 26.0%  Mrs.Dondopani 38.0%
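
The two answers print the percentages differently: Answer 1 shows "20%" while Answer 2 shows "20.0%". The difference can be sketched in base R alone. Here sprintf stands in for scales::percent as an assumption for illustration, and the vector shares is a hypothetical subset of the Voteshare column.

```r
# Three of the vote shares from the question (illustrative subset).
shares <- c(0.2, 0.005, 0.54)

# Answer 1 style: paste0 converts each number on its own, so no padding zeros.
compact <- paste0(shares * 100, "%")
print(compact)
# [1] "20%"  "0.5%" "54%"

# Fixed one-decimal style, matching what Answer 2's output looks like.
# "%%" is a literal percent sign inside a sprintf format string.
padded <- sprintf("%.1f%%", shares * 100)
print(padded)
# [1] "20.0%" "0.5%"  "54.0%"
```

Either vector can be pasted onto Candidate before widening; the choice only affects how the labels read.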

