2023年6月12日 22:16:50go评论109阅读模式

英文:

Use column names from dataframe and create a new one with one column with those column names as values

问题

我有以下数据框，我想创建一个新的数据框，其中包含两列，一列名为“设施”（Amenities），其值将包括“nn_bank”或“nn_hospital”，另一列名为“名称”（Name），其值将是“nn_bank”或“nn_hospital”的名称。

df <- structure(list(state = c("West Bengal", "West Bengal", "West Bengal", 
"West Bengal", "West Bengal"), nn_hospital = c("Khundkuri Hospital", 
"Khundkuri Hospital", "Mankar Rural Hospital", "Khundkuri Hospital", 
"Khundkuri Hospital"), distance_nn_hospital = c(8949.68646563084, 
17217.1419457099, 16939.2318150416, 15812.9872649418, 1408.372117616
), nn_bank = c("contai", "contai", "Allahabad Bank", "contai", 
"contai"), distance_nn_bank = c(13959.9950089655, 20598.4763042432, 
19688.6296071566, 20537.3799009137, 11385.8738290783)), class = "data.frame", row.names = c(NA, 
5L))

结果应该如下：

英文:

I have the dataframe below and I want to create a new one with 2 columns one named Amenities which will include either "nn_bank" or "nn_hospital" as value and the other column "Name" with the name of the nn_bank or nn_hospital.

df&lt;-structure(list(state = c(&quot;West Bengal&quot;, &quot;West Bengal&quot;, &quot;West Bengal&quot;, 
&quot;West Bengal&quot;, &quot;West Bengal&quot;), nn_hospital = c(&quot;Khundkuri Hospital&quot;, 
&quot;Khundkuri Hospital&quot;, &quot;Mankar Rural Hospital&quot;, &quot;Khundkuri Hospital&quot;, 
&quot;Khundkuri Hospital&quot;), distance_nn_hospital = c(8949.68646563084, 
17217.1419457099, 16939.2318150416, 15812.9872649418, 1408.372117616
), nn_bank = c(&quot;contai&quot;, &quot;contai&quot;, &quot;Allahabad Bank&quot;, &quot;contai&quot;, 
&quot;contai&quot;), distance_nn_bank = c(13959.9950089655, 20598.4763042432, 
19688.6296071566, 20537.3799009137, 11385.8738290783)), class = &quot;data.frame&quot;, row.names = c(NA, 
5L))

result should be like

答案1

得分: 1

Here is the translated content:

当你有多个值列时，一种选择是使用 tidyr::pivot_longer 的 names_pattern 参数以及特殊的 .value 来重塑你的数据。之后你需要进行一些额外的清理。

library(tidyr)
library(dplyr, warn = FALSE)
library(stringr)
df |&gt;
  pivot_longer(-state,
    names_to = c(&quot;.value&quot;, &quot;Amenities&quot;),
    names_pattern = &quot;(.*?nn)_(.*)&quot;
  ) |&gt;
  select(-c(state, distance_nn), Name = nn) |&gt;
  mutate(across(c(Amenities, Name), str_to_title))
#&gt; # A tibble: 10 &#215; 2
#&gt;    Amenities Name                 
#&gt;    &lt;chr&gt;     &lt;chr&gt;                
#&gt;  1 Hospital  Khundkuri医院      
#&gt;  2 Bank      Contai               
#&gt;  3 Hospital  Khundkuri医院      
#&gt;  4 Bank      Contai               
#&gt;  5 Hospital  Mankar Rural医院
#&gt;  6 Bank      Allahabad银行     
#&gt;  7 Hospital  Khundkuri医院      
#&gt;  8 Bank      Contai               
#&gt;  9 Hospital  Khundkuri医院      
#&gt; 10 Bank      Contai

(Note: I have left the code part unchanged as per your request.)

英文:

When you have multiple value columns one option would be to use the names_pattern argument of tidyr::pivot_longer along with the special .value to reshape your data. Afterwards you have to do some additional cleaning.

library(tidyr)
library(dplyr, warn = FALSE)
library(stringr)
df |&gt;
  pivot_longer(-state,
    names_to = c(&quot;.value&quot;, &quot;Amenities&quot;),
    names_pattern = &quot;(.*?nn)_(.*)&quot;
  ) |&gt;
  select(-c(state, distance_nn), Name = nn) |&gt;
  mutate(across(c(Amenities, Name), str_to_title))
#&gt; # A tibble: 10 &#215; 2
#&gt;    Amenities Name                 
#&gt;    &lt;chr&gt;     &lt;chr&gt;                
#&gt;  1 Hospital  Khundkuri Hospital   
#&gt;  2 Bank      Contai               
#&gt;  3 Hospital  Khundkuri Hospital   
#&gt;  4 Bank      Contai               
#&gt;  5 Hospital  Mankar Rural Hospital
#&gt;  6 Bank      Allahabad Bank       
#&gt;  7 Hospital  Khundkuri Hospital   
#&gt;  8 Bank      Contai               
#&gt;  9 Hospital  Khundkuri Hospital   
#&gt; 10 Bank      Contai

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Create a new column in the dataframe using the column names as its values.

问题

答案1

如何绘制带有标签的facet_wrap，类似于facet_grid（独立的y轴）。

在R数据框中反转非NA值的顺序。

R: 基于嵌套群组计算比例

在另一列的名称基础上添加一行到新元素中。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。