2023年6月8日 21:40:25go评论101阅读模式

英文:

Using gsub to separate a sting into cities and counties

问题

I am trying to split a string into cities and countries but having difficulty when the city or country in question is more than one word (Eg, aix-en-provence or United States). The current code I am using a will work for most like Paris, France but not for ones similar to those above.

locations
paris_france
miami_united states
new york_united states
aix-en-provence_france
auckland_new_zealand
current code used
city = gsub("([A-z]+)_([A-z]+)", "\", locations)
country = gsub("([A-z]+)_([A-z]+)", "\", locations)

so now city will return paris and country will be france which is fine but for other stuff like auckland and zealand will be returned. Guessing its obviously a case of getting it to recognize more than one word before or after the '_'

英文:

 locations
 paris_france
 miami_united states
 new york_united states
 aix-en-provence_france
 auckland_new_zealand
current code used
city = gsub(&quot;([A-z]+)_([A-z]+)&quot;, &quot;\&quot;, locations)
country = gsub(&quot;([A-z]+)_([A-z]+)&quot;, &quot;\&quot;, locations)

答案1

得分: 3

因为 new_zealand，我们必须多加小心。

base R

strcapture("^([^_]+)_(.*)$", locs$locations, proto = c(city="", country=""))
#              city       country
# 1           paris        france
# 2           miami united states
# 3        new york united states
# 4 aix-en-provence        france
# 5        auckland   new_zealand

tidyr

library(tidyr)
separate_wider_delim(locs, locations, delim = "_", names = c("city", "country"), too_many = "merge")
# # A tibble: 5 × 2
#   city            country      
#   <chr>           <chr>        
# 1 paris           france       
# 2 miami           united states
# 3 new york        united states
# 4 aix-en-provence france       
# 5 auckland        new_zealand

Data

locs <- structure(list(locations = c("paris_france", "miami_united states", "new york_united states", "aix-en-provence_france", "auckland_new_zealand")), row.names = c(NA, -5L), class = "data.frame")

英文:

Because of new_zealand, we have to take a little extra caution.

base R

strcapture(&quot;^([^_]+)_(.*)$&quot;, locs$locations, proto = c(city=&quot;&quot;, country=&quot;&quot;))
#              city       country
# 1           paris        france
# 2           miami united states
# 3        new york united states
# 4 aix-en-provence        france
# 5        auckland   new_zealand

tidyr

library(tidyr)
separate_wider_delim(locs, locations, delim = &quot;_&quot;, names = c(&quot;city&quot;, &quot;country&quot;), too_many = &quot;merge&quot;)
# # A tibble: 5 &#215; 2
#   city            country      
#   &lt;chr&gt;           &lt;chr&gt;        
# 1 paris           france       
# 2 miami           united states
# 3 new york        united states
# 4 aix-en-provence france       
# 5 auckland        new_zealand

Data

locs &lt;- structure(list(locations = c(&quot;paris_france&quot;, &quot;miami_united states&quot;, &quot;new york_united states&quot;, &quot;aix-en-provence_france&quot;, &quot;auckland_new_zealand&quot;)), row.names = c(NA, -5L), class = &quot;data.frame&quot;)

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用 gsub 将字符串分成城市和县。

问题

答案1

base R

tidyr

base R

tidyr

将元素按照 separate() 函数分成不同的列。

找到R中每年的最大时间差

如何在R中删除数据框中的空白空间

从分组数据中使用分段回归提取多个变量的断点。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

发表评论