2023-08-10 21:15:22

Multiple delimiters in a column

Question

I have a results column in which each cell holds multiple answers separated by ',' or '/'. I need to count the instances of each response.

What I want to end up with is

I'm at a loss how to split the Answers column in the first table. I'm terrible with string splits.

I've tried using both strsplit and str_split on the data frame directly, and also turning the column into a list and applying them that way, but it was messy and kept giving me various error messages. I'm close with

df %>% separate_longer_delim(Answers, delim = ',/')

But I can't get the delim part to work. I can use either the comma or the slash but not both together.

Answer 1

Score: 4


As described in the documentation for separate_longer_delim:

> delim: By default, it is interpreted as a fixed string; use stringr::regex() and friends to split in other ways.
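
To make the "fixed string" point concrete, here is a small base R sketch. It is not part of the original answer, and the vector x is made-up sample input. Read as a fixed string, ',/' only matches a comma immediately followed by a slash, so it splits nothing here; a regex character class matches either character.

```r
# Hypothetical sample input, for illustration only
x <- c("toys, dirt", "cars/toys/dirt")

# Fixed string ",/": looks for the literal two-character sequence,
# which never occurs in x, so each element comes back unsplit
fixed_split <- strsplit(x, ",/", fixed = TRUE)

# Regex character class: a comma OR a slash, followed by optional whitespace
regex_split <- strsplit(x, "[,/]\\s*")

print(fixed_split)
print(regex_split)
```

The same distinction is presumably what separates delim = ',/' from stringr::regex("[,/]\\s*") in separate_longer_delim.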

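library(tidyr)
library(dplyr)
# Split on a comma or a slash (plus any trailing whitespace), one row per answer,
# then count how often each answer occurs
df %>%
  separate_longer_delim(Answers, stringr::regex("[,/]\\s*")) %>%
  count(Answers, sort = TRUE)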
#          Answers n
# 1           cars 2
# 2           dirt 2
# 3           toys 2
# 4 all the things 1
# 5          dolls 1
# 6         trucks 1

Data

df <- data.frame(id = 1:4, Answers = c("toys, dirt", "cars, dolls", "cars/toys/dirt", "all the things, trucks"))
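
Since the question mentions struggling with strsplit, the following is a hedged base R sketch of the same count without tidyr. It is an alternative to the answer's approach rather than a replacement for it; the names pieces and counts are introduced here for illustration only.

```r
# Same sample data as in the answer
df <- data.frame(
  id = 1:4,
  Answers = c("toys, dirt", "cars, dolls", "cars/toys/dirt", "all the things, trucks")
)

# Split every cell on a comma or slash, dropping whitespace after the delimiter.
# as.character() guards against the column being a factor (the default before R 4.0).
pieces <- strsplit(as.character(df$Answers), "[,/]\\s*")

# Flatten the list of character vectors and tabulate the individual answers
counts <- table(unlist(pieces))

# Most frequent answers first
print(sort(counts, decreasing = TRUE))
```

This loses the id column, so the tidyr version is likely the better fit if each answer needs to stay linked to its row.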
