May 11, 2023, 20:19:54

Separate columns in R based on the second occurrence of a dot ("\\.")

Question

I am having a hard time separating a column in my data set. I have this tibble:

library(dplyr)
#> 
#> Attaching package: 'dplyr'
#> The following objects are masked from 'package:stats':
#> 
#>     filter, lag
#> The following objects are masked from 'package:base':
#> 
#>     intersect, setdiff, setequal, union
tibble(sample=c("AM.F10.T1", "AM.F10.T2","DA.AD.1","DA.AD.2", "ES.AD.1"))
#> # A tibble: 5 × 1
#>   sample   
#>   <chr>    
#> 1 AM.F10.T1
#> 2 AM.F10.T2
#> 3 DA.AD.1  
#> 4 DA.AD.2  
#> 5 ES.AD.1

Created on 2023-05-11 with reprex v2.0.2

and I want to make it look like this:

#>   sample        col1      col2
#>   <chr>    
#> 1 AM.F10.T1     AM.F10     T1
#> 2 AM.F10.T2     AM.F10     T2
#> 3 DA.AD.1       DA.AD       1
#> 4 DA.AD.2       DA.AD       2
#> 5 ES.AD.1       ES.AD       1

Thank you for taking the time to read my post.

Answer 1

Score: 1

You can do this with tidyr::separate_wider_regex(), which was added in tidyr 1.3.0. It lets you state explicitly what goes into the first and second columns and what separates them: named patterns become columns, and the unnamed pattern (the second dot) is matched but dropped.

library(tidyr)
tibble(sample=c("AM.F10.T1", "AM.F10.T2","DA.AD.1","DA.AD.2", "ES.AD.1")) |> 
  separate_wider_regex(
     cols = sample, 
     patterns = c(first  = "\\w*\\.\\w*", "\\.", second = "\\w*")
  )
#> # A tibble: 5 × 2
#>   first  second
#>   <chr>  <chr> 
#> 1 AM.F10 T1    
#> 2 AM.F10 T2    
#> 3 DA.AD  1     
#> 4 DA.AD  2     
#> 5 ES.AD  1

Created on 2023-05-11 with reprex v2.0.2
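If a recent tidyr is not available, the same three-piece pattern can be sketched in base R. This is only an illustration, under the assumption that the pieces are joined into one anchored regular expression; the helper name split_at_second_dot is hypothetical and not part of tidyr or any other package.

```r
# Hypothetical helper (not from tidyr): split "A.B.C" into "A.B" and "C"
split_at_second_dot <- function(x) {
  # Same pieces as the patterns in the answer above, joined and anchored
  pattern <- "^(\\w*\\.\\w*)\\.(\\w*)$"
  m <- regmatches(x, regexec(pattern, x, perl = TRUE))
  # Each element of m is c(full match, group 1, group 2),
  # or character(0) when the string does not match
  parts <- t(vapply(
    m,
    function(p) if (length(p) == 3) p[2:3] else c(NA_character_, NA_character_),
    character(2),
    USE.NAMES = FALSE
  ))
  data.frame(
    sample = x,
    first = parts[, 1],
    second = parts[, 2],
    stringsAsFactors = FALSE
  )
}

samples <- c("AM.F10.T1", "AM.F10.T2", "DA.AD.1", "DA.AD.2", "ES.AD.1")
result <- split_at_second_dot(samples)
print(result)
```

Unlike the tidyr call, this sketch keeps the original sample column and returns NA for strings that do not match the pattern.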

Answer 2

Score: 1

Although the extract function from the tidyr package was superseded by separate_wider_regex, I think it's still useful sometimes.

The .* in the first capture group is greedy: it matches as much as it can, so the second capture group gets only what follows the last dot, which in this data is the second dot.

library(tidyr)
# df is the tibble from the question
df <- tibble(sample = c("AM.F10.T1", "AM.F10.T2", "DA.AD.1", "DA.AD.2", "ES.AD.1"))
extract(df, sample, regex = "(.*)\\.(.*)", into = c("col1", "col2"), remove = FALSE)
#> # A tibble: 5 × 3
#>   sample    col1   col2 
#>   <chr>     <chr>  <chr>
#> 1 AM.F10.T1 AM.F10 T1   
#> 2 AM.F10.T2 AM.F10 T2   
#> 3 DA.AD.1   DA.AD  1    
#> 4 DA.AD.2   DA.AD  2    
#> 5 ES.AD.1   ES.AD  1
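To see why the greedy first group ends up splitting at the last dot, it can help to compare it with a lazy first group in base R. This is a small sketch separate from the answer's code; the variable names are illustrative only.

```r
x <- c("AM.F10.T1", "DA.AD.1")

# Greedy (.*): the first group takes as much as it can,
# so the split happens at the last dot
greedy_col1 <- sub("^(.*)\\.(.*)$", "\\1", x, perl = TRUE)
greedy_col2 <- sub("^(.*)\\.(.*)$", "\\2", x, perl = TRUE)

# Lazy (.*?): the first group takes as little as it can,
# so the split happens at the first dot
lazy_col1 <- sub("^(.*?)\\.(.*)$", "\\1", x, perl = TRUE)
lazy_col2 <- sub("^(.*?)\\.(.*)$", "\\2", x, perl = TRUE)

print(data.frame(x, greedy_col1, greedy_col2, lazy_col1, lazy_col2))
```

With the greedy pattern, "AM.F10.T1" splits into "AM.F10" and "T1"; with the lazy pattern it splits into "AM" and "F10.T1".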
