为什么 `str_match` 不像 `regex101` 一样捕获分组？

huangapple

117266
文章

0
评论

2023年2月10日 15:23:06go评论86阅读模式

英文:

Why does str_match not capture group the same way regex101 does?

问题

我有这个字符串：
bn = &quot;this_is_a_test_12345.txt&quot;
我想捕获/提取其中的数字部分（`12345`）。在regex101.com上尝试的正则表达式如下：
[![enter image description here][1]][1]
但在R中尝试不起作用：
str_match(bn, &quot;.*(\\d*).*&quot;) # 不起作用
str_match(bn, &quot;.*_(\\d*).*&quot;) # 起作用（第二列是匹配组）
我认为我可能错过了一些关于贪婪性或其他方面的简单东西，但我不确定...
  [1]: https://i.stack.imgur.com/psLRp.png

英文:

I have this string:

bn = &quot;this_is_a_test_12345.txt&quot;

And I want to capture/extract the numeric part (12345). Trying it on regex101.com works like this:

Yet doing it in R does not work that way:

str_match(bn, &quot;.*(\\d*).*&quot;) # works not
str_match(bn, &quot;.*_(\\d*).*&quot;) # works (second column is the matched group)

I think I am missing something very simple about greediness or so, but I am not sure...

答案1

得分: 1

如评论中所提到的，您需要使用?来捕获非贪婪模式：

sub(".*?(\\d+).*", "\\1", bn)
# [1] "12345"

英文:

As mentioned in the comments, you will need the non greedy pattern as captured with ?:

sub(&quot;.*?(\\d+).*&quot;, &quot;\&quot;, bn)
# [1] &quot;12345&quot;

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

本文由 huangapple 发表于 2023年2月10日 15:23:06
转载请务必保留本文链接：https://go.coder-hub.com/75408022.html

r
regex
stringr

Dockerizing a shiny app with an error in the building process

go 97 05/17

观察在另一个模块中发生的动作

go 87 03/04

将日期和时间分开在R中

go 109 07/13

List of Tables and List of Figures in Table of Contents using Quarto book in pdf format

go 100 05/29

为什么 `str_match` 不像 `regex101` 一样捕获分组？

问题

答案1

Dockerizing a shiny app with an error in the building process

观察在另一个模块中发生的动作

将日期和时间分开在R中

List of Tables and List of Figures in Table of Contents using Quarto book in pdf format

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。