问题

使用Go的正则表达式，我正在尝试从原始文本中提取一组预定义的有序键值（多行）对，其中最后一个元素可能是可选的。例如，

Key1:
 SomeValue1
 MoreValue1
Key2:
 SomeValue2
 MoreValue2
OptionalKey3:
 SomeValue3
 MoreValue3

（在这里，我想将所有值作为命名组提取出来）

如果我使用默认的贪婪模式(?s:Key1:\n(?P<Key1>.*)Key2:\n(?P<Key2>.*)(?:OptionalKey3:\n(?P<OptionalKey3>.*))?)，它永远不会看到OptionalKey3，并将剩余的文本匹配为Key2。

如果我使用非贪婪模式(?s:Key1:\n(?P<Key1>.*)Key2:\n(?P<Key2>.*?)(?:OptionalKey3:\n(?P<OptionalKey3>.*))?)，它甚至看不到SomeValue2，并立即停止匹配：https://regex101.com/r/QE2g3o/1

有没有办法在可选匹配OptionalKey3的同时，也能捕获所有其他的值？

英文:

Using Go's regexp, I'm trying to extract a predefined set of ordered key-value (multiline) pairs whose last element may be optional from a raw text, e.g.,

 Key1:
  SomeValue1
  MoreValue1
 Key2:
  SomeValue2
  MoreValue2
 OptionalKey3:
  SomeValue3
  MoreValue3

(here, I want to extract all the values as named groups)

If I use the default greedy pattern (?s:Key1:\n(?P<Key1>.*)Key2:\n(?P<Key2>.*)(?:OptionalKey3:\n(?P<OptionalKey3>.*))?), it never sees OptionalKey3 and matches the rest of the text as Key2.

If I use the non-greedy pattern (?s:Key1:\n(?P<Key1>.*)Key2:\n(?P<Key2>.*?)(?:OptionalKey3:\n(?P<OptionalKey3>.*))?), it doesn't even see SomeValue2 and stops immediately: https://regex101.com/r/QE2g3o/1

Is there a way to optionally match OptionalKey3 while also able to capture all the other ones?

答案1

得分: 2

使用

(?s)\AKey1:\n(?P<Key1>.*)Key2:\n(?P<Key2>.*?)(?:OptionalKey3:\n(?P<OptionalKey3>.*))?\z

参见正则表达式验证。

解释

--------------------------------------------------------------------------------
  (?s)                     设置此块的标志（使用.匹配\n）（区分大小写）（使用^和$
                           正常匹配）（正常匹配空格和#）
--------------------------------------------------------------------------------
  \A                       字符串的开头
--------------------------------------------------------------------------------
  Key1:                    'Key1:'
--------------------------------------------------------------------------------
  \n                       '\n'（换行符）
--------------------------------------------------------------------------------
  (?P<Key1>                 分组并捕获到“Key1”：
--------------------------------------------------------------------------------
    .*                       任意字符（0次或多次（匹配最多的次数））
--------------------------------------------------------------------------------
  )                        结束“Key1”
--------------------------------------------------------------------------------
  Key2:                    'Key2:'
--------------------------------------------------------------------------------
  \n                       '\n'（换行符）
--------------------------------------------------------------------------------
  (?P<Key2>                分组并捕获到“Key2”：
--------------------------------------------------------------------------------
    .*?                      任意字符（0次或多次（匹配最少的次数））
--------------------------------------------------------------------------------
  )                        结束“Key2”
--------------------------------------------------------------------------------
  (?:                      分组，但不捕获（可选的（匹配最多的次数））：
--------------------------------------------------------------------------------
    OptionalKey3:            'OptionalKey3:'
--------------------------------------------------------------------------------
    \n                       '\n'（换行符）
--------------------------------------------------------------------------------
    (?P<OptionalKey3>         分组并捕获到“OptionalKey3”：
--------------------------------------------------------------------------------
      .*                       任意字符（0次或多次（匹配最多的次数））
--------------------------------------------------------------------------------
    )                        结束“OptionalKey3”
--------------------------------------------------------------------------------
  )?                       结束分组
--------------------------------------------------------------------------------
  \z                       字符串的结尾

英文:

Use

(?s)\AKey1:\n(?P&lt;Key1&gt;.*)Key2:\n(?P&lt;Key2&gt;.*?)(?:OptionalKey3:\n(?P&lt;OptionalKey3&gt;.*))?\z

See regex proof.

EXPLANATION

--------------------------------------------------------------------------------
  (?s)                     set flags for this block (with . matching
                           \n) (case-sensitive) (with ^ and $
                           matching normally) (matching whitespace
                           and # normally)
--------------------------------------------------------------------------------
  \A                       the beginning of the string
--------------------------------------------------------------------------------
  Key1:                    &#39;Key1:&#39;
--------------------------------------------------------------------------------
  \n                       &#39;\n&#39; (newline)
--------------------------------------------------------------------------------
  (?P&lt;Key1&gt;                 group and capture to &quot;Key1&quot;:
--------------------------------------------------------------------------------
    .*                       any character (0 or more times (matching
                             the most amount possible))
--------------------------------------------------------------------------------
  )                        end of &quot;Key1&quot;
--------------------------------------------------------------------------------
  Key2:                    &#39;Key2:&#39;
--------------------------------------------------------------------------------
  \n                       &#39;\n&#39; (newline)
--------------------------------------------------------------------------------
  (?P&lt;Key2&gt;                group and capture to &quot;Key2&quot;:
--------------------------------------------------------------------------------
    .*?                      any character (0 or more times (matching
                             the least amount possible))
--------------------------------------------------------------------------------
  )                        end of &quot;Key2&quot;
--------------------------------------------------------------------------------
  (?:                      group, but do not capture (optional
                           (matching the most amount possible)):
--------------------------------------------------------------------------------
    OptionalKey3:            &#39;OptionalKey3:&#39;
--------------------------------------------------------------------------------
    \n                       &#39;\n&#39; (newline)
--------------------------------------------------------------------------------
    (?P&lt;OptionalKey3&gt;         group and capture to &quot;OptionalKey3&quot;:
--------------------------------------------------------------------------------
      .*                       any character (0 or more times
                               (matching the most amount possible))
--------------------------------------------------------------------------------
    )                        end of &quot;OptionalKey3&quot;
--------------------------------------------------------------------------------
  )?                       end of grouping
--------------------------------------------------------------------------------
  \z                       the end of the string

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

正则表达式：多行，非贪婪匹配，直到可选字符串。

问题

答案1

如何连接io.Reader和io.Writer？

当我从net.TCPConn中只收到EOF时，我如何知道连接是否已经断开？

How can I properly demonstrate concurrency AND parallelism in Go/Golang?

GO WebSocket保持活动的适当时间跨度是多久？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论