2020年8月10日 23:10:12go评论71阅读模式

英文:

Split string using multiple patterns, where second pattern matches smaller parts of the first

问题

Here's the translated code snippet:

for (String substr : "&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f&#167;ltest1 &#167;rtest2".split("((?<=(&#167;x(&#167;[0-9a-f]){6}))|(?<=&#167;[0-9a-z])|(?=&#167;[0-9a-z]))")) {
  System.out.println(substr);
}

Please note that the code itself remains in English since code keywords and syntax are typically written in English regardless of the programming language used.

英文:

I'm reading special "formatting codes" in a string and am trying to split the string so that I have those formatting codes and the string's text separated.

There are two "types" of formatting codes: "Encoded" hex colors: §x§7§3§7§5§f§f and other codes in the format of §r.

Given the example string: §x§7§3§7§5§f§f§ltest1 §rtest2

I need the larger pattern split as a whole, and then the smaller ones. I can do what I want on those patterns separately, but am having trouble combining them into a single regex. Because the second pattern matches pieces of the first pattern, it's just splitting everything into smaller groups.

I'm trying this:

for (String substr : &quot;&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f&#167;ltest1 &#167;rtest2&quot;.split(&quot;((?&lt;=(&#167;x(&#167;[0-9a-f]){6}))|(?&lt;=&#167;[0-9a-z])|(?=&#167;[0-9a-z]))&quot;)) {
  System.out.println(substr);
}

My expected output is:

&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f
&#167;l
test1
&#167;r
test

My actual output is:

&#167;x
&#167;7
&#167;3
&#167;7
&#167;5
&#167;f
&#167;f
&#167;l
test1
&#167;r
test2

When I split the expressions up into different split tests, they work, they're just not working together.

答案1

得分: 2

Instead of splitting, you could just use this simplified regex for matching:

&#167;x(?:&#167;[0-9a-f]){6}|&#167;[0-9a-z]|[^&#167;\s]+

RegEx Demo

RegEx Details:

§x(?:§[0-9a-f]){6}: 匹配以 §x 开头的文本，后面跟着 6 个十六进制字符
|: 或
§[0-9a-z]: 匹配以 § 开头的文本，后面跟着一个字母数字字符
|: 或
[^§\s]+: 匹配 1 个或多个非空格且非 § 字符

Code:

final String regex = "&quot;&#167;x(?:&#167;[0-9a-f]){6}|&#167;[0-9a-z]|[^&#167;\\s]+&quot;";
final String string = "&quot;&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f&#167;ltest1 &#167;rtest2&quot;";

final Pattern pattern = Pattern.compile(regex);
final Matcher matcher = pattern.matcher(string);

while (matcher.find()) {
    System.out.println(matcher.group(0));
}

英文:

Instead of splitting, you could just use this simplified regex for matching:

&#167;x(?:&#167;[0-9a-f]){6}|&#167;[0-9a-z]|[^&#167;\s]+

RegEx Demo

RegEx Details:

§x(?:§[0-9a-f]){6}: Match text starting with §x and 6 hex characters
|: OR
§[0-9a-z]: Match text starting with § and an alphanumeric
|: OR
[^§\s]+: Match 1+ non-whitespace and non-§ characters

Code:

final String regex = &quot;&#167;x(?:&#167;[0-9a-f]){6}|&#167;[0-9a-z]|[^&#167;\\s]+&quot;;
final String string = &quot;&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f&#167;ltest1 &#167;rtest2&quot;;

final Pattern pattern = Pattern.compile(regex);
final Matcher matcher = pattern.matcher(string);

while (matcher.find()) {
    System.out.println( matcher.group(0) );
}

答案2

得分: 1

你可以使用以下正则表达式：

在这里查看它的工作原理

 ?((?:&#167;[^&#167;])(?=[^&#167;])|[^&#167; ]{2,})

它的工作原理如下：

? 可选地匹配空格字符。
((?:§[^§])(?=[^§])|[^§ ]{2,}) 捕获以下之一：
- (?:§[^§])(?=[^§]) 匹配以下内容：
  - (?:§[^§]) 匹配 § 后跟任何字符，但不包括 §。
  - (?=[^§]) 前瞻，确保接下来的字符不是 §（与 (?!§) 相同，但更高效）。
- [^§ ]{2,} 匹配任何字符，除了 § 或空格，两次或更多。

通过替换为 \n$1

结果：

&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f
&#167;l
test1
&#167;r
test2

英文:

You can use the following regex:

See it working here

 ?((?:&#167;[^&#167;])(?=[^&#167;])|[^&#167; ]{2,})

How it works:

? optionally match the space character
((?:§[^§])(?=[^§])|[^§ ]{2,}) capture either of the following:
- (?:§[^§])(?=[^§]) match the following:
  - (?:§[^§]) match § followed by any character except §
  - (?=[^§]) lookahead ensuring what follows is not § (same as (?!§) but more efficient)
- [^§ ]{2,} match any character except § or space two or more times

With the substitution of \n$1

Result:

&#167;x&#167;7&#167;3&#167;7&#167;5&#167;f&#167;f
&#167;l
test1
&#167;r
test2

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

使用多个模式拆分字符串，其中第二个模式匹配第一个模式的较小部分。

问题

答案1

答案2

如何在Java中打印数学乘法乘表（特定数字的整个乘法表）。

Spring Tool Suite（STS）4.8.0 RELEASE – 从4.7.2升级后Java消失了

Failed to execute goal org.springframework.boot:spring-boot-maven-plugin:2.2.5.RELEASE:run (default-cli)

Java可以删除类对象吗？

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论