2017年5月31日 14:27:15go评论215阅读模式

英文:

Golang regular expression for parsing key value pair into a string map

问题

我正在寻找一种方法，使用正则表达式将以下字符串解析为map[string]string：

time="2017-05-30T19:02:08-05:00" level=info msg="some log message" app=sample size=10

我试图创建一个包含以下键值对的映射：

m["time"] = "2017-05-30T19:02:08-05:00"
m["level"] = "info"

等等

我尝试使用regex.FindAllStringIndex，但是无法找到合适的正则表达式。这是正确的方法吗？

英文:

I'm looking to parse the following string into a map[string]string using a regular expression:

time=&quot;2017-05-30T19:02:08-05:00&quot; level=info msg=&quot;some log message&quot; app=sample size=10

I'm trying to create a map that would have

m[&quot;time&quot;] = &quot;2017-05-30T19:02:08-05:00&quot;
m[&quot;level&quot;] = &quot;info&quot;

etc

I have tried using regex.FindAllStringIndex but can't quite come up with an appropriate regex? Is this the correct way to go?

答案1

得分: 5

这是一个示例，不使用正则表达式，而是使用strings.FieldsFunc来实现相同的功能。

package main

import (
	"fmt"
	"strings"
	"unicode"
)

const foo = `time="2017-05-30T19:02:08-05:00" level=info msg="some log message" app=sample size=10`

func main() {
	lastQuote := rune(0)
	f := func(c rune) bool {
		switch {
		case c == lastQuote:
			lastQuote = rune(0)
			return false
		case lastQuote != rune(0):
			return false
		case unicode.In(c, unicode.Quotation_Mark):
			lastQuote = c
			return false
		default:
			return unicode.IsSpace(c)

		}
	}

	// splitting string by space but considering quoted section
	items := strings.FieldsFunc(foo, f)

	// create and fill the map
	m := make(map[string]string)
	for _, item := range items {
		x := strings.Split(item, "=")
		m[x[0]] = x[1]
	}

	// print the map
	for k, v := range m {
		fmt.Printf("%s: %s\n", k, v)
	}
}

英文:

This is not using regex but is just an example of how to achieve the same by using strings.FieldsFunc.

https://play.golang.org/p/rr6U8xTJZT

package main

import (
	&quot;fmt&quot;
	&quot;strings&quot;
	&quot;unicode&quot;
)

const foo = `time=&quot;2017-05-30T19:02:08-05:00&quot; level=info msg=&quot;some log message&quot; app=sample size=10`

func main() {
	lastQuote := rune(0)
	f := func(c rune) bool {
		switch {
		case c == lastQuote:
			lastQuote = rune(0)
			return false
		case lastQuote != rune(0):
			return false
		case unicode.In(c, unicode.Quotation_Mark):
			lastQuote = c
			return false
		default:
			return unicode.IsSpace(c)

		}
	}

	// splitting string by space but considering quoted section
	items := strings.FieldsFunc(foo, f)

	// create and fill the map
	m := make(map[string]string)
	for _, item := range items {
		x := strings.Split(item, &quot;=&quot;)
		m[x[0]] = x[1]
	}

	// print the map
	for k, v := range m {
		fmt.Printf(&quot;%s: %s\n&quot;, k, v)
	}
}

答案2

得分: 4

你可以使用github.com/kr/logfmt包来代替自己编写正则表达式。

该包实现了logfmt键值对的解码。

示例logfmt消息：

foo=bar a=14 baz="hello kitty" cool%story=bro f %^asdf

示例JSON结果：

{
    "foo": "bar",
    "a": 14,
    "baz": "hello kitty",
    "cool%story": "bro",
    "f": true,
    "%^asdf": true
}

英文:

Instead of writing regex of your own, you could simply use the github.com/kr/logfmt package.

> Package implements the decoding of logfmt key-value pairs.
>
> Example logfmt message:
>
> foo=bar a=14 baz="hello kitty" cool%story=bro f %^asdf
>
> Example result in JSON:
>
> {
> "foo": "bar",
> "a": 14,
> "baz": "hello kitty",
> "cool%story": "bro",
> "f": true,
> "%^asdf": true
> }

答案3

得分: 2

使用命名捕获组和FindStringSubmatch、SubexpNames函数来处理正则表达式。例如：

s := `time=&quot;2017-05-30T19:02:08-05:00&quot; level=info msg=&quot;some log message&quot; app=sample size=10`
re := regexp.MustCompile(`time=&quot;(?P<time>.*?)&quot;\slevel=(?P<level>.*?)\s`)
values := re.FindStringSubmatch(s)
keys := re.SubexpNames()

// 创建映射
d := make(map[string]string)
for i := 1; i < len(keys); i++ {
    d[keys[i]] = values[i]
}
fmt.Println(d)
// 输出: map[time:2017-05-30T19:02:08-05:00 level:info]

values是一个包含所有子匹配的列表。第一个子匹配是整个与正则表达式匹配的表达式，其后是每个捕获组的子匹配。

如果你需要经常使用这个功能（例如类似Python的match.groupdict），你可以将代码封装成一个函数：

package main

import (
    "fmt"
    "regexp"
)

func groupmap(s string, r *regexp.Regexp) map[string]string {
    values := r.FindStringSubmatch(s)
    keys := r.SubexpNames()

    // 创建映射
    d := make(map[string]string)
    for i := 1; i < len(keys); i++ {
        d[keys[i]] = values[i]
    }

    return d
}

func main() {
    s := `time=&quot;2017-05-30T19:02:08-05:00&quot; level=info msg=&quot;some log message&quot; app=sample size=10`
    re := regexp.MustCompile(`time=&quot;(?P<time>.*?)&quot;\slevel=(?P<level>.*?)\s`)

    fmt.Println(groupmap(s, re))
    // 输出: map[time:2017-05-30T19:02:08-05:00 level:info]
}

希望对你有帮助！

英文:

Use named capturing groups in your regular expression and the FindStringSubmatch and SubexpNames functions. E.g.:

s := `time=&quot;2017-05-30T19:02:08-05:00&quot; level=info msg=&quot;some log message&quot; app=sample size=10`
re := regexp.MustCompile(`time=&quot;(?P&lt;time&gt;.*?)&quot;\slevel=(?P&lt;level&gt;.*?)\s`)
values := re.FindStringSubmatch(s)
keys := re.SubexpNames()

// create map
d := make(map[string]string)
for i := 1; i &lt; len(keys); i++ {
	d[keys[i]] = values[i]
}
fmt.Println(d)
// OUTPUT: map[time:2017-05-30T19:02:08-05:00 level:info]

values is a list containing all submatches. The first submatch is the whole expression that matches the regexp, followed by a submatch for each capturing group.

You can wrap the code into a function if you need this more frequently (i.e. if you need something like pythons match.groupdict):

package main

import (
    &quot;fmt&quot;
	&quot;regexp&quot;
)

func groupmap(s string, r *regexp.Regexp) map[string]string {
    values := r.FindStringSubmatch(s)
    keys := r.SubexpNames()

	// create map
    d := make(map[string]string)
    for i := 1; i &lt; len(keys); i++ {
    	d[keys[i]] = values[i]
    }

    return d
}

func main() {
    s := `time=&quot;2017-05-30T19:02:08-05:00&quot; level=info msg=&quot;some log message&quot; app=sample size=10`
    re := regexp.MustCompile(`time=&quot;(?P&lt;time&gt;.*?)&quot;\slevel=(?P&lt;level&gt;.*?)\s`)

    fmt.Println(groupmap(s, re))
    // OUTPUT: map[time:2017-05-30T19:02:08-05:00 level:info]
}

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

Golang regular expression for parsing key value pair into a string map

问题

答案1

答案2

答案3

Go恐慌：行尾有额外的分隔符

当使用fmt.Printf()时，会显示很多空格。

接口类型返回

Google App Engine仅注销应用程序，而不是整个Google帐户。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论