2017年3月11日 03:16:07go评论101阅读模式

英文:

Counting the occurrence of one or more substrings in a string

问题

我知道要计算一个子字符串的出现次数，可以使用"strings.Count(, )"。如果我想计算substring1或substring2的出现次数，有没有比写另一行strings.count()更优雅的方法？

英文:

I know that for counting the occurrence of one substring I can use "strings.Count(<string>, <substring>)". What if I want to count the number of occurrences of substring1 OR substring2? Is there a more elegant way than writing another new line with strings.count()?

答案1

得分: 17

使用正则表达式（regular expression）：

https://golang.org/pkg/regexp/

aORb := regexp.MustCompile("A|B")

matches := aORb.FindAllStringIndex("A B C B A", -1)
fmt.Println(len(matches))

英文:

Use a regular expression:

https://play.golang.org/p/xMsHIYKtkQ

aORb := regexp.MustCompile(&quot;A|B&quot;)

matches := aORb.FindAllStringIndex(&quot;A B C B A&quot;, -1)
fmt.Println(len(matches))

答案2

得分: 2

另一种进行子字符串匹配的方法是使用suffixarray包。下面是一个匹配多个模式的示例：

package main

import (
	"fmt"
	"index/suffixarray"
	"regexp"
)

func main() {
	r := regexp.MustCompile("an")
	index := suffixarray.New([]byte("banana"))
	results := index.FindAllIndex(r, -1)
	fmt.Println(len(results))
}

你也可以使用Lookup函数来匹配单个子字符串。

英文:

Another way to do substring matching is with the suffixarray package. Here is an example of matching multiple patterns:

package main

import (
	&quot;fmt&quot;
	&quot;index/suffixarray&quot;
	&quot;regexp&quot;
)

func main() {
	r := regexp.MustCompile(&quot;an&quot;)
	index := suffixarray.New([]byte(&quot;banana&quot;))
	results := index.FindAllIndex(r, -1)
	fmt.Println(len(results))
}

You can also match a single substring with the Lookup function.

答案3

得分: 0

如果你想在一个大字符串中计算匹配项的数量，而不需要为了获取长度而分配所有索引的空间，然后再将它们丢弃，你可以使用Regexp.FindStringIndex在循环中匹配连续的子字符串：

func countMatches(s string, re *regexp.Regexp) int {
	total := 0
	for start := 0; start < len(s); {
		remaining := s[start:] // 切片操作是廉价的
		loc := re.FindStringIndex(remaining)
		if loc == nil {
			break
		}
		// loc[0] 是匹配的起始索引，
		// loc[1] 是匹配的结束索引（不包含）
		start += loc[1]
		total++
	}
	return total
}

func main() {
	s := "abracadabra"
	fmt.Println(countMatches(s, regexp.MustCompile(`a|b`)))
}

在 Go Playground 上运行的可执行示例

英文:

If you want to count the number of matches in a large string, without allocating space for all the indices just to get the length and then throwing them away, you can use Regexp.FindStringIndex in a loop to match against successive substrings:

func countMatches(s string, re *regexp.Regexp) int {
	total := 0
	for start := 0; start &lt; len(s); {
		remaining := s[start:] // slicing the string is cheap
		loc := re.FindStringIndex(remaining)
		if loc == nil {
			break
		}
		// loc[0] is the start index of the match,
		// loc[1] is the end index (exclusive)
		start += loc[1]
		total++
	}
	return total
}

func main() {
	s := &quot;abracadabra&quot;
	fmt.Println(countMatches(s, regexp.MustCompile(`a|b`)))
}

runnable example at Go Playground

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

计算字符串中一个或多个子字符串的出现次数

问题

答案1

答案2

答案3

在Go 1.18中，strings.Title()已被弃用。现在应该使用什么？以及如何使用？

存储一组构造函数，用于所有符合相同接口的类型。

如何从头开始计算CID？

Golang GAE 将图像 URL 保存到 Blobstore

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论