June 6, 2017 12:19:52 · go

What does "Scan advances the Scanner to the next token" mean in Go's bufio.Scanner?

Question

According to the documentation for Scanner.Scan, Scan() advances the Scanner to the next token, but what does that mean? I find that the results of Scanner.Text and Scanner.Bytes can differ, which is puzzling.

This test doesn't always fail, but it starts failing as the file gets larger:

func TestScanner(t *testing.T) {
	path := "/tmp/test.txt"
	f, err := os.Open(path)
	if err != nil {
		panic(fmt.Sprint("failed to open ", path))
	}
	defer f.Close()
	scanner := bufio.NewScanner(f)
	bs := make([][]byte, 0)
	for scanner.Scan() {
		bs = append(bs, scanner.Bytes())
	}
	f, err = os.Open(path)
	if err != nil {
		panic(fmt.Sprint("failed to open ", path))
	}
	defer f.Close()
	scanner = bufio.NewScanner(f)
	ss := make([]string, 0)
	for scanner.Scan() {
		ss = append(ss, scanner.Text())
	}
	for i, b := range bs {
		if string(b) != ss[i] {
			t.Errorf("expect %s, got %s", ss[i], string(b))
		}
	}
}

Answer 1

Score: 4

The token is defined by the scanner's split function. Scan() returns when the split function finds a token or there's an error.

The Text() and Bytes() methods both return the current token. The Text() method returns a newly allocated string holding a copy of the token. The Bytes() method does not allocate memory and returns a slice whose backing array may be overwritten by a subsequent call to Scan().

Copy the slice returned from Bytes() to avoid this issue:

for scanner.Scan() {
    bs = append(bs, append([]byte(nil), scanner.Bytes()...))
}
