2022年8月30日 10:18:26go评论109阅读模式

英文:

Is branch prediction purely cpu behavior, or will the compiler give some hints?

问题

在Go标准包src/sync/once.go中，最近的修订更改了代码片段：

if atomic.LoadUint32(&o.done) == 1 {
		return
	}
//otherwise
...

改为：

//if atomic.LoadUint32(&o.done) == 1 {
//		return
//	}
if atomic.LoadUint32(&o.done) == 0 {
		...
}

问题是，根据这个改变，热路径不再在代码中明确表示，这个改变对分支预测有不良影响吗？Go编译器在后续运行中是否提供了一些帮助，还是分支预测完全由CPU处理？

提交页面：https://github.com/golang/go/commit/ca8354843ef9f30207efd0a40bb6c53e7ba86892

英文:

In go standard package src/sync/once.go, a recent revision change the snippets

if atomic.LoadUint32(&amp;o.done) == 1 {
		return
	}
//otherwise
...

to:

//if atomic.LoadUint32(&amp;o.done) == 1 {
//		return
//	}
if atomic.LoadUint32(&amp;o.done) == 0 {
		...
	}

the question is, according to this change, hot path is no longer explicit in the code, does this change has bad impact on branch prediction ? does go compiler make some help in the subsequent run of this function or the whole thing of branch prediction is on cpu?

commit page:https://github.com/golang/go/commit/ca8354843ef9f30207efd0a40bb6c53e7ba86892

答案1

得分: 1

你所提到的特定提交（通过Brits在评论中找到）并不是为了利用分支预测。它使用了关于Go编译器如何对小函数进行内联扩展的知识。

我们可以选择以这种方式编写函数：

func (o *Object) Operate() {
    if (o.alreadyDone) { return }
    ... some code ...
}

或者以这种方式编写：

func (o *Object) Operate() {
    if (!o.alreadyDone) { o.reallyOperate() }
}

其中o.reallyOperate接管了... some code ...部分。

如果... some code ...部分超过几条指令，并且按照原始的once.Do的方式编写，Go编译器通过让调用者调用实际函数来实现该函数。但是，当它像替代方案那样很短时，调用者将函数实现为内联测试、分支，然后可能调用reallyOperate函数。

由于sync.Once实际上每个Once对象只调用一次函数，在其余时间内不调用该函数，这种内联扩展导致在每个Do调用上除了第一个调用之外都不进行调用。这实际上使得调用点的代码变得更大（增加了一两条指令），但由于通常不执行调用，结果通常更快。

英文:

The particular commit you're talking about (found by Brits in a comment) is not an attempt to make use of branch prediction. It's using knowledge about how the Go compiler does inline expansion of small functions.

We're given the option of writing a function in this way:

func (o *Object) Operate() {
    if (o.alreadyDone) { return }
    ... some code ...
}

Or, alternatively, writing it this way:

func (o *Object) Operate() {
    if (!o.alreadyDone) { o.reallyOperate() }
}

where o.reallyOperate takes over the ... some code ... part.

If the some code part is more than a few instructions long and is written the way the original once.Do was, the Go compiler implements the function by having the caller call the actual function. But when it's as short as the replacement, the caller implements the function as an inline test, branch, and then maybe call the reallyOperate function.

Since sync.Once actually calls the function only once per Once object, and the rest of the time, does not call, this inline expansion results in not making the call on every Do call except the first one. This actually makes the code at the call site bigger (by one or two instructions) but since the call is normally not executed, the result is normally faster.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

分支预测是纯粹由CPU执行的行为，还是编译器会提供一些提示？

问题

答案1

‘For’循环的前置和后置空语句

How do I handle opening/closing Db connection in a Go app?

查找一个单词是否是另一个单词的复数形式。

在Go语言中进行多线程请求并且无法获得高RPS。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。