2022年3月8日 02:40:12go评论121阅读模式

英文:

compute the floor of an integer division efficiently

问题

如何在Go语言中高效地计算两个整数的商（向下取整，而不是向零取整）？以下代码似乎给出了正确的结果，但看起来笨拙且低效：

func floorDiv(a, b int) int {
    if b < 0 {
        a, b = -a, -b
    }
    r := a % b
    if r < 0 {
        r += b
    }
    return (a - r) / b
}

（也可以在playground上查看）。有更好的方法吗？

英文:

How can I efficiently compute the quotient of two integers, rounded down (not towards zero) in Go? The following code seems to give the correct result, but looks awkward and inefficient:

func floorDiv(a, b int) int {
	if b &lt; 0 {
		a, b = -a, -b
	}
	r := a % b
	if r &lt; 0 {
		r += b
	}
	return (a - r) / b
}

(also on the playground). Is there a better way?

答案1

得分: 3

将整数转换为float64，进行除法运算，并使用math.Floor函数。

func floorDiv(a, b int) int {
  return int(math.Floor(float64(a)/float64(b)))
}

基准测试显示它们的运行时间大致相同，并且与简单的加法函数相同。

func BenchmarkFloorDiv(b *testing.B) {
  for i := 0; i < b.N; i++ {
    _ = floorDiv(i, 3)
  }
}
func BenchmarkFloorDivGo(b *testing.B) {
  for i := 0; i < b.N; i++ {
    _ = floorDivGo(i, 3)
  }
}
func BenchmarkFloorAdd(b *testing.B) {
  for i := 0; i < b.N; i++ {
    _ = add(i, 3)
  }
}
goos: darwin
goarch: amd64
pkg: github.com/my/repo/test_go
cpu: Intel(R) Core(TM) i7-8559U CPU @ 2.70GHz
BenchmarkFloorDiv-8     	1000000000	         0.2612 ns/op
BenchmarkFloorDivGo-8   	1000000000	         0.2610 ns/op
BenchmarkFloorAdd-8     	1000000000	         0.2565 ns/op
PASS
ok  	github.com/my/repo/test_go	1.000s

这表明它运行得非常快，我们只是在对循环进行基准测试。它不太可能成为瓶颈，我建议选择最简单的选项。

英文:

Convert the integers to float64, divide, and use math.Floor.

func floorDiv(a, b int) int {
  return int(math.Floor(float64(a)/float64(b)))
}

Benchmarking shows they run in about the same time, and the same as a simple add function.

func BenchmarkFloorDiv(b *testing.B) {
  for i := 0; i &lt; b.N; i++ {
    _ = floorDiv(i, 3)
  }
}
func BenchmarkFloorDivGo(b *testing.B) {
  for i := 0; i &lt; b.N; i++ {
    _ = floorDivGo(i, 3)
  }
}
func BenchmarkFloorAdd(b *testing.B) {
  for i := 0; i &lt; b.N; i++ {
    _ = add(i, 3)
  }
}
goos: darwin
goarch: amd64
pkg: github.com/my/repo/test_go
cpu: Intel(R) Core(TM) i7-8559U CPU @ 2.70GHz
BenchmarkFloorDiv-8     	1000000000	         0.2612 ns/op
BenchmarkFloorDivGo-8   	1000000000	         0.2610 ns/op
BenchmarkFloorAdd-8     	1000000000	         0.2565 ns/op
PASS
ok  	github.com/my/repo/test_go	1.000s

This suggests it's so fast we're just benchmarking the loop. It is unlikely to be a bottleneck, I would suggest the simplest option.

答案2

得分: 2

位操作在这里非常有用 - 我将其与将两个整数转换为浮点数并使用math.Floor()进行比较。

func IntFloorDiv(x int, y int) int {
  q := x / y
  r := x % y
  
  if r != 0 && x&math.MinInt != y&math.MinInt {
    q -= 1
  }
  
  return q
}

位操作使我们能够轻松确定二进制补码整数的符号：

一个二进制补码整数可能包含的最小[负]值具有设置了符号位和其余位都清零的值（0x80000000）。
整数值与该最小值进行按位与运算，可以得到0或该整数类型可能包含的最小值：
- 5 & math.MinInt 得到0
- 0 & math.MinInt 得到0
- -5 & Math.MinInt 得到math.MinInt

这样我们就可以做到：

计算 x / y 的商和余数。
如果余数为零，则商为 ⌊ x / y ⌋。
否则（余数非零），
1. 如果符号位不同，则商为负数，我们必须从商中减去1，得到 ⌊ x / y ⌋。
2. 如果符号位相同，则商为正数，商为 ⌊ x / y ⌋。

点击这里查看Go Playground

对于 -10 ≤ x ≤ +10，y = 3 的结果：

X	Y	⌊X÷Y⌋
-10	3	-4
-9	3	-3
-8	3	-3
-7	3	-3
-6	3	-2
-5	3	-2
-4	3	-2
-3	3	-1
-2	3	-1
-1	3	-1
0	3	0
1	3	0
2	3	0
3	3	1
4	3	1
5	3	1
6	3	2
7	3	2
8	3	2
9	3	3
10	3	3

基准测试

在5次不同运行中的基准测试时间显示，将其转换为浮点数并使用math.Floor()比整数除法和位操作慢了近21倍。

[是否真正重要完全取决于用例。]

基准测试代码每次循环迭代调用被基准测试的函数21次（对于-10到+10的范围），因此循环代码的成本不会掩盖被基准测试的函数。

❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2730 ns/op
Benchmark_floorDiv_Math-8       189576496                5.969 ns/op
PASS
ok      foobar.com/floordiv     2.266s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2718 ns/op
Benchmark_floorDiv_Math-8       196402200                5.954 ns/op
PASS
ok      foobar.com/floordiv     2.243s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2756 ns/op
Benchmark_floorDiv_Math-8       200432154                5.976 ns/op
PASS
ok      foobar.com/floordiv     2.271s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2814 ns/op
Benchmark_floorDiv_Math-8       195009298                6.083 ns/op
PASS
ok      foobar.com/floordiv     2.314s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2751 ns/op
Benchmark_floorDiv_Math-8       201196482                6.033 ns/op
PASS
ok      foobar.com/floordiv     2.290s

运行此基准测试的代码：

package main
import (
  "math"
  "testing"
)
func floorDiv_Int(x int, y int) int {
  q := x / y
  r := x % y
  
  if r != 0 && x&math.MinInt != y&math.MinInt {
    q -= 1
    }
  
  return q
}
func floorDiv_Math(x int, y int) int {
  return int(math.Floor(
    float64(x) / float64(y),
  ))
}
func Benchmark_floorDiv_Int(b *testing.B) {
  for i := 0; i < b.N; i++ {
    floorDiv_Int(-10, 3)
    floorDiv_Int(-9, 3)
    floorDiv_Int(-8, 3)
    floorDiv_Int(-7, 3)
    floorDiv_Int(-6, 3)
    floorDiv_Int(-5, 3)
    floorDiv_Int(-4, 3)
    floorDiv_Int(-3, 3)
    floorDiv_Int(-2, 3)
    floorDiv_Int(-1, 3)
    floorDiv_Int(0, 3)
    floorDiv_Int(1, 3)
    floorDiv_Int(2, 3)
    floorDiv_Int(3, 3)
    floorDiv_Int(4, 3)
    floorDiv_Int(5, 3)
    floorDiv_Int(6, 3)
    floorDiv_Int(7, 3)
    floorDiv_Int(8, 3)
    floorDiv_Int(9, 3)
    floorDiv_Int(10, 3)
  }
}
func Benchmark_floorDiv_Math(b *testing.B) {
  for i := 0; i < b.N; i++ {
    floorDiv_Math(-10, 3)
    floorDiv_Math(-9, 3)
    floorDiv_Math(-8, 3)
    floorDiv_Math(-7, 3)
    floorDiv_Math(-6, 3)
    floorDiv_Math(-5, 3)
    floorDiv_Math(-4, 3)
    floorDiv_Math(-3, 3)
    floorDiv_Math(-2, 3)
    floorDiv_Math(-1, 3)
    floorDiv_Math(0, 3)
    floorDiv_Math(1, 3)
    floorDiv_Math(2, 3)
    floorDiv_Math(3, 3)
    floorDiv_Math(4, 3)
    floorDiv_Math(5, 3)
    floorDiv_Math(6, 3)
    floorDiv_Math(7, 3)
    floorDiv_Math(8, 3)
    floorDiv_Math(9, 3)
    floorDiv_Math(10, 3)
  }
}

英文:

Bit twiddling is your friend here — I'll put this up against converting two integers to doubles and using math.Floor().

func IntFloorDiv(x int, y int) int {
q := x / y
r := x % y
if r != 0 &amp;&amp; x&amp;math.MinInt != y&amp;math.MinInt {
q -= 1
}
return q
}

Bit-twiddling allows us to easily identify the sign of a two's-complement integer:

The smallest [negative] value that a two's-complement integer may contain has the sign bit set and the remaining bits all clear (0x80000000).
A bitwise AND of an integer value against that smallest value gives us either 0 or the smallest value that that integer type may contain:
- 5 & math.MinInt yields 0
- 0 & math.MinInt yields 0
- -5 & Math.MinInt yields math.MinInt

That lets us do this:

Compute the quotient and remainder for x / y.
If the remainder is zero, the quotient is ⌊ x / y ⌋.
Otherwise (the remainder is non-zero),
1. If the sign bits differ, the quotient is negative and
  we must subtract 1 from the quotient to yield ⌊ x / y ⌋.
2. if the sign bits are identical, the quotient is positive and
  the quotient is ⌊ x / y ⌋.

Click here for the Go Playground

Results for x such that -10 ≤ x ≤ +10, and y = 3:

X	Y	⌊X÷Y⌋
-10	3	-4
-9	3	-3
-8	3	-3
-7	3	-3
-6	3	-2
-5	3	-2
-4	3	-2
-3	3	-1
-2	3	-1
-1	3	-1
0	3	0
1	3	0
2	3	0
3	3	1
4	3	1
5	3	1
6	3	2
7	3	2
8	3	2
9	3	3
10	3	3

Benchmarks

Benchmark timings across 5 different runs show that converting to float and using math.Floor() to be nearly 21x slower than integer division and bit twiddling.

[Whether or not that actually matters is entirely dependent on the use case.]

The benchmark code calls the function being benchmarked 21x per loop iteration (for -10 to +10 inclusive) so the cost of the loop code doesn't mask the function being benchmarked.

❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2730 ns/op
Benchmark_floorDiv_Math-8       189576496                5.969 ns/op
PASS
ok      foobar.com/floordiv     2.266s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2718 ns/op
Benchmark_floorDiv_Math-8       196402200                5.954 ns/op
PASS
ok      foobar.com/floordiv     2.243s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2756 ns/op
Benchmark_floorDiv_Math-8       200432154                5.976 ns/op
PASS
ok      foobar.com/floordiv     2.271s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2814 ns/op
Benchmark_floorDiv_Math-8       195009298                6.083 ns/op
PASS
ok      foobar.com/floordiv     2.314s
❯ go test -bench=.
goos: darwin
goarch: amd64
pkg: foobar.com/floordiv
cpu: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Benchmark_floorDiv_Int-8        1000000000               0.2751 ns/op
Benchmark_floorDiv_Math-8       201196482                6.033 ns/op
PASS
ok      foobar.com/floordiv     2.290s

running this benchmark:

package main
import (
&quot;math&quot;
&quot;testing&quot;
)
func floorDiv_Int(x int, y int) int {
q := x / y
r := x % y
if r != 0 &amp;&amp; x&amp;math.MinInt != y&amp;math.MinInt {
q -= 1
}
return q
}
func floorDiv_Math(x int, y int) int {
return int(math.Floor(
float64(x) / float64(y),
))
}
func Benchmark_floorDiv_Int(b *testing.B) {
for i := 0; i &lt; b.N; i++ {
floorDiv_Int(-10, 3)
floorDiv_Int(-9, 3)
floorDiv_Int(-8, 3)
floorDiv_Int(-7, 3)
floorDiv_Int(-6, 3)
floorDiv_Int(-5, 3)
floorDiv_Int(-4, 3)
floorDiv_Int(-3, 3)
floorDiv_Int(-2, 3)
floorDiv_Int(-1, 3)
floorDiv_Int(0, 3)
floorDiv_Int(1, 3)
floorDiv_Int(2, 3)
floorDiv_Int(3, 3)
floorDiv_Int(4, 3)
floorDiv_Int(5, 3)
floorDiv_Int(6, 3)
floorDiv_Int(7, 3)
floorDiv_Int(8, 3)
floorDiv_Int(9, 3)
floorDiv_Int(10, 3)
}
}
func Benchmark_floorDiv_Math(b *testing.B) {
for i := 0; i &lt; b.N; i++ {
floorDiv_Math(-10, 3)
floorDiv_Math(-9, 3)
floorDiv_Math(-8, 3)
floorDiv_Math(-7, 3)
floorDiv_Math(-6, 3)
floorDiv_Math(-5, 3)
floorDiv_Math(-4, 3)
floorDiv_Math(-3, 3)
floorDiv_Math(-2, 3)
floorDiv_Math(-1, 3)
floorDiv_Math(0, 3)
floorDiv_Math(1, 3)
floorDiv_Math(2, 3)
floorDiv_Math(3, 3)
floorDiv_Math(4, 3)
floorDiv_Math(5, 3)
floorDiv_Math(6, 3)
floorDiv_Math(7, 3)
floorDiv_Math(8, 3)
floorDiv_Math(9, 3)
floorDiv_Math(10, 3)
}
}

答案3

得分: 2

在这里是你要翻译的内容：

如果在乘法运算（a*b）中，要选择更快的方法：

func floorDivMustafa2(a, b int) int {
	if a%b != 0 && a*b < 0 {
		return a/b - 1
	}
	return a / b
}

我尝试了异或方法，但速度较慢：

//异或方法
func floorDivMustafa(a, b int) int {
	if a%b != 0 && (a < 0 || b < 0) && !(a < 0 && b < 0) {
		return a/b - 1
	}
	return a / b
}

这是我的单元测试方法：

func Test_floorDivSchwern(t *testing.T) {
	var (
		got, want int
	)
	for a := -10; a < 10; a++ {
		for b := -10; b < 10; b++ {
			if b == 0 {
				continue
			}
			got = floorDivMustafa(a, b)
			want = floorDivSchwern(a, b)
			if got != want {
				t.Errorf("divRound(%v/%v) = %v, want %v", a, b, got, want)
			}
		}
	}
}

基准测试方法：

func BenchmarkFloorDivMustafa2(b *testing.B) {
	for i := 0; i < b.N; i++ {
		for a := -100; a < 100; a++ {
			for b := -100; b < 100; b++ {
				if b == 0 {
					continue
				}
				_ = floorDivMustafa2(a, b)
			}
		}
	}
}

以及所有的输出：

>>>go test -bench=^Benchmark  -benchtime=1000x
BenchmarkSchwern-4                  1000            182345 ns/op
BenchmarkFloorDivMustafa-4          1000            441682 ns/op
BenchmarkFloorDivMustafa2-4         1000             31234 ns/op
BenchmarkJochen-4                   1000             60315 ns/op
BenchmarkIntfolurdiv-4              1000          26437611 ns/op
BenchmarkFloorDivICza-4             1000          26406776 ns/op

编辑在看到位操作在我的基准测试中较慢的结果后，我尝试了操作的基准方法，这是我的结果：

BenchmarkTwentyNumberBitwise-4                    328639            113790 ns/op
BenchmarkTwentyNumberMultiplication-4             295512            101345 ns/op

英文:

The benchmark in case multiply numbers (a*b) will be faster:-

func floorDivMustafa2(a, b int) int {
	if a%b != 0 &amp;&amp; a*b &lt; 0 {
		return a/b - 1
	}
	return a / b
}

I tried the Xor method but it is slower:

//xor method
func floorDivMustafa(a, b int) int {
	if a%b != 0 &amp;&amp; (a &lt; 0 || b &lt; 0) &amp;&amp; !(a &lt; 0 &amp;&amp; b &lt; 0) {
		return a/b - 1
	}
	return a / b
}

this is my unit test method

func Test_floorDivSchwern(t *testing.T) {
	var (
		got, want int
	)
	for a := -10; a &lt; 10; a++ {
		for b := -10; b &lt; 10; b++ {
			if b == 0 {
				continue
			}
			got = floorDivMustafa(a, b)
			want = floorDivSchwern(a, b)
			if got != want {
				t.Errorf(&quot;divRound(%v/%v) = %v, want %v&quot;, a, b, got, want)
			}
		}
	}
}

the bench-mark method:

func BenchmarkFloorDivMustafa2(b *testing.B) {
	for i := 0; i &lt; b.N; i++ {
		for a := -100; a &lt; 100; a++ {
			for b := -100; b &lt; 100; b++ {
				if b == 0 {
					continue
				}
				_ = floorDivMustafa2(a, b)
			}
		}
	}
}

and the output for all:

&gt;&gt;&gt;go test -bench=^Benchmark  -benchtime=1000x
BenchmarkSchwern-4                  1000            182345 ns/op
BenchmarkFloorDivMustafa-4          1000            441682 ns/op
BenchmarkFloorDivMustafa2-4         1000             31234 ns/op
BenchmarkJochen-4                   1000             60315 ns/op
BenchmarkIntfolurdiv-4              1000          26437611 ns/op
BenchmarkFloorDivICza-4             1000          26406776 ns/op

Edit after I saw the result of Bit twiddling slower in my benchmark I try the bench method of the op and this is my result:

BenchmarkTwentyNumberBitwise-4                    328639            113790 ns/op
BenchmarkTwentyNumberMultiplication-4             295512            101345 ns/op

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

compute the floor of an integer division efficiently

问题

答案1

答案2

基准测试

Benchmarks

答案3

How to use a file as stdin when exec a command

Go template extend and super?

尝试对 base64 解码时出现了 “illegal base64 data” 的错误。

如何在 Promise 内部返回错误？

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。