2023年7月24日 16:36:59go评论83阅读模式

英文:

Why std::array<int, 10> is faster than std::vector<int>(10) in this scenario?

问题

I am given N, where N belongs to [3; 1000000] different numbers X, where X belongs to [0; 18446744073709551616).

How many numbers can be formed by rearranging the digits of a different number in the input ?

Example:

25 21 10242 42210 52 24021

Answer:

Explanation:

There are 5 different numbers namely {25, 52} and {10242, 42210, 24021}

25 -> 52 and 52 -> 25

10242 -> 42210 and 42210 -> 10242

10242 -> 24021 and 24021 -> 10242

42210 -> 24021 and 24021 -> 42210

lhs -> rhs means that rhs can be formed from the digits of lhs

I solved the task successfully. Here is my solution:

#include <iostream>
#include <string>
#include <vector>
#include <array>
#include <algorithm>
#include <iterator>

namespace Const {
    std::string::size_type constexpr sz_max{ 20 };
}

int main() {
    // 0
    std::ios_base::sync_with_stdio(false);
    std::cin.tie(nullptr);
    // 1
    int n;
    std::cin >> n;
    int i;
    std::string x;
    x.reserve(Const::sz_max);
    std::vector<std::array<int, 10>> lut(n);
    for (i = 0; i != n; ++i) {
        std::cin >> x;
        for (auto const& ch : x) {
            ++lut[i][ch - '0'];
        }
    }
    // 2
    std::sort(lut.begin(), lut.end());
    // 3
    auto prev{ lut.cbegin() };
    decltype(prev) curr;
    auto cnt{ 1 }, ans{ 0 };
    for (curr = std::next(prev); curr != lut.cend(); ++curr) {
        if (*curr == *prev) {
            ++cnt;
        }
        else {
            if (cnt != 1) {
                ans += cnt;
            }
            prev = curr;
            cnt = 1;
        }
    }
    if (cnt != 1) {
        ans += cnt;
    }
    prev = curr;
    cnt = 1;
    // 4
    std::cout << ans << '\n';
    return 0;
}

The execution time of this version is 550-600 ms.

If I change the definition of lut in // 1 from

std::vector<std::array<int, 10>> lut(n);

std::vector<std::vector<int>> lut(n, std::vector<int>(10));

the execution time increases drastically to 1050-1150 ms.

In both cases heap allocations have to take place but why the first ones are faster than the second ones ?

英文:

The task (from a Bulgarian judge, click on "Език" to change it to English):

I am given N, where N belongs to [3; 1000000] different numbers X, where X belongs to [0; 18446744073709551616).

How many numbers can be formed by rearranging the digits of a different number in the input ?

Example:

25 21 10242 42210 52 24021

Answer:

Explanation:

There are 5 different numbers namely {25, 52} and {10242, 42210, 24021}

25 -> 52 and 52 -> 25

10242 -> 42210 and 42210 -> 10242

10242 -> 24021 and 24021 -> 10242

42210 -> 24021 and 24021 -> 42210

lhs -> rhs means that rhs can be formed from the digits of lhs

I solved the task successfully. Here is my solution:

#include &lt;iostream&gt;
#include &lt;string&gt;
#include &lt;vector&gt;
#include &lt;array&gt;
#include &lt;algorithm&gt;
#include &lt;iterator&gt;

namespace Const {
	std::string::size_type constexpr sz_max{ 20 };
}

int main() {
	// 0
	std::ios_base::sync_with_stdio(false);
	std::cin.tie(nullptr);
	// 1
	int n;
	std::cin &gt;&gt; n;
	int i;
	std::string x;
	x.reserve(Const::sz_max);
	std::vector&lt;std::array&lt;int, 10&gt;&gt; lut(n);
	for (i = 0; i != n; ++i) {
		std::cin &gt;&gt; x;
		for (auto const&amp; ch : x) {
			++lut[i][ch - &#39;0&#39;];
		}
	}
	// 2
	std::sort(lut.begin(), lut.end());
	// 3
	auto prev{ lut.cbegin() };
	decltype(prev) curr;
	auto cnt{ 1 }, ans{ 0 };
	for (curr = std::next(prev); curr != lut.cend(); ++curr) {
		if (*curr == *prev) {
			++cnt;
		}
		else {
			if (cnt != 1) {
				ans += cnt;
			}
			prev = curr;
			cnt = 1;
		}
	}
	if (cnt != 1) {
		ans += cnt;
	}
	prev = curr;
	cnt = 1;
	// 4
	std::cout &lt;&lt; ans &lt;&lt; &#39;\n&#39;;
	return 0;
}

The execution time of this version is 550-600 ms.

If I change the definition of lut in // 1 from

std::vector&lt;std::array&lt;int, 10&gt;&gt; lut(n);

std::vector&lt;std::vector&lt;int&gt;&gt; lut(n, std::vector&lt;int&gt;(10));

the execution time increases drastically to 1050-1150 ms.

In both cases heap allocations have to take place but why the first ones are faster than the second ones ?

答案1

得分: 2

std::array 不像 std::vector 那样为其元素进行单独的堆分配，即

sizeof(std::array<int,16>) == sizeof(int) * 16

所有的元素都直接存储在对象中。因此，vector<array<int,10>> 中的所有数据都存储在相同的连续分配中。

英文:

std::array doesn't make a separate heap allocation for its elements like std::vector does, i.e.,

sizeof(std::array<int,16>) == sizeof(int) * 16

All of the elements are stored directly in the object. Because of this, all the data in a vector<array<int,10>> is stored in the same contiguous allocation.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

为什么在这种情况下，`std::array` 比 `std::vector(10)` 更快？

问题

答案1

auto open_flags = std::ios::binary; seems to produce the wrong type in MSVC. Is it a bug?

在C++中，我们可以在将其作为函数参数传递时创建一个数组吗？

无溢出的无分支方式加两个无符号整数？

最短路径是在 Dijkstra 算法中找到某个顶点的时候。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论