January 9, 2023, 13:42:53

Loop vectorising of vector3d products using Eigen

Question

Hi, I'm using Eigen to roll up or vectorise a number of loop operations in a particle filter.

In essence, I have two matrices:

Eigen::Matrix<double, N, 3> A;
Eigen::Matrix<double, 3, N> B;

where N is a large number.

I would like a one-liner that does the equivalent of:

Eigen::Matrix<double, N, 1> D;
for (size_t i = 0; i < N; i++)
{
   D.row(i) = A.row(i)*B.col(i);
}

I have tried D = A.rowwise() * B.colwise(), but these broadcasting methods do not define an operator*() between them.

Answer 1

Score: 1

Here is a version that is pretty much optimal for small vectors like these.

Eigen::MatrixX3d A;
Eigen::Matrix3Xd B;
Eigen::VectorXd D = (A.array() * B.array().transpose()).rowwise().sum();

Fine-tuning this expression for larger vector sizes, e.g. if these were square matrices, is a bit of a challenge, but for 3 rows/columns it works fine.

If you can choose the shape of your matrices, consider changing to this:

Eigen::Matrix3Xd A, B;
Eigen::VectorXd D = (A.array() * B.array()).colwise().sum();

or this:

Eigen::MatrixX3d A, B;
Eigen::VectorXd D = (A.array() * B.array()).rowwise().sum();

Both produce assembly that looks very good and makes full use of vector instructions. Tested with Eigen-3.4.0 and GCC-11.3, compiled with -O3 -DNDEBUG. The second one (MatrixX3d with rowwise().sum()) is better for low dimensions (MatrixX2d up to MatrixX4d). The first one (Matrix3Xd with colwise().sum()) will scale better to large sizes.
