intrinsics | Developer Exchange Platform

How to interleave the bytes of three AVX registers in C++.

English: How to interleave the bytes of 3 AVX registers in C++ Question: #include <immintrin.h> #include &...

June 22, 2023 | 180 comments

English: __m128i initializers and _mm_madd_epi16: What is the result? Question: I tried the following code: __m128i x = {1,2,3,4,5...

June 1, 2023 | 159 comments

English: Transpose 4x4 int32 matrix using NEON Question: How can I efficiently transpose a matrix represented as four int32x4_t values? I cannot use ld4q_s32 and st4q_s32. English: ...

June 1, 2023 | 176 comments

English: Multiply 128-bit vectors of signed 16-bit integers, widening to 32-bit elements Question: I have 2 __m128i values. Each...

May 25, 2023 | 103 comments

English: How to avoid going out of bounds when loading data from the end of an array into AVX/AVX2 registers? ...

April 19, 2023 | 148 comments

English: Split 16-bit vector (__m128i) into 2 vectors of odd and even positions with Intel intrinsics Question: ...

April 11, 2023 | 103 comments

English: How to multiply-accumulate unsigned bytes into 32-bit elements without overflow with RISC-V exte...

April 4, 2023 | 158 comments

English: Usage of __AVX512F__ in Visual Studio for compiling code Question: I want to use __AVX512F__ to compile specific parts of the code. #ifndef...

March 7, 2023 | 198 comments

English: Intel store instructions on deliberately overlapping memory regions Question: I have to store the lower ...

January 3, 2020 | 211 comments