Intel® C++ Compiler 16.0 User and Reference Guide
Multiplies signed packed 16/32-bit integer data elements of two vectors and stores low bits. The corresponding Intel® AVX2 instruction is VPMULLW or VPMULLD.
extern __m256i _mm256_mullo_epi16(__m256i s1, __m256i s2); |
extern __m256i _mm256_mullo_epi32(__m256i s1, __m256i s2); |
s1 |
integer source vector used for the operation |
s2 |
integer source vector used for the operation |
Performs a SIMD signed multiply of the packed signed 16- or 32-bit integers in source vectors s1 and s2 and stores the low 16- or 32-bits of each intermediate 32- or 64-bit result in the destination vector.
Result of the multiplication operation.