Intel® C++ Compiler 16.0 User and Reference Guide
Gathers a float32 vector using int32 indices. The corresponding instruction is VGATHERDPS. This intrinsic only applies to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask
extern __m512 __cdecl _mm512_i32extgather_ps(__m512i index, void const* mv, _MM_UPCONV_PS_ENUM conv, int scale, int hint);
extern __m512 __cdecl _mm512_i32gather_ps(__m512i index, void const* mv, int scale);
With Mask
extern __m512 __cdecl _mm512_mask_i32extgather_ps(__m512 v1_old, __mmask16 k1, __m512i index, void const* mv, _MM_UPCONV_PS_ENUM conv, int scale, int hint);
extern __m512 __cdecl _mm512_mask_i32gather_ps(__m512 v1_old, __mmask16 k1, __m512i index, void const* mv, int scale);
Up-converts a set of 16 memory locations and gathers them into a float32 vector. Each location is addressed by the base address mv plus the corresponding int32 element of the index vector index multiplied by scale.
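The addressing described above can be illustrated with a scalar model in portable C. This is only a sketch, not Intel's implementation: gather_ps_model is a hypothetical helper name, scale is assumed to be a byte multiplier (as in VGATHERDPS addressing), and no up-conversion is modeled, so the elements in memory are assumed to already be float32.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model (not an Intel API): element i of the result
   is read from the byte address mv + index[i] * scale. */
static void gather_ps_model(float dst[16], const int32_t index[16],
                            const void *mv, int scale)
{
    const unsigned char *base = (const unsigned char *)mv;
    for (int i = 0; i < 16; ++i) {
        /* Widen the index before scaling so the byte offset is computed
           in pointer-sized arithmetic. */
        ptrdiff_t offset = (ptrdiff_t)index[i] * scale;
        /* memcpy sidesteps alignment and aliasing concerns in the model. */
        memcpy(&dst[i], base + offset, sizeof(float));
    }
}
```

With scale set to 4 (the size of a float32), index[i] acts as an ordinary array subscript into a float array starting at mv.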
For the masked variant, elements of the resulting vector whose corresponding bit in the writemask k1 is set are populated with the gathered values. The remaining elements are populated with the corresponding elements from v1_old.
The non-masked variant of the intrinsic is equivalent to the masked variant with full mask (k1=0xffff).
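The writemask behavior can be sketched with a scalar model in portable C. The code below is illustrative only: mask_gather_ps_model is a hypothetical name, the writemask is represented as a plain uint16_t with bit i controlling element i, scale is assumed to be a byte multiplier, and up-conversion is not modeled.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model (not an Intel API) of the masked gather:
   lanes whose bit in k1 is set are loaded from mv + index[i] * scale,
   all other lanes are copied from v1_old. */
static void mask_gather_ps_model(float dst[16], const float v1_old[16],
                                 uint16_t k1, const int32_t index[16],
                                 const void *mv, int scale)
{
    const unsigned char *base = (const unsigned char *)mv;
    for (int i = 0; i < 16; ++i) {
        if ((k1 >> i) & 1u) {
            ptrdiff_t offset = (ptrdiff_t)index[i] * scale;
            memcpy(&dst[i], base + offset, sizeof(float));
        } else {
            /* This model does not touch memory for masked-off lanes. */
            dst[i] = v1_old[i];
        }
    }
}
```

Calling the model with k1 equal to 0xffff loads every lane and ignores v1_old, which mirrors the equivalence between the non-masked and masked variants.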
These intrinsics do not have broadcast support.
Returns the result of the up-convert load operation.