Intel® C++ Compiler 16.0 User and Reference Guide
Gather prefetch float32 vector with int32 indices. Corresponding instructions are VGATHERPF0DPS and VGATHERPF1DPS. This intrinsic only applies to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask extern void __cdecl _mm512_prefetch_i32extgather_ps(__m512i index, void const* mv, _MM_UPCONV_PS_ENUM conv, int scale, int pf_hint); extern void __cdecl _mm512_prefetch_i32gather_ps(__m512i index, void const* mv, int scale, int pf_hint); |
With Mask extern void __cdecl _mm512_mask_prefetch_i32extgather_ps(__m512i index, __mmask16 k1, void const* mv, _MM_UPCONV_PS_ENUM conv, int scale, int pf_hint); extern void __cdecl _mm512_mask_prefetch_i32gather_ps(__m512i index, __mmask16 k1, void const* mv, int scale, int pf_hint); |
A set of 16 memory locations, to which base address mv and int32 index vector index with scale scale point, are prefetched from memory to L1 or L2 level of cache, depending on the pf_hint parameter.
The non-masked variant of the intrinsic is equivalent to the masked variant with full mask (k1=0xffff).
You can use the simplified versions of this intrinsic, without ext in the name, if no up-conversion is required.
None.