Intel® C++ Compiler 16.0 User and Reference Guide
Stores float32 vector. Corresponding instruction is VMOVAPS. This intrinsic only applies to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask extern void __cdecl _mm512_store_ps(void* mt, __m512 v1); |
With Mask extern void __cdecl _mm512_mask_store_ps(void* mt, __mmask16 k1, __m512 v1); |
v1 |
source vector to store elements from |
k1 |
writemask |
mt |
memory location to store vector elements (must be 64-byte aligned) |
Stores 16 single precision floating point elements of float32 vector v1 to the memory address mt . The address mt must be 64-byte-aligned.
In the masked variant, only those elements with the corresponding bit set in vector mask k1 are stored to memory. Elements in the destination memory vector with the corresponding bit clear in k1 retain their previous value.
Returns nothing.