Intel® C++ Compiler 16.0 User and Reference Guide
Converts a float64 vector to a float32 vector. The corresponding instruction is VCVTPD2PS. This intrinsic applies only to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask: extern __m512 __cdecl _mm512_cvt_roundpd_pslo(__m512d v2, int rc);
With Mask: extern __m512 __cdecl _mm512_mask_cvt_roundpd_pslo(__m512 v1_old, __mmask8 k1, __m512d v2, int rc);
Performs an element-by-element conversion of the float64 vector v2 to a float32 vector, rounding each element as specified by the rounding control rc. The eight resulting elements are written into the lower half of the result vector. The remaining locations (upper half of the result vector) are set to '0'.
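The masked variant has two additional arguments: v1_old and k1. Only those elements of the source vector v2 whose corresponding bit is set in the vector mask k1 are converted. Elements of the result whose mask bit is clear are taken from the corresponding elements of v1_old.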
Returns the result of the conversion operation.
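The lane layout described above can be illustrated with a scalar model in plain C. This is only a sketch, not the intrinsic itself: the function names model_cvt_pd_pslo and model_mask_cvt_pd_pslo and the MODEL_* constants are hypothetical, the vectors are represented as ordinary arrays, and the rc argument is not modeled (the C cast rounds according to the current floating-point environment, which is round-to-nearest by default). The sketch also assumes that bit i of k1 controls element i and that masked-off elements are merged from v1_old.

```c
#include <stddef.h>
#include <stdint.h>

#define MODEL_PD_LANES 8   /* float64 elements in a 512-bit vector */
#define MODEL_PS_LANES 16  /* float32 elements in a 512-bit vector */

/* Hypothetical scalar model of the unmasked form.
 * Converts the 8 float64 inputs into result elements 0..7 and
 * sets result elements 8..15 (the upper half) to zero. */
static void model_cvt_pd_pslo(float dst[MODEL_PS_LANES],
                              const double v2[MODEL_PD_LANES])
{
    for (size_t i = 0; i < MODEL_PD_LANES; ++i)
        dst[i] = (float)v2[i];   /* rounding follows the current FP environment */
    for (size_t i = MODEL_PD_LANES; i < MODEL_PS_LANES; ++i)
        dst[i] = 0.0f;
}

/* Hypothetical scalar model of the masked form.
 * Assumption: bit i of k1 selects element i; elements whose bit is
 * clear are copied from v1_old instead of being converted. */
static void model_mask_cvt_pd_pslo(float dst[MODEL_PS_LANES],
                                   const float v1_old[MODEL_PS_LANES],
                                   uint8_t k1,
                                   const double v2[MODEL_PD_LANES])
{
    for (size_t i = 0; i < MODEL_PD_LANES; ++i) {
        if ((k1 >> i) & 1u)
            dst[i] = (float)v2[i];
        else
            dst[i] = v1_old[i];
    }
    for (size_t i = MODEL_PD_LANES; i < MODEL_PS_LANES; ++i)
        dst[i] = 0.0f;
}
```

In this model, a mask of 0x05 converts only elements 0 and 2 of v2; elements 1 and 3 through 7 of the lower half keep the values from v1_old, and the upper half is zero.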