Intel® C++ Compiler 16.0 User and Reference Guide

_mm512_cvtfxpnt_roundpd_epu32lo / _mm512_mask_cvtfxpnt_roundpd_epu32lo

Rounds a float64 vector and converts it to a uint32 vector. The corresponding instruction is VCVTFXPNTPD2UDQ. This intrinsic applies only to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).

Syntax

Without Mask

extern __m512i __cdecl _mm512_cvtfxpnt_roundpd_epu32lo(__m512d v2, int rc);

With Mask

extern __m512i __cdecl _mm512_mask_cvtfxpnt_roundpd_epu32lo(__m512i v1_old, __mmask8 k1, __m512d v2, int rc);

Parameters

v2

float64 vector used for the conversion

v1_old

Source vector that retains old values of the destination vector; the resulting vector gets corresponding elements from v1_old for zero mask bits

k1

Writemask; only those elements of the source vector v2 whose corresponding bit is set to '1' in k1 are computed and stored in the result; result elements whose corresponding bit in k1 is zero are copied from the corresponding elements of v1_old

rc

Rounding control values; these can be one of the following:

  • _MM_FROUND_TO_NEAREST_INT - rounds to nearest even
  • _MM_FROUND_TO_NEG_INF - rounds to negative infinity
  • _MM_FROUND_TO_POS_INF - rounds to positive infinity
  • _MM_FROUND_TO_ZERO - rounds to zero
  • _MM_FROUND_CUR_DIRECTION - rounds using the current rounding mode from the MXCSR register
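
The four explicit rounding modes can be illustrated with a scalar model of a single element. This is a minimal sketch in portable C, not the intrinsic itself and not Intel's implementation: the enum model_rc and the function model_round_to_u32 are hypothetical names introduced only for this illustration, the real _MM_FROUND_* macros are not used, and inputs are assumed to lie in the range 0 to 4294967295. Behavior for negative, out-of-range, or NaN inputs and for _MM_FROUND_CUR_DIRECTION is not modeled.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-ins for the rc values; the real _MM_FROUND_* macros
   come from the Intel intrinsic headers and are not used in this sketch. */
enum model_rc {
    MODEL_RC_NEAREST_EVEN, /* models _MM_FROUND_TO_NEAREST_INT */
    MODEL_RC_NEG_INF,      /* models _MM_FROUND_TO_NEG_INF */
    MODEL_RC_POS_INF,      /* models _MM_FROUND_TO_POS_INF */
    MODEL_RC_ZERO          /* models _MM_FROUND_TO_ZERO */
};

/* Scalar model of one element: round x according to rc, then return it
   as an unsigned 32-bit integer.
   Assumption: 0 <= x <= 4294967295; other inputs are outside this sketch. */
static uint32_t model_round_to_u32(double x, enum model_rc rc)
{
    assert(x >= 0.0 && x <= 4294967295.0);

    uint64_t t = (uint64_t)x;      /* integer part (truncation) */
    double frac = x - (double)t;   /* fractional part in [0, 1) */

    switch (rc) {
    case MODEL_RC_NEAREST_EVEN:
        /* Round up past the halfway point; ties go to the even integer. */
        if (frac > 0.5 || (frac == 0.5 && (t & 1u)))
            t++;
        break;
    case MODEL_RC_POS_INF:
        if (frac > 0.0)
            t++;
        break;
    case MODEL_RC_NEG_INF: /* equals truncation for non-negative x */
    case MODEL_RC_ZERO:
        break;
    }
    return (uint32_t)t;
}
```

Under this model, 2.5 becomes 2 with nearest-even, negative-infinity, and zero rounding, and 3 with positive-infinity rounding, while 3.5 becomes 4 with nearest-even rounding.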

Description

Rounds each element of the float64 vector v2 according to rc and converts it to an unsigned int32 element. The eight resulting elements are written into the lower half of the result vector. The remaining locations (upper half of the result vector) are set to '0'.

The masked variant has two additional arguments: v1_old and k1. Only those elements of v2 with the corresponding bit set in vector mask k1 are converted; the remaining elements of the lower half of the result are copied from v1_old.
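
The element placement and masking described above can be sketched as a scalar model in portable C. This is an illustration under stated assumptions, not the intrinsic and not Intel's implementation: model_mask_cvt_lo and model_cvt_lo are hypothetical names, plain arrays stand in for the __m512i and __m512d types, the conversion is simple truncation (as if rc were _MM_FROUND_TO_ZERO), and every input is assumed to fit in an unsigned 32-bit integer. Zeroing the upper half in the masked form follows the Description text rather than any separately verified hardware behavior.

```c
#include <stdint.h>

#define MODEL_SRC_LANES 8   /* a 512-bit vector holds 8 float64 elements */
#define MODEL_DST_LANES 16  /* a 512-bit vector holds 16 int32 elements */

/* Scalar model of the masked form (hypothetical helper).
   Assumptions: truncating conversion, all v2 elements within uint32 range. */
static void model_mask_cvt_lo(uint32_t dst[MODEL_DST_LANES],
                              const uint32_t v1_old[MODEL_DST_LANES],
                              uint8_t k1,
                              const double v2[MODEL_SRC_LANES])
{
    /* Lower half: convert where the mask bit is 1, else keep v1_old. */
    for (int i = 0; i < MODEL_SRC_LANES; i++) {
        if ((k1 >> i) & 1u)
            dst[i] = (uint32_t)v2[i];
        else
            dst[i] = v1_old[i];
    }
    /* Upper half: set to 0, as stated in the Description. */
    for (int i = MODEL_SRC_LANES; i < MODEL_DST_LANES; i++)
        dst[i] = 0;
}

/* Scalar model of the unmasked form: all eight mask bits set. */
static void model_cvt_lo(uint32_t dst[MODEL_DST_LANES],
                         const double v2[MODEL_SRC_LANES])
{
    static const uint32_t unused_old[MODEL_DST_LANES] = {0};
    model_mask_cvt_lo(dst, unused_old, 0xFF, v2);
}
```

With k1 equal to 0x05 in this model, only elements 0 and 2 of the result come from v2; elements 1 and 3 through 7 keep their v1_old values, and elements 8 through 15 are 0.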

Returns

Returns the result of the conversion operation.