Intel® C++ Compiler 16.0 User and Reference Guide
Adds rounded float64 vectors. The corresponding instruction is ADDPD. This intrinsic only applies to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask extern _m512d __cdecl _mm512_add_round_pd(_m512d v2, _m512d v3, int rc); |
With Mask extern _m512d __cdecl _mm512_mask_add_round_pd(_m512d v1_old, __mmask8 k1, _m512d v2, _m512d v3, int rc); |
Performs an element-by-element addition between float64 vector v2 and float64 vector v3. Intermediate values are rounded according to rc value.
The masked variant has two additional arguments: v1_old and k1. Those elements of v2 and v3 with the corresponding bit clear in vector mask k1 are not used in the computation. Instead, the corresponding element from v1_old is copied to the resulting vector.
Returns the result of the addition operation.