Intel® C++ Compiler 16.0 User and Reference Guide
Subtracts float64 vectors, with rounding controlled by the rc argument. The corresponding instruction is VSUBPD. This intrinsic applies only to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask: extern __m512d __cdecl _mm512_sub_round_pd(__m512d v2, __m512d v3, int rc);
With Mask: extern __m512d __cdecl _mm512_mask_sub_round_pd(__m512d v1_old, __mmask8 k1, __m512d v2, __m512d v3, int rc);
Performs an element-by-element subtraction of float64 vector v3 from float64 vector v2, that is, v2 - v3 for each of the eight elements. Intermediate values are rounded according to the value of rc.
The masked variant has two additional arguments: v1_old and k1. Those elements of v2 and v3 with the corresponding bit clear in vector mask k1 are not used in the computation. Instead, the corresponding element from v1_old is copied to the resulting vector.
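The merge behavior of the masked variant can be sketched the same way. This is a scalar reference model under stated assumptions, not the intrinsic: model_mask_sub_pd is a hypothetical helper, uint8_t stands in for __mmask8, and rounding is left at whatever mode is currently in effect so that only the masking logic is shown.

```c
#include <stdint.h>

/* Hypothetical scalar model of the masked form. Bit i of k1 selects
 * element i:
 *   bit set   -> out[i] = v2[i] - v3[i]
 *   bit clear -> out[i] = v1_old[i]  (v2[i] and v3[i] are not used)
 * uint8_t stands in for __mmask8; rounding control is omitted here. */
void model_mask_sub_pd(const double v1_old[8], uint8_t k1,
                       const double v2[8], const double v3[8],
                       double out[8])
{
    for (int i = 0; i < 8; ++i) {
        if ((k1 >> i) & 1u)
            out[i] = v2[i] - v3[i];
        else
            out[i] = v1_old[i];
    }
}
```

With k1 = 0x0F, elements 0 through 3 of the result hold the differences and elements 4 through 7 are copied unchanged from v1_old.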
Returns the result of the subtraction operation.