Intel® C++ Compiler 16.0 User and Reference Guide
Performs a reverse subtraction of float32 vectors with rounding. The corresponding instruction is VSUBRPS. This intrinsic applies only to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).
Without Mask: extern __m512 __cdecl _mm512_subr_round_ps(__m512 v2, __m512 v3, int rc);
With Mask: extern __m512 __cdecl _mm512_mask_subr_round_ps(__m512 v1_old, __mmask16 k1, __m512 v2, __m512 v3, int rc);
Performs an element-by-element subtraction of float32 vector v2 from float32 vector v3, so that each result element is v3 - v2. Intermediate values are rounded according to the rounding mode specified by rc.
The masked variant has two additional arguments: v1_old and k1. Those elements of v2 and v3 with the corresponding bit clear in vector mask k1 are not used in the computation. Instead, the corresponding element from v1_old is copied to the resulting vector.
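The element-wise behavior described above can be illustrated with a portable scalar model. This is only a sketch, not the compiler's implementation: the function names subr_ps_model and mask_subr_ps_model and the LANES constant are hypothetical, the model assumes that bit i of k1 controls element i, and it does not model rc (the subtraction uses whatever rounding mode the host is in, normally round-to-nearest).

```c
#include <stdint.h>

#define LANES 16  /* a 512-bit vector holds 16 float32 elements */

/* Hypothetical scalar model of the unmasked form.
   "Reverse" subtraction: each result element is v3[i] - v2[i]. */
static void subr_ps_model(float dst[LANES],
                          const float v2[LANES],
                          const float v3[LANES])
{
    for (int i = 0; i < LANES; ++i)
        dst[i] = v3[i] - v2[i];
}

/* Hypothetical scalar model of the masked form.
   Assumption: bit i of k1 corresponds to element i.
   Bit set   -> element is computed as v3[i] - v2[i].
   Bit clear -> element is copied from v1_old[i]. */
static void mask_subr_ps_model(float dst[LANES],
                               const float v1_old[LANES],
                               uint16_t k1,
                               const float v2[LANES],
                               const float v3[LANES])
{
    for (int i = 0; i < LANES; ++i) {
        if ((k1 >> i) & 1u)
            dst[i] = v3[i] - v2[i];
        else
            dst[i] = v1_old[i];
    }
}
```

In this model, passing a mask of 0xFFFF to mask_subr_ps_model gives the same result as subr_ps_model, and a mask of 0 returns a copy of v1_old.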
Returns the result of the subtraction operation.