_mm512_mul_ps / _mm512_mask_mul_ps

Multiplies two float32 vectors element by element. The corresponding instruction is VMULPS. This intrinsic only applies to Intel® Many Integrated Core Architecture (Intel® MIC Architecture).

Syntax

Without Mask

extern __m512 __cdecl _mm512_mul_ps(__m512 v2, __m512 v3);

With Mask

extern __m512 __cdecl _mm512_mask_mul_ps(__m512 v1_old, __mmask16 k1, __m512 v2, __m512 v3);

Parameters

`v2`	float32 vector multiplied by float32 vector `v3`
`v3`	float32 vector multiplied by float32 vector `v2`
`v1_old`	Source vector that retains old values of the destination vector; the resulting vector gets corresponding elements from `v1_old` for zero mask bits
`k1`	Writemask; only elements whose corresponding bit in `k1` is set to '1' are computed and stored in the result; elements whose corresponding bit in `k1` is zero are copied from the corresponding elements of `v1_old`

Description

Performs an element-by-element multiplication of float32 vector v2 and float32 vector v3. Each of the 16 elements of the result is the product of the corresponding elements of v2 and v3.

The masked variant has two additional arguments: v1_old and k1. Only those elements of the source vectors whose corresponding bit is set in vector mask k1 are multiplied. The remaining elements of the resulting vector are copied from the corresponding elements of v1_old.
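
The semantics described above can be illustrated with a scalar reference model in plain C. This is only a sketch of the behavior, not the intrinsic itself or its implementation: the names ref_mul_ps, ref_mask_mul_ps, and LANES are hypothetical, plain float arrays stand in for the vector type, and a uint16_t stands in for the 16-bit writemask. It assumes that bit i of k1 controls element i, with bit 0 corresponding to the lowest element.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define LANES 16 /* a 512-bit vector holds 16 float32 elements */

/* Hypothetical scalar model of the unmasked form:
   every result element is v2[i] * v3[i]. */
void ref_mul_ps(float dst[LANES], const float v2[LANES], const float v3[LANES])
{
    assert(dst != NULL && v2 != NULL && v3 != NULL);
    for (size_t i = 0; i < LANES; ++i)
        dst[i] = v2[i] * v3[i];
}

/* Hypothetical scalar model of the masked form:
   elements whose k1 bit is 1 receive the product,
   elements whose k1 bit is 0 keep the value from v1_old. */
void ref_mask_mul_ps(float dst[LANES], const float v1_old[LANES], uint16_t k1,
                     const float v2[LANES], const float v3[LANES])
{
    assert(dst != NULL && v1_old != NULL && v2 != NULL && v3 != NULL);
    for (size_t i = 0; i < LANES; ++i) {
        if ((k1 >> i) & 1u)
            dst[i] = v2[i] * v3[i]; /* mask bit set: compute and store */
        else
            dst[i] = v1_old[i];     /* mask bit clear: pass through old value */
    }
}
```

For example, with k1 equal to 0x0005 only elements 0 and 2 receive products in this model; the other 14 elements are copied unchanged from v1_old.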

Returns

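Returns a float32 vector containing the result of the multiplication operation. In the masked variant, elements whose k1 bit is zero hold the corresponding elements of v1_old.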
