Intel® C++ Compiler 16.0 User and Reference Guide
Take the low packed single-precision floating-point data element from the source operand and broadcast to all elements of the result vector. The corresponding Intel® AVX2 instruction is VBROADCASTSS.
extern __m128 _mm_broadcastss_ps(__m128 val); |
extern __m256 _mm256_broadcastss_ps(__m128 val); |
val |
__m128 vector containing the 32-bit element to be broadcasted |
Takes the low packed single-precision floating-point (float32) data element from the source operand and broadcasts it to all elements of the result vector. The source operand is __m128; only the low 32 bits of this operand are broadcasted.
Return result of the broadcast operation.