Intel® C++ Compiler 16.0 User and Reference Guide

_mm_i32gather_pd, _mm256_i32gather_pd

Gathers 2/4 packed double-precision floating point values from memory referenced by the given base address, dword indices and scale. The corresponding Intel® AVX2 instruction is VGATHERDPD.

Syntax

extern __m128d _mm_i32gather_pd(double const * base, __m128i vindex, const int scale);

extern __m256d _mm256_i32gather_pd(double const * base, __m128i vindex, const int scale);

Arguments

base

the base address used to reference the loaded FP elements.

vindex

the vector of dword indices used to reference the loaded FP elements.

scale

32-bit scale used to address the loaded FP elements.

Description

The intrinsics load 2/4 packed double-precision floating-point values from memory using dword indices and updates the destination operand.

Below is the pseudo-code for the intrinsics:

_mm_i32gather_pd():

result[63:0] = mem[base+vindex[31:0]*scale];
result[127:64] = mem[base+vindex[63:32]*scale];

_mm256_i32gather_pd():

result[63:0] = mem[base+vindex[31:0]*scale];
result[127:64] = mem[base+vindex[63:32]*scale];
result[191:128] = mem[base+vindex[95:64]*scale];
result[255:192] = mem[base+vindex[127:96]*scale];

Returns

A 128/256-bit vector with unconditionally gathered double-precision FP values.