Developer Guide for Intel® Data Analytics Acceleration Library 2016 Update 4

Distributed Processing

The distributed processing mode assumes that the data set is split in nblocks blocks across computation nodes.

To initialize the implicit ALS algorithm in the distributed processing mode, use the one-step process illustrated by the following diagram for nblocks=3:


Implicit Alternating Least Squares, Initialization, Distributed Processing Workflow

Input

In the distributed processing mode, initialization of item factors for the implicit ALS algorithm accepts the input described below. Pass the Input ID as a parameter to the methods that provide input for your algorithm. For more details, see Algorithms.

Input ID

Input

data

Pointer to the m x n numeric table with the mining data. The input can be an object of any class derived from NumericTable except PackedTriangularMatrix and PackedSymmetricMatrix.

Parameters

In the distributed processing mode, initialization of item factors for the implicit ALS algorithm has the following parameters:

Parameter

Default Value

Description

algorithmFPType

double

The floating-point type that the algorithm uses for intermediate computations. Can be float or double.

method

fastCSR

Performance-oriented computation method for CSR numeric tables, the only method supported by the algorithm.

nFactors

10

The total number of factors.

seed

777777

The seed for the random number generation in the initialization step.

fullNUsers

0

The total number of users.

Output

In the distributed processing mode, initialization of item factors for the implicit ALS algorithm calculates the result described below. Pass the Result ID as a parameter to the methods that access the results of your algorithm. For more details, see Algorithms.

Result ID

Result

model

The model with initialized item factors. The result can only be an object of the Model class.