Developer Guide for Intel® Data Analytics Acceleration Library 2016 Update 4
Linear regression is a method for modeling the relationship between a dependent variable (which may be a vector) and one or more explanatory variables by fitting linear equations to observed data. The case of one explanatory variable is called Simple Linear Regression. For several explanatory variables the method is called Multiple Linear Regression.
Let (x1,…,xp) be a vector of input variables and y=(y1,…,yk) be the response. For each j=1,…,k, the linear regression model has the format [Hastie2009]:
y j =β0j+ β1jx1+...+ βpjxp
Here xi, i=1,...,p, are referred to as independent variables, and yj are referred to as dependent variables or responses.
The linear regression is multiple if the number of input variables p > 1.
Training Stage
Let (x11,...,x1p,y1 ),…,(xn1,...,xnp,yn ) be a set of training data, n >> p. The matrix X of size n x p contains observations xij,i=1,...,n,j=1,….,p, of independent variables.
To estimate the coefficients (β0j,...,βpj) one these methods can be used:
Normal Equation system
QR matrix decomposition
Prediction Stage
Linear regression based prediction is done for input vector (x1,…,xp) using the equation yj=β0j+ β1jx1+...+βpjxp for each j=1,…,k.