Quality Metrics for Multi-class Classification Algorithms

For l classes C ₁, ..., C _l, given a vector X= (x ₁, …,x _n) of class labels computed at the prediction stage of the classification algorithm and a vector Y= (y ₁, …,y _n) of expected class labels, the problem is to evaluate the classifier by computing the confusion matrix and connected quality metrics: precision, error rate, and so on.

Further definitions use the following notations:

`tp` _i (true positive)	the number of correctly recognized observations for class `C` _i
`tn` _i (true negative)	the number of correctly recognized observations that do not belong to the class `C` _i
`fp` _i (false positive)	the number of observations that were incorrectly assigned to the class `C` _{_i}
`fn` _i (false negative)	the number of observations that were not recognized as belonging to the class `C` _i

The library uses the following quality metrics for multi-class classifiers:

Quality Metric	Definition
Average accuracy
Error rate
Micro precision (`Precision` _μ)
Micro recall (`Recall` _μ)
Micro F-score (`F-score` _μ)
Macro precision (`Precision` _M)
Macro recall (Recall _M)
Macro F-score (F-score _M)

For more details of these metrics, including the evaluation focus, refer to [Sokolova09].

The following is the confusion matrix:

	Classified as Class `C` ₁	...	Classified as Class `C` _i	...	Classified as Class `C` _l
Actual Class `C` ₁	`c` ₁₁	...	`c` _1i	...	`c` _1l
...	...	...	...	...	...
Actual Class `C` _i	`c` _i1	...	`c` _ii	...	`c` _il
...	...	...	...	...	...
Actual Class `C` _l	`c` _l1	...	`c` _li	...	`c` _ll

The positives and negatives are defined through elements of the confusion matrix as follows: