Developer Guide for Intel® Data Analytics Acceleration Library 2016 Update 4
For two classes C1 and C2, given a vector X = (x1, …, xn) of class labels computed at the prediction stage of the classification algorithm and a vector Y = (y1, …, yn) of expected class labels, the problem is to evaluate the classifier by computing the confusion matrix and related quality metrics: precision, recall, and so on.
Further definitions use the following notations:
| Notation | Definition |
|---|---|
| tp (true positive) | the number of observations correctly recognized as belonging to class C1 |
| tn (true negative) | the number of observations correctly recognized as not belonging to class C1 |
| fp (false positive) | the number of observations incorrectly assigned to class C1 |
| fn (false negative) | the number of observations not recognized as belonging to class C1 |
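As an illustration of these notations (this sketch is not part of the library API), the following code counts tp, tn, fp, and fn from the predicted labels X and the expected labels Y, under the assumption that class C1 is encoded as label 0 and class C2 as label 1:

```cpp
#include <cstddef>
#include <vector>

/* The four outcome counts for a binary classifier, matching the
   tp/tn/fp/fn notations defined above. */
struct ConfusionCounts
{
    std::size_t tp = 0; /* predicted C1, actual C1 */
    std::size_t tn = 0; /* predicted C2, actual C2 */
    std::size_t fp = 0; /* predicted C1, actual C2 */
    std::size_t fn = 0; /* predicted C2, actual C1 */
};

/* x holds the predicted class labels, y the expected class labels.
   Label 0 is assumed to encode class C1 and label 1 to encode class C2. */
ConfusionCounts countOutcomes(const std::vector<int> &x, const std::vector<int> &y)
{
    ConfusionCounts c;
    for (std::size_t i = 0; i < x.size() && i < y.size(); ++i)
    {
        if (y[i] == 0) /* the observation actually belongs to C1 */
        {
            if (x[i] == 0) ++c.tp; else ++c.fn;
        }
        else           /* the observation actually belongs to C2 */
        {
            if (x[i] == 0) ++c.fp; else ++c.tn;
        }
    }
    return c;
}
```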
The library uses the following quality metrics for binary classifiers:
| Quality Metric | Definition |
|---|---|
| Accuracy | (tp + tn) / (tp + fp + tn + fn) |
| Precision | tp / (tp + fp) |
| Recall | tp / (tp + fn) |
| F-score | ((β² + 1) · tp) / ((β² + 1) · tp + β² · fn + fp), where β is a positive parameter (β = 1 gives the balanced F-score) |
| Specificity | tn / (fp + tn) |
| Area under curve (AUC) | ½ · (tp / (tp + fn) + tn / (tn + fp)) |
For more details on these metrics, including their evaluation focus, refer to [Sokolova09].
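Written out as arithmetic on the counts above, these definitions translate directly into code. The sketch below is illustrative only and not the library's quality metric API; the beta parameter is an assumption here and defaults to 1, giving the balanced F-score:

```cpp
/* Quality metrics computed from tp, tn, fp, fn according to the
   definitions in the table above. Illustrative sketch only. */
struct BinaryMetrics
{
    double accuracy;
    double precision;
    double recall;      /* also called sensitivity */
    double fscore;
    double specificity;
    double auc;
};

BinaryMetrics computeMetrics(double tp, double tn, double fp, double fn, double beta = 1.0)
{
    const double b2 = beta * beta;
    BinaryMetrics m;
    m.accuracy    = (tp + tn) / (tp + fp + tn + fn);
    m.precision   = tp / (tp + fp);
    m.recall      = tp / (tp + fn);
    m.fscore      = ((b2 + 1.0) * tp) / ((b2 + 1.0) * tp + b2 * fn + fp);
    m.specificity = tn / (fp + tn);
    m.auc         = 0.5 * (m.recall + m.specificity);
    return m;
}
```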
The confusion matrix is defined as follows:
|  | Classified as Class C1 | Classified as Class C2 |
|---|---|---|
| Actual Class C1 | tp | fn |
| Actual Class C2 | fp | tn |