A Strategy on Selecting Performance Metrics for Classifier Evaluation

被引:98
作者
Liu, Yangguang [1 ]
Zhou, Yangming [1 ]
Wen, Shiting [1 ]
Tang, Chaogang [2 ]
机构
[1] Zhejiang Univ, Ningbo Inst Technol, Ningbo, Zhejiang, Peoples R China
[2] China Univ Min & Technol, Xuzhou, Peoples R China
关键词
Classifiers; Classifiers' Performances; Correlation; Machine Learning Community; Performance Metrics;
D O I
10.4018/IJMCMC.2014100102
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The evaluation of classifiers' performances plays a critical role in construction and selection of classification model. Although many performance metrics have been proposed in machine learning community, no general guidelines are available among practitioners regarding which metric to be selected for evaluating a classifier's performance. In this paper, we attempt to provide practitioners with a strategy on selecting performance metrics for classifier evaluation. Firstly, the authors investigate seven widely used performance metrics, namely classification accuracy, F-measure, kappa statistic, root mean square error, mean absolute error, the area under the receiver operating curve, and the area under the precision-recall curve. Secondly, the authors resort to using Pearson linear correlation and Spearman rank correlation to analyses the potential relationship among these seven metrics. Experimental results show that these commonly used metrics can be divided into three groups, and all metrics within a given group are highly correlated but less correlated with metrics from different groups.
引用
收藏
页码:20 / 35
页数:16
相关论文
共 27 条