Evolving Neural Networks with Maximum AUC for Imbalanced Data Classification

被引:0
作者
Lu, Xiaofen [1 ]
Tang, Ke [1 ]
Yao, Xin [1 ]
机构
[1] Univ Sci & Technol China, NICAL, Sch Comp Sci & Technol, Hefei 230027, Peoples R China
来源
HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, PT 1 | 2010年 / 6076卷
关键词
ROC; AUC; Class-imbalance Learning; Feed-forward Neural Networks; Evolutionary Algorithms; Differential Evolution; DIFFERENTIAL EVOLUTION; OPTIMIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-world classification problems usually involve imbalanced data sets. In such cases, a classifier with high classification accuracy does not necessarily imply a good classification performance for all classes. The Area Under the ROC Curve (AUC) has been recognized as a more appropriate performance indicator in such cases. Quite a few methods have been developed to design classifiers with the maximum AUC. In the context of Neural Networks (NNs), however, it is usually an approximation of AUC rather than the exact AUC itself that is maximized, because AUC is non-differentiable and cannot be directly maximized by gradient-based methods. In this paper, we propose to use evolutionary algorithms to train NNs with the maximum AUC. The proposed method employs AUC as the objective function. An evolutionary algorithm, namely the Self-adaptive Differential Evolution with Neighborhood Search (SaNSDE) algorithm, is used to optimize the weights of NNs with respect to AUC. Empirical studies on 19 binary and multi-class imbalanced data sets show that the proposed evolutionary AUC maximization (EAM) method can train NN with larger AUC than existing methods.
引用
收藏
页码:335 / 342
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 2005, P ICML 2005 WORKSH R
[2]  
[Anonymous], 1998, UCI REPOSITORY MACHI
[3]  
[Anonymous], 2003, INT C MACHINE LEARNI
[4]  
Calders T, 2007, LECT NOTES ARTIF INT, V4702, P42
[5]  
Caruana R., 2006, ACM INT C P SER, P161, DOI [10.1145/1143844.1143865, DOI 10.1145/1143844.1143865]
[6]  
Cortes C, 2004, ADV NEUR IN, V16, P313
[7]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[8]   An introduction to ROC analysis [J].
Fawcett, Tom .
PATTERN RECOGNITION LETTERS, 2006, 27 (08) :861-874
[9]  
Ferri C., 2002, 19 INT C MACH LEARN, P139
[10]   A simple generalisation of the area under the ROC curve for multiple class classification problems [J].
Hand, DJ ;
Till, RJ .
MACHINE LEARNING, 2001, 45 (02) :171-186