Confusion-Matrix-Based Kernel Logistic Regression for Imbalanced Data Classification

被引:113
作者
Ohsaki, Miho [1 ]
Wang, Peng [1 ]
Matsuda, Kenji [1 ]
Katagiri, Shigeru [1 ]
Watanabe, Hideyuki [2 ]
Ralescu, Anca [3 ]
机构
[1] Doshisha Univ, Grad Sch Sci & Engn, 1-3 Tataramiyakodani, Kyotanabe, Kyoto 6100321, Japan
[2] Natl Inst Informat & Commun Technol, 3-5 Hikaridai, Seika, Kyoto 6190289, Japan
[3] Univ Cincinnati, Coll Engn & Appl Sci, Dept Elect Engn & Comp Syst, 812 Rhodes Hall, Cincinnati, OH 45221 USA
关键词
Imbalanced data; confusion matrix; kernel logistic regression; minimum classification error and generalized probabilistic descent; OPTIMIZATION; SELECTION; MODEL;
D O I
10.1109/TKDE.2017.2682249
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There have been many attempts to classify imbalanced data, since this classification is critical in a wide variety of applications related to the detection of anomalies, failures, and risks. Many conventional methods, which can be categorized into sampling, cost-sensitive, or ensemble, include heuristic and task dependent processes. In order to achieve a better classification performance by formulation without heuristics and task dependence, we propose confusion-matrix-based kernel logistic regression (CM-KLOGR). Its objective function is the harmonic mean of various evaluation criteria derived from a confusion matrix, such criteria as sensitivity, positive predictive value, and others for negatives. This objective function and its optimization are consistently formulated on the framework of KLOGR, based on minimum classification error and generalized probabilistic descent (MCE/GPD) learning. Due to the merits of the harmonic mean, KLOGR, and MCE/GPD, CM-KLOGR improves the multifaceted performances in a well-balanced way. This paper presents the formulation of CM-KLOGR and its effectiveness through experiments that comparatively evaluated CM-KLOGR using benchmark imbalanced datasets.
引用
收藏
页码:1806 / 1819
页数:14
相关论文
共 56 条
[21]   Learning from Imbalanced Data [J].
He, Haibo ;
Garcia, Edwardo A. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284
[22]   ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning [J].
He, Haibo ;
Bai, Yang ;
Garcia, Edwardo A. ;
Li, Shutao .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1322-1328
[23]   Discriminative learning in sequential pattern recognition [J].
He, Xiaodong ;
Deng, Li ;
Chou, Wu .
IEEE SIGNAL PROCESSING MAGAZINE, 2008, 25 (05) :14-36
[24]  
Jaakkola T., 1999, P WORKSH ART INT STA
[25]   Minimum classification error rate methods for speech recognition [J].
Juang, BH ;
Chou, W ;
Lee, CH .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03) :257-265
[26]   Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method [J].
Katagiri, S ;
Juang, BH ;
Lee, CH .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2345-2373
[27]  
Keerthi S., 2006, NEURAL INFORM PROCES, V18, P673
[28]  
Krawczyk B, 2016, IEEE IJCNN, P2447, DOI 10.1109/IJCNN.2016.7727503
[29]  
Liu XY, 2006, IEEE DATA MINING, P970
[30]   Mislocalization-related disease gene discovery using gene expression based computational protein localization prediction [J].
Liu, Zhonghao ;
Hu, Jianjun .
METHODS, 2016, 93 :119-127