Minority manifold regularization by stacked auto-encoder for imbalanced learning

被引:4
作者
Farajian, Nima [1 ,3 ]
Adibi, Peyman [2 ]
机构
[1] Univ Kashan, Fac Comp & Elect Engn, Dept Comp Engn, Kashan, Iran
[2] Univ Isfahan, Fac Comp Engn, Dept Artificial Intelligence, Esfahan, Iran
[3] Univ Eyvanekey, Dept Comp Engn, Fac Comp & Elect Engn, Semnan, Iran
关键词
Regularized auto-encoder; Imbalanced data classification; Feature learning; CLASSIFICATION; REPRESENTATION; CLASSIFIERS; NETWORK;
D O I
10.1016/j.eswa.2020.114317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced learning is considered one of the challenging problems in machine learning. This problem arises when a learning algorithm is biased toward the majority class due to the large proportion of the majority class data while detecting the minority class is of greater importance. In the present study, a novel method (MMRAE) is presented for imbalanced learning encompassing feature learning and classification steps. In the feature learning step, meaningful features are extracted from the minority data and their underlying manifold are captured by taking advantage of one-class learning approach through stacking two regularized auto-encoders. The existence of novel and different regularizers in each auto-encoder leads to a new representation with proper data discrimination which improves the between-class and within-class imbalanced problems. Then, in the classification step, the classification between the minority and majority class is performed by constructing a multilayer neural network using features learned throughout pre-training. The proposed method is extensively studied on six artificial and twenty real datasets in order to have a precise evaluation. Based on different criteria such as F-measure, G-mean, and AUC, the results represent considerable performance of the proposed method compared to several other existing methods.
引用
收藏
页数:13
相关论文
共 87 条
[1]  
Alain G, 2014, J MACH LEARN RES, V15, P3563
[2]  
[Anonymous], 2011, NIPS
[3]  
[Anonymous], 2004, ACM SIGKDD EXPLORATI, DOI DOI 10.1145/1007730.1007737
[4]   MWMOTE-Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning [J].
Barua, Sukarna ;
Islam, Md. Monirul ;
Yao, Xin ;
Murase, Kazuyuki .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) :405-425
[5]  
Batista GE., 2004, ACM SIGKDD EXPL NEWS, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]
[6]  
Bengio Y., 2006, P ADV NEUR INF PROC, V19, P153, DOI [DOI 10.7551/MITPRESS/7503.003.0024, DOI 10.5555/2976456.2976476]
[7]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[8]  
Bishop C.M., 2006, Pattern Recognition and Machine Learning
[9]  
Bordes A, 2005, J MACH LEARN RES, V6, P1579
[10]   DBSMOTE: Density-Based Synthetic Minority Over-sampling TEchnique [J].
Bunkhumpornpat, Chumphol ;
Sinapiromsaran, Krung ;
Lursinsap, Chidchanok .
APPLIED INTELLIGENCE, 2012, 36 (03) :664-684