Graph-Based Class-Imbalance Learning With Label Enhancement

被引:47
作者
Du, Guodong [1 ,2 ]
Zhang, Jia [3 ]
Jiang, Min [4 ,5 ]
Long, Jinyi [3 ,6 ]
Lin, Yaojin [7 ]
Li, Shaozi [8 ,9 ]
Tan, Kay Chen [10 ]
机构
[1] Xiamen Univ, Inst Artificial Intelligence, Dept Cognit Sci, Xiamen, Peoples R China
[2] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen 361005, Peoples R China
[3] Jinan Univ, Coll Informat Sci & Technol, Guangzhou 510632, Peoples R China
[4] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[5] Xiamen Univ, Fujian Key Lab Machine Intelligence & Robot, Xiamen 361005, Peoples R China
[6] Jinan Univ, Guangdong Key Lab Tradit Chinese Med Informat Tech, Guangzhou 510632, Peoples R China
[7] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
[8] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[9] Minnan Normal Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Peoples R China
[10] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
关键词
Class-imbalance learning; graph theory; label enhancement; multilabel learning; single-label learning; supervised learning; ENSEMBLE; CLASSIFIERS; NETWORKS;
D O I
10.1109/TNNLS.2021.3133262
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class imbalance is a common issue in the community of machine learning and data mining. The class-imbalance distribution can make most classical classification algorithms neglect the significance of the minority class and tend toward the majority class. In this article, we propose a label enhancement method to solve the class-imbalance problem in a graph manner, which estimates the numerical label and trains the inductive model simultaneously. It gives a new perspective on the class-imbalance learning based on the numerical label rather than the original logical label. We also present an iterative optimization algorithm and analyze the computation complexity and its convergence. To demonstrate the superiority of the proposed method, several single-label and multilabel datasets are applied in the experiments. The experimental results show that the proposed method achieves a promising performance and outperforms some state-of-the-art single-label and multilabel class-imbalance learning methods.
引用
收藏
页码:6081 / 6095
页数:15
相关论文
共 68 条
[1]   Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks [J].
Barushka, Aliaksandr ;
Hajek, Petr .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09) :4239-4257
[2]   Correlated Multi-label Classification with Incomplete Label Space and Class Imbalance [J].
Braytee, Ali ;
Liu, Wei ;
Anaissi, Ali ;
Kennedy, Paul J. .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2019, 10 (05)
[3]   Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data [J].
Castro, Cristiano L. ;
Braga, Antonio P. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (06) :888-899
[4]   REMEDIAL-HwR: Tackling multilabel imbalance through label decoupling and data resampling hybridization [J].
Charte, Francisco ;
Rivera, Antonio J. ;
del Jesus, Maria J. ;
Herrera, Francisco .
NEUROCOMPUTING, 2019, 326 :110-122
[5]   MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation [J].
Charte, Francisco ;
Rivera, Antonio J. ;
del Jesus, Maria J. ;
Herrera, Francisco .
KNOWLEDGE-BASED SYSTEMS, 2015, 89 :385-397
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]   SMOTEBoost: Improving prediction of the minority class in boosting [J].
Chawla, NV ;
Lazarevic, A ;
Hall, LO ;
Bowyer, KW .
KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 :107-119
[8]   Multi-label feature selection with application to TCM state identification [J].
Dai, Liang ;
Zhang, Jia ;
Li, Candong ;
Zhou, Changen ;
Li, Shaozi .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (23)
[9]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[10]   Towards graph-based class-imbalance learning for hospital readmission [J].
Du, Guodong ;
Zhang, Jia ;
Ma, Fenglong ;
Zhao, Min ;
Lin, Yaojin ;
Li, Shaozi .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176 (176)