Graph-Based Class-Imbalance Learning With Label Enhancement

被引:47
作者
Du, Guodong [1 ,2 ]
Zhang, Jia [3 ]
Jiang, Min [4 ,5 ]
Long, Jinyi [3 ,6 ]
Lin, Yaojin [7 ]
Li, Shaozi [8 ,9 ]
Tan, Kay Chen [10 ]
机构
[1] Xiamen Univ, Inst Artificial Intelligence, Dept Cognit Sci, Xiamen, Peoples R China
[2] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen 361005, Peoples R China
[3] Jinan Univ, Coll Informat Sci & Technol, Guangzhou 510632, Peoples R China
[4] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[5] Xiamen Univ, Fujian Key Lab Machine Intelligence & Robot, Xiamen 361005, Peoples R China
[6] Jinan Univ, Guangdong Key Lab Tradit Chinese Med Informat Tech, Guangzhou 510632, Peoples R China
[7] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
[8] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[9] Minnan Normal Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Peoples R China
[10] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
关键词
Class-imbalance learning; graph theory; label enhancement; multilabel learning; single-label learning; supervised learning; ENSEMBLE; CLASSIFIERS; NETWORKS;
D O I
10.1109/TNNLS.2021.3133262
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class imbalance is a common issue in the community of machine learning and data mining. The class-imbalance distribution can make most classical classification algorithms neglect the significance of the minority class and tend toward the majority class. In this article, we propose a label enhancement method to solve the class-imbalance problem in a graph manner, which estimates the numerical label and trains the inductive model simultaneously. It gives a new perspective on the class-imbalance learning based on the numerical label rather than the original logical label. We also present an iterative optimization algorithm and analyze the computation complexity and its convergence. To demonstrate the superiority of the proposed method, several single-label and multilabel datasets are applied in the experiments. The experimental results show that the proposed method achieves a promising performance and outperforms some state-of-the-art single-label and multilabel class-imbalance learning methods.
引用
收藏
页码:6081 / 6095
页数:15
相关论文
共 68 条
[41]   AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification [J].
Tagami, Yukihiro .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :455-464
[42]   Inverse random under sampling for class imbalance problem and its application to multi-label classification [J].
Tahir, Muhammad Atif ;
Kittler, Josef ;
Yan, Fei .
PATTERN RECOGNITION, 2012, 45 (10) :3738-3750
[43]   On parameter settings of Hopfield networks applied to traveling salesman problems [J].
Tan, KC ;
Tang, HJ ;
Ge, SS .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2005, 52 (05) :994-1002
[44]   Global exponential stability of discrete-time neural networks for constrained quadratic optimization [J].
Tan, KC ;
Tang, HJ ;
Yi, Z .
NEUROCOMPUTING, 2004, 56 :399-406
[45]   A columnar competitive model for solving combinatorial optimization problems [J].
Tang, HJ ;
Tan, KC ;
Yi, Z .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (06) :1568-1573
[46]   Self-adaptive cost weights-based support vector machine cost-sensitive ensemble for imbalanced data classification [J].
Tao, Xinmin ;
Li, Qing ;
Guo, Wenjie ;
Ren, Chao ;
Li, Chenxi ;
Liu, Rui ;
Zou, Junrong .
INFORMATION SCIENCES, 2019, 487 :31-56
[47]   A review of methods for imbalance d multi-lab el classification [J].
Tarekegn, Adane Nega ;
Giacobini, Mario ;
Michalak, Krzysztof .
PATTERN RECOGNITION, 2021, 118
[48]  
Vanschoren Joaquin., 2013, SIGKDD EXPLORATIONS, V15, P49, DOI [DOI 10.1145/2641190.2641198, 10.1145/2641190.2641198]
[49]   Active k-labelsets ensemble for multi-label classification [J].
Wang, Ran ;
Kwong, Sam ;
Wang, Xu ;
Jia, Yuheng .
PATTERN RECOGNITION, 2021, 109
[50]   Diversity Analysis on Imbalanced Data Sets by Using Ensemble Models [J].
Wang, Shuo ;
Yao, Xin .
2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, :324-331