Feature selection via label enhancement and neighborhood rough set for multi-label data with unbalanced distribution

被引:0
作者
Qian, Wenbin [1 ,2 ]
Ruan, Wenyong [1 ]
Lu, Xiwen [1 ]
Yang, Wenji [2 ]
Huang, Jintao [3 ]
机构
[1] Jiangxi Agr Univ, Sch Comp & Informat Engn, Nanchang 330045, Peoples R China
[2] Jiangxi Agr Univ, Sch Software, Nanchang 330045, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Neighborhood rough set; Multi-label learning; Label enhancement; Label distribution; MUTUAL INFORMATION; CLASSIFICATION; SIMILARITY;
D O I
10.1016/j.asoc.2025.113028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label learning has gained significant attention in classification tasks, but challenges remain in handling high-dimensional data. Although feature selection techniques can alleviate these issues, neglecting the unbalanced data distribution problem severely undermines the models' accuracy. Furthermore, existing methods fail to account for the importance and correlation of labels. In this paper, we present a novel multi-label feature selection algorithm that addresses these issues through three innovations: (1) using k-nearest neighbors to capture local similarities in unbalanced data, (2) enhancing labels by converting them into distributions to enrich semantic information, and (3) introducing a new evaluation function to assess label correlations. A multi-criteria strategy is established to maximize feature-label relevance, minimize redundancy, and strengthen label correlations. Experimental results on fifteen multi-label datasets demonstrate the algorithm's superiority over five state-of-the-art methods.
引用
收藏
页数:19
相关论文
共 50 条
[1]   Label correlation in multi-label classification using local attribute reductions with fuzzy rough sets [J].
Che, Xiaoya ;
Chen, Degang ;
Mi, Jusheng .
FUZZY SETS AND SYSTEMS, 2022, 426 :121-144
[2]   Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition [J].
Chen, Shikai ;
Wang, Jianfeng ;
Chen, Yuedong ;
Shi, Zhongchao ;
Geng, Xin ;
Rui, Yong .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13981-13990
[3]   Novel multi-label feature selection via label symmetric uncertainty correlation learning and feature redundancy evaluation [J].
Dai, Jianhua ;
Chen, Jiaolong ;
Liu, Ye ;
Hu, Hu .
KNOWLEDGE-BASED SYSTEMS, 2020, 207
[4]   Feature selection for label distribution learning using dual-similarity based neighborhood fuzzy entropy [J].
Deng, Zhixuan ;
Li, Tianrui ;
Deng, Dayong ;
Liu, Keyu ;
Zhang, Pengfei ;
Zhang, Shiming ;
Luo, Zhipeng .
INFORMATION SCIENCES, 2022, 615 :385-404
[5]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[6]   Multi-label feature selection based on label correlations and feature redundancy [J].
Fan, Yuling ;
Chen, Baihua ;
Huang, Weiqin ;
Liu, Jinghua ;
Weng, Wei ;
Lan, Weiyao .
KNOWLEDGE-BASED SYSTEMS, 2022, 241
[7]   A comparison of alternative tests of significance for the problem of m rankings [J].
Friedman, M .
ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 :86-92
[8]   Multilabel Feature Selection With Constrained Latent Structure Shared Term [J].
Gao, Wanfu ;
Li, Yonghao ;
Hu, Liang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) :1253-1262
[9]   Feature redundancy term variation for mutual information-based feature selection [J].
Gao, Wanfu ;
Hu, Liang ;
Zhang, Ping .
APPLIED INTELLIGENCE, 2020, 50 (04) :1272-1288
[10]   An efficient Pareto-based feature selection algorithm for multi-label classification [J].
Hashemi, Amin ;
Dowlatshahi, Mohammad Bagher ;
Nezamabadi-pour, Hossein .
INFORMATION SCIENCES, 2021, 581 :428-447