Multi-label feature selection for imbalanced data via KNN-based multi-label rough set theory

被引:0
作者
Xu, Weihua [1 ]
Li, Yuzhe [1 ]
机构
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label feature selection; k-nearest neighborhood; Neighborhood rough set; Imbalanced data; MODEL; INFORMATION;
D O I
10.1016/j.ins.2025.122220
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the realm of multi-label feature selection, the intricacy of data structures and semantics has been escalating, rendering traditional single-label feature selection methodologies inadequate for contemporary demands to meet contemporary demands. This manuscript introduces an innovative neighborhood rough set model that integrates delta-neighborhood rough sets with k-nearest neighbor techniques, facilitating a transition from single-label to multi-label learning frameworks. The study delves into the attribute dependency concept within rough set theory and proposes a novel importance function based thereon, which can effectively quantify the significance of features within multi-label decision-making contexts. Building on this theoretical foundation, we have crafted a feature selection algorithm specifically tailored for imbalanced datasets. Extensive experiments conducted on 12 datasets, coupled with comparative analyses with 10 cutting-edge methods, have substantiated the superior performance of our algorithm in managing imbalanced datasets. This research not only offers a fresh theoretical perspective but also has significant practical implications, particularly in scenarios involving imbalanced datasets with multiple labels.
引用
收藏
页数:23
相关论文
共 41 条
[1]   A soft neighborhood rough set model and its applications [J].
An, Shuang ;
Guo, Xingyu ;
Wang, Changzhong ;
Guo, Ge ;
Dai, Jianhua .
INFORMATION SCIENCES, 2023, 624 :185-199
[2]   Multi-label feature selection based on fuzzy rough sets with metric learning and label enhancement [J].
Cai, Mingjie ;
Yan, Mei ;
Wang, Pei ;
Xu, Feng .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 168
[3]  
Cai Y-P., 2015, INT J DATABASE THEOR, V8, P307, DOI [DOI 10.14257/IJDTA.2015.8.4.31, 10.14257/ijdta.2015.8.4.31]
[4]   Feature selection for imbalanced data based on neighborhood rough sets [J].
Chen, Hongmei ;
Li, Tianrui ;
Fan, Xin ;
Luo, Chuan .
INFORMATION SCIENCES, 2019, 483 :1-20
[5]   Integration of deep learning model and feature selection for multi-label classification [J].
Ebrahimi, Hossein ;
Majidzadeh, Kambiz ;
Gharehchopogh, Farhad Soleimanian .
INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01) :2871-2883
[6]   Concept-cognitive learning survey: Mining and fusing knowledge from data [J].
Guo, Doudou ;
Xu, Weihua ;
Ding, Weiping ;
Yao, Yiyu ;
Wang, Xizhao ;
Pedrycz, Witold ;
Qian, Yuhua .
INFORMATION FUSION, 2024, 109
[7]   Incremental updating knowledge in neighborhood multigranulation rough sets under dynamic granular structures [J].
Hu, Chengxiang ;
Zhang, Li ;
Wang, Bangjun ;
Zhang, Zhao ;
Li, Fanzhang .
KNOWLEDGE-BASED SYSTEMS, 2019, 163 :811-829
[8]   Mixed feature selection based on granulation and approximation [J].
Hu, Qinghua ;
Liu, Jinfu ;
Yu, Daren .
KNOWLEDGE-BASED SYSTEMS, 2008, 21 (04) :294-304
[9]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594
[10]   Multi-Label Feature Selection via Label Enhancement and Analytic Hierarchy Process [J].
Huang, Jintao ;
Qian, Wenbin ;
Vong, Chi-Man ;
Ding, Weiping ;
Shu, Wenhao ;
Huang, Qin .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (05) :1377-1393