Feature selection for multilabel classification with missing labels via multi-scale fusion fuzzy uncertainty measures

Cited by: 18
Authors
Yin, Tengyu [1, 2, 3, 4]
Chen, Hongmei [1, 2, 3, 4]
Wang, Zhihong [1, 2, 3, 4]
Liu, Keyu [1, 2, 3, 4]
Yuan, Zhong [5]
Horng, Shi-Jinn [6]
Li, Tianrui [1, 2, 3, 4]
Affiliations
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[3] Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China
[4] Southwest Jiaotong Univ, Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 611756, Peoples R China
[5] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[6] Asia Univ, Dept Comp Sci & Informat Engn, Taichung 41354, Taiwan
Keywords
Multi-scale fuzzy rough sets; Multilabel feature selection; Missing labels; Uncertainty measures; Neighborhood rough sets
DOI
10.1016/j.patcog.2024.110580
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large volumes of high-dimensional multilabel data are being generated, posing a challenge for multilabel learning. Building effective learning models with discriminative features is essential to improving the performance of multilabel learning. Multilabel feature selection can filter out discriminative features according to their contribution to classification. However, ambiguity, uncertainty, and missing labels coexist in real-life multilabel data, which adversely affects multilabel feature selection. The multi-scale fuzzy rough set offers an effective way to mine the intrinsic knowledge hidden in uncertain data. This paper first extends multi-scale learning to multilabel data with missing labels and proposes FSMML, a feature selection method for multilabel classification with missing labels via multi-scale fusion fuzzy uncertainty measures. The construction of the missing label space and the feature evaluation metric are investigated within the framework of multi-scale learning. A multilabel multi-scale learning strategy is formalized with the fuzzy granularity cognitive mechanism at its core, and multi-scale fusion fuzzy label learning is introduced to reconstruct the missing label space. Then, a novel multilabel multi-scale fuzzy rough set model with missing labels is developed, and the significance of each scale is quantified. Moreover, several multi-scale fusion fuzzy uncertainty measures are defined by capturing the fuzzy similarity between samples in the feature space and in the reconstructed label space. On this basis, the relevance between features and the label set, as well as the interactivity and redundancy between features, are discussed for feature evaluation. Finally, FSMML selects high-quality features by maximizing relevance and interactivity while minimizing redundancy. Extensive experiments on fifteen datasets with missing labels demonstrate the effectiveness of FSMML.
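The selection principle stated at the end of the abstract (high relevance to the labels, low redundancy among chosen features) can be illustrated with a small sketch. This is not the FSMML algorithm: the abstract does not give its formulas, so the sketch swaps in a single-scale fuzzy similarity relation, a standard fuzzy mutual information built from fuzzy cardinalities, and an mRMR-style greedy score. It omits the interactivity term and the multi-scale fusion, and it assumes the label matrix has already been reconstructed into values in [0, 1]. All function names (`fuzzy_relation`, `label_relation`, `fuzzy_mutual_info`, `greedy_select`) are hypothetical.

```python
import numpy as np


def fuzzy_relation(v):
    """Fuzzy similarity matrix R[i, j] = 1 - |v_i - v_j| for values in [0, 1]."""
    v = np.asarray(v, dtype=float)
    return 1.0 - np.abs(v[:, None] - v[None, :])


def label_relation(Y):
    """Fuzzy similarity between samples in a (reconstructed) label space.

    Assumption: Y holds label memberships in [0, 1] with no missing entries.
    """
    Y = np.asarray(Y, dtype=float)
    return 1.0 - np.abs(Y[:, None, :] - Y[None, :, :]).mean(axis=2)


def fuzzy_mutual_info(RA, RB):
    """Fuzzy mutual information of two fuzzy relations over the same samples.

    Uses fuzzy cardinalities of each sample's similarity class and the
    min operator for the intersection of the two relations.
    """
    n = RA.shape[0]
    card_a = RA.sum(axis=1)
    card_b = RB.sum(axis=1)
    card_ab = np.minimum(RA, RB).sum(axis=1)  # >= 1 because R[i, i] == 1
    return float(-np.mean(np.log2(card_a * card_b / (n * card_ab))))


def greedy_select(X, Y, k):
    """Greedily pick k feature indices: relevance minus mean redundancy."""
    X = np.asarray(X, dtype=float)
    span = X.max(axis=0) - X.min(axis=0)
    span[span == 0] = 1.0  # constant features map to all zeros
    Xs = (X - X.min(axis=0)) / span  # min-max scale each feature to [0, 1]

    rels = [fuzzy_relation(Xs[:, j]) for j in range(X.shape[1])]
    RL = label_relation(Y)
    relevance = [fuzzy_mutual_info(R, RL) for R in rels]

    selected, remaining = [], list(range(X.shape[1]))
    while remaining and len(selected) < k:
        def score(j):
            if not selected:
                return relevance[j]
            redundancy = np.mean(
                [fuzzy_mutual_info(rels[j], rels[s]) for s in selected]
            )
            return relevance[j] - redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected
```

Calling `greedy_select(X, Y, k)` with an `(n_samples, n_features)` array `X` and an `(n_samples, n_labels)` array `Y` returns the chosen column indices in selection order. Subtracting the mean pairwise redundancy is one common design choice; a method that also rewards feature interaction, as the abstract describes, would need an additional term.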
Pages: 14