Novel multi-label feature selection via label symmetric uncertainty correlation learning and feature redundancy evaluation

被引:67
作者
Dai, Jianhua [1 ,2 ]
Chen, Jiaolong [1 ,2 ]
Liu, Ye [1 ,2 ]
Hu, Hu [1 ]
机构
[1] Hunan Normal Univ, Hunan Prov Key Lab Intelligent Comp & Language In, Changsha 410081, Peoples R China
[2] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label feature selection; Symmetric uncertainty; Fuzzy mutual information; Feature redundancy evaluation; ATTRIBUTE SELECTION; MUTUAL INFORMATION; CLASSIFICATION; RECOGNITION; REDUCTION;
D O I
10.1016/j.knosys.2020.106342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label data with high dimensionality, widely existed in the real world, bring many challenges to the applications of machine learning, pattern recognition and other fields. Scholars have proposed some multi-label feature selection methods from various aspects. However, there are few studies on the feature selection for multi-label data based on fuzzy mutual information, and most existing methods neglect the correlation between labels. In this study, we propose two novel multi-label feature selection approaches via label symmetric uncertainty correlation and feature redundancy evaluation. Firstly, we propose the concept of symmetric uncertainty correlation between labels via fuzzy mutual information, and design a label importance weight based on label symmetric uncertainty correlation learning. Further, we define a label similarity relation matrix on multi-label space via the label importance weight. Secondly, we define the symmetric uncertainty correlation between features and labels, and propose the first multi-label feature selection approach. Thirdly, considering the above-proposed method can only get a feature sequence and does not remove the redundancy features, we further propose an improved multi-label removing-redundancy feature selection approach through introducing feature redundancy evaluation. Finally, comprehensive experiments are executed to demonstrate the performance of our methods. The results illustrate that our study is better than other representative feature selection methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 61 条
[1]   RFBoost: An improved multi-label boosting algorithm and its application to text categorisation [J].
Al-Salemi, Bassam ;
Noah, Shahrul Azman Mohd ;
Ab Aziz, Mohd Juzaiddin .
KNOWLEDGE-BASED SYSTEMS, 2016, 103 :104-117
[2]  
[Anonymous], 2001, EUR C PRINC DAT MIN
[3]  
[Anonymous], 1994, 11 INT C MACHINE LEA, DOI 10.1016/B978-1-55860-335-6.50023-4
[4]   Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach [J].
Briggs, Forrest ;
Lakshminarayanan, Balaji ;
Neal, Lawrence ;
Fern, Xiaoli Z. ;
Raich, Raviv ;
Hadley, Sarah J. K. ;
Hadley, Adam S. ;
Betts, Matthew G. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (06) :4640-4650
[5]   Addressing imbalance in multilabel classification: Measures and random resampling algorithms [J].
Charte, Francisco ;
Rivera, Antonio J. ;
del Jesus, Maria J. ;
Herrera, Francisco .
NEUROCOMPUTING, 2015, 163 :3-16
[6]   A novel approach for learning label correlation with application to feature selection of multi-label data [J].
Che, Xiaoya ;
Chen, Degang ;
Mi, Jusheng .
INFORMATION SCIENCES, 2020, 512 :795-812
[7]   Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models [J].
Chen, Degang ;
Yang, Yanyan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) :1325-1334
[8]   A Rough Set-Based Method for Updating Decision Rules on Attribute Values' Coarsening and Refining [J].
Chen, Hongmei ;
Li, Tianrui ;
Luo, Chuan ;
Horng, Shi-Jinn ;
Wang, Guoyin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) :2886-2899
[9]   Extended adaptive Lasso for multi-class and multi-label feature selection [J].
Chen, Si-Bao ;
Zhang, Yu-Mei ;
Ding, Chris H. Q. ;
Zhang, Jian ;
Luo, Bin .
KNOWLEDGE-BASED SYSTEMS, 2019, 173 :28-36
[10]   Feature selection via normative fuzzy information weight with application into tumor classification [J].
Dai, Jianhua ;
Chen, Jiaolong .
APPLIED SOFT COMPUTING, 2020, 92