Novel multi-label feature selection via label symmetric uncertainty correlation learning and feature redundancy evaluation

被引:56
作者
Dai, Jianhua [1 ,2 ]
Chen, Jiaolong [1 ,2 ]
Liu, Ye [1 ,2 ]
Hu, Hu [1 ]
机构
[1] Hunan Normal Univ, Hunan Prov Key Lab Intelligent Comp & Language In, Changsha 410081, Peoples R China
[2] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label feature selection; Symmetric uncertainty; Fuzzy mutual information; Feature redundancy evaluation; ATTRIBUTE SELECTION; MUTUAL INFORMATION; CLASSIFICATION; RECOGNITION; REDUCTION;
D O I
10.1016/j.knosys.2020.106342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label data with high dimensionality, widely existed in the real world, bring many challenges to the applications of machine learning, pattern recognition and other fields. Scholars have proposed some multi-label feature selection methods from various aspects. However, there are few studies on the feature selection for multi-label data based on fuzzy mutual information, and most existing methods neglect the correlation between labels. In this study, we propose two novel multi-label feature selection approaches via label symmetric uncertainty correlation and feature redundancy evaluation. Firstly, we propose the concept of symmetric uncertainty correlation between labels via fuzzy mutual information, and design a label importance weight based on label symmetric uncertainty correlation learning. Further, we define a label similarity relation matrix on multi-label space via the label importance weight. Secondly, we define the symmetric uncertainty correlation between features and labels, and propose the first multi-label feature selection approach. Thirdly, considering the above-proposed method can only get a feature sequence and does not remove the redundancy features, we further propose an improved multi-label removing-redundancy feature selection approach through introducing feature redundancy evaluation. Finally, comprehensive experiments are executed to demonstrate the performance of our methods. The results illustrate that our study is better than other representative feature selection methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 61 条
  • [1] RFBoost: An improved multi-label boosting algorithm and its application to text categorisation
    Al-Salemi, Bassam
    Noah, Shahrul Azman Mohd
    Ab Aziz, Mohd Juzaiddin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 103 : 104 - 117
  • [2] Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach
    Briggs, Forrest
    Lakshminarayanan, Balaji
    Neal, Lawrence
    Fern, Xiaoli Z.
    Raich, Raviv
    Hadley, Sarah J. K.
    Hadley, Adam S.
    Betts, Matthew G.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (06) : 4640 - 4650
  • [3] Addressing imbalance in multilabel classification: Measures and random resampling algorithms
    Charte, Francisco
    Rivera, Antonio J.
    del Jesus, Maria J.
    Herrera, Francisco
    [J]. NEUROCOMPUTING, 2015, 163 : 3 - 16
  • [4] A novel approach for learning label correlation with application to feature selection of multi-label data
    Che, Xiaoya
    Chen, Degang
    Mi, Jusheng
    [J]. INFORMATION SCIENCES, 2020, 512 (512) : 795 - 812
  • [5] Attribute Reduction for Heterogeneous Data Based on the Combination of Classical and Fuzzy Rough Set Models
    Chen, Degang
    Yang, Yanyan
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (05) : 1325 - 1334
  • [6] A Rough Set-Based Method for Updating Decision Rules on Attribute Values' Coarsening and Refining
    Chen, Hongmei
    Li, Tianrui
    Luo, Chuan
    Horng, Shi-Jinn
    Wang, Guoyin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) : 2886 - 2899
  • [7] Extended adaptive Lasso for multi-class and multi-label feature selection
    Chen, Si-Bao
    Zhang, Yu-Mei
    Ding, Chris H. Q.
    Zhang, Jian
    Luo, Bin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 173 : 28 - 36
  • [8] Clare A., 2001, P EUR C PRINC DAT MI, V2168, P42
  • [9] Feature selection via normative fuzzy information weight with application into tumor classification
    Dai, Jianhua
    Chen, Jiaolong
    [J]. APPLIED SOFT COMPUTING, 2020, 92
  • [10] Maximal-Discernibility-Pair-Based Approach to Attribute Reduction in Fuzzy Rough Sets
    Dai, Jianhua
    Hu, Hu
    Wu, Wei-Zhi
    Qian, Yuhua
    Huang, Debiao
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (04) : 2174 - 2187