Mutual information-based label distribution feature selection for multi-label learning

Times cited: 52
Authors
Qian, Wenbin [1, 2]
Huang, Jintao [3]
Wang, Yinglong [3]
Shu, Wenhao [4]
Affiliations
[1] Jiangxi Agr Univ, Sch Software, Nanchang 330045, Jiangxi, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Jiangxi Agr Univ, Sch Comp & Informat Engn, Nanchang 330045, Jiangxi, Peoples R China
[4] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Jiangxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature selection; Multi-label data; Granular computing; Label enhancement; Mutual information; STREAMING FEATURE-SELECTION; ATTRIBUTE REDUCTION; MISSING LABELS; CLASSIFICATION; GRAPH; ACCELERATOR; ALGORITHM; DECISION; SPARSE;
DOI
10.1016/j.knosys.2020.105684
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection, used to reduce the dimensionality of the feature space, plays an important role in multi-label learning where high-dimensional data are involved. Most existing multi-label feature selection approaches handle label ambiguity under the assumption that logical labels are uniformly distributed, so they cannot be applied to the many practical applications in which the significance of each related label differs from instance to instance. To address this issue, this study introduces label distribution learning, in which labels are described by real numbers, to model labeling significance. However, multi-label feature selection is limited to handling logical labels. To solve this problem, we combine random variable distributions with granular computing and first propose a label enhancement algorithm that transforms the logical labels in multi-label data into label distributions carrying more supervised information, which mines the hidden label significance of every instance. On this basis, to remove redundant or irrelevant features from multi-label data, we develop a label distribution feature selection algorithm using mutual information and label enhancement. Finally, experimental results show that the proposed method outperforms other state-of-the-art approaches on multi-label data. (C) 2020 Elsevier B.V. All rights reserved.
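As a rough illustration of the mutual-information criterion mentioned in the abstract, the sketch below ranks discrete features by their summed empirical mutual information with each label. It is a generic filter on logical (0/1) labels, not the paper's label enhancement or label distribution algorithm; the function names `mutual_information` and `rank_features` and the toy data are assumptions made for this example.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information I(X;Y) in bits for two discrete sequences."""
    n = len(xs)
    px = Counter(xs)
    py = Counter(ys)
    pxy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def rank_features(features, labels):
    """Score each feature by its summed MI with every label; highest first."""
    scores = {
        name: sum(mutual_information(col, lab) for lab in labels.values())
        for name, col in features.items()
    }
    order = sorted(scores, key=scores.get, reverse=True)
    return order, scores


# Hypothetical toy data: four instances, two features, one logical label.
features = {
    "f1": [0, 0, 1, 1],  # identical to label "a", so fully informative
    "f2": [0, 1, 0, 1],  # statistically independent of label "a"
}
labels = {"a": [0, 0, 1, 1]}

order, scores = rank_features(features, labels)
print(order, scores)
```

In this toy case `f1` carries one bit of information about the label and `f2` carries none, so `f1` is ranked first. A label distribution variant would replace the 0/1 label columns with real-valued description degrees, which this sketch does not attempt.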
Pages: 24