SCLS: Multi-label feature selection based on scalable criterion for large label set

被引:128
作者
Lee, Jaesung [1 ]
Kim, Dae-Won [1 ]
机构
[1] Chung Ang Univ, Sch Comp Sci & Engn, 221 Heukseok Dong, Seoul 156756, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning; Multi-label learning; Multi-label feature selection; Relevance evaluation; Conditional relevance; MUTUAL INFORMATION; CLASSIFICATION;
D O I
10.1016/j.patcog.2017.01.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label feature selection involves the selection of relevant features from multi-labeled datasets, resulting in a potential improvement of multi-label learning accuracy. In conventional multi-label feature selection methods, the final feature subset is obtained by identifying the features of high relevance with low redundancy. Thus, accurate score evaluation is a key factor for obtaining an effective feature subset. However, conventional methods suffer from inaccurate conditional relevance evaluation when a large number of labels are involved. As a result, irrelevant features can be a member of the final feature subset, leading to low multi-label learning accuracy. In this paper, we propose a new multi-label feature selection method. Using a scalable relevance evaluation process that evaluates conditional relevance more accurately, the proposed method significantly improves multi-label learning accuracy compared with conventional multi-label feature selection methods.
引用
收藏
页码:342 / 352
页数:11
相关论文
共 45 条
[1]  
[Anonymous], 2013, WWW 13
[2]  
[Anonymous], 2008, Proceedings of the ECML/PKDD Discovery Chanllenge
[3]  
[Anonymous], 2008, ISMIR
[4]  
[Anonymous], 2012, ACM T INTEL SYST TEC, DOI DOI 10.1145/2168752.2168754
[5]   LAIM discretization for multi-label data [J].
Cano, Alberto ;
Maria Luna, Jose ;
Gibaja, Eva L. ;
Ventura, Sebastian .
INFORMATION SCIENCES, 2016, 330 :370-384
[6]   MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation [J].
Charte, Francisco ;
Rivera, Antonio J. ;
del Jesus, Maria J. ;
Herrera, Francisco .
KNOWLEDGE-BASED SYSTEMS, 2015, 89 :385-397
[7]  
Cover T.M., 1991, Elements of information theory, V6
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]   Mutual information-based feature selection for multilabel classification [J].
Doquire, Gauthier ;
Verleysen, Michel .
NEUROCOMPUTING, 2013, 122 :148-155
[10]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&