Multilabel feature selection: A comprehensive review and guiding experiments

Cited by: 119
Authors
Kashef, Shima [1, 2]
Nezamabadi-pour, Hossein [1, 2]
Nikpour, Bahareh [1, 2]
Affiliations
[1] Shahid Bahonar Univ Kerman, Dept Elect Engn, IDPL, POB 76619-133, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Mahani Math Res Ctr, Kerman, Iran
Keywords
feature selection; multi-label data; classification; data mining; LABEL FEATURE-SELECTION; SUPERVISED FEATURE-SELECTION; FEATURE SUBSET-SELECTION; FEATURE RANKING; CLASSIFICATION; ALGORITHM; ENSEMBLE; INFORMATION; DATASETS; GRAPH;
DOI
10.1002/widm.1240
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Feature selection has long been an important issue in machine learning and data mining, and it is unavoidable when confronting high-dimensional data. With the advent of multilabel (ML) datasets and their many applications, feature selection methods have been developed for dimensionality reduction and for improving classification performance. In this work, we provide a comprehensive review of existing multilabel feature selection (ML-FS) methods and categorize them from different perspectives. Because feature selection and data classification are closely related, we also review ML learning algorithms. To facilitate research in this field, a section on setup and benchmarking presents evaluation measures, standard datasets, and existing software for ML data. At the end of this survey, we discuss some challenges and open problems that researchers can pursue in the future. This article is categorized under: Technologies > Data Preprocessing
Pages: 29