FCBF3Rules: A Feature Selection Method for Multi-Label Datasets

被引：0

作者：

Kashef, Shima ^{[1
]}

Nezamabadi-pour, Hossein ^{[1
]}

Nikpour, Bahareh ^{[1
]}

机构：

[1] Shahid Bahonar Univ Kerman, Intelligent Data Proc Lab IDPL, Kerman, Iran

来源：

2018 3RD CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC2018), VOL 3 | 2018年

关键词：

Multi-label datasets; feature selection; FCBF; TRANSFORMATION; ALGORITHM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a novel multi-label feature selection algorithm is introduced based on fast correlation-based filter (FCBF) feature selection method, which is a filter approach for single-label datasets. The strategy of FCBF is that first, it eliminates the features that are irrelevant to classes. Unlike many filter methods which stop on this step, FCBF finds redundant features among relevant features that are remained from the previous step and eliminates them. Therefore, this is one of the most successful single-labels methods in finding the most effective and the smallest feature subset. Extending the step of finding the relevant features in multi-label datasets is not a difficult task. However, in the step of eliminating redundant features, FCBF may removes many effective features. It should be noticed that in multi-label datasets, one feature may be able to distinguish samples that are relevant to a label while another feature is suitable for another label. Hence, these two features cannot be considered to be redundant and one of them cannot be removed. The main contribution of this paper corresponds to the step in which effective and useful features are distinguished from redundant ones in FCBF method. To do so, three rules are implemented and when even one of these rules is not fulfilled, the feature is not removed. The proposed method along with three recently proposed multi-label feature selection methods are applied on 6 standard multi-label datasets for evaluation. The obtained results indicate the strong capability of the proposed algorithm to find the best feature subset, compared to other algorithms.

引用

页码：43 / 47

页数：5

共 22 条

[11] Kashef S, 2013, 2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), P50, DOI 10.1109/IKT.2013.6620037
[12] An advanced ACO algorithm for feature subset selection
Kashef, Shima
Nezamabadi-pour, Hossein
[J]. NEUROCOMPUTING, 2015, 147 : 271 - 279
[13] Memetic feature selection algorithm for multi-label classification
Lee, Jaesung
Kim, Dae-Won
[J]. INFORMATION SCIENCES, 2015, 293 : 80 - 96
[14] Feature selection for multi-label classification using multivariate mutual information
Lee, Jaesung
Kim, Dae-Won
[J]. PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357
[15] Multi-label feature selection based on max-dependency and min-redundancy
Lin, Yaojin
Hu, Qinghua
Liu, Jinghua
Duan, Jie
[J]. NEUROCOMPUTING, 2015, 168 : 92 - 103
[16] Read J., 2008, P 2008 NZ COMP SCI R
[17] Multi-label Classification using Ensembles of Pruned Sets
Read, Jesse
Pfahringer, Bernhard
Holmes, Geoff
[J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 995 - 1000
[18] Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context
Reyes, Oscar
Morell, Carlos
Ventura, Sebastian
[J]. NEUROCOMPUTING, 2015, 161 : 168 - 182
[19] A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach
Spolaor, Newton
Cherman, Everton Alvares
Monard, Maria Carolina
Lee, Huei Diana
[J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 292 : 135 - 151
[20] ReliefF for Multi-label Feature Selection
Spolaor, Newton
Cherman, Everton Alvares
Monard, Maria Carolina
Lee, Huei Diana
[J]. 2013 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2013, : 6 - 11

← 1 2 3 →