FCBF3Rules: A Feature Selection Method for Multi-Label Datasets

被引:0
作者
Kashef, Shima [1 ]
Nezamabadi-pour, Hossein [1 ]
Nikpour, Bahareh [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Intelligent Data Proc Lab IDPL, Kerman, Iran
来源
2018 3RD CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC2018), VOL 3 | 2018年
关键词
Multi-label datasets; feature selection; FCBF; TRANSFORMATION; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel multi-label feature selection algorithm is introduced based on fast correlation-based filter (FCBF) feature selection method, which is a filter approach for single-label datasets. The strategy of FCBF is that first, it eliminates the features that are irrelevant to classes. Unlike many filter methods which stop on this step, FCBF finds redundant features among relevant features that are remained from the previous step and eliminates them. Therefore, this is one of the most successful single-labels methods in finding the most effective and the smallest feature subset. Extending the step of finding the relevant features in multi-label datasets is not a difficult task. However, in the step of eliminating redundant features, FCBF may removes many effective features. It should be noticed that in multi-label datasets, one feature may be able to distinguish samples that are relevant to a label while another feature is suitable for another label. Hence, these two features cannot be considered to be redundant and one of them cannot be removed. The main contribution of this paper corresponds to the step in which effective and useful features are distinguished from redundant ones in FCBF method. To do so, three rules are implemented and when even one of these rules is not fulfilled, the feature is not removed. The proposed method along with three recently proposed multi-label feature selection methods are applied on 6 standard multi-label datasets for evaluation. The obtained results indicate the strong capability of the proposed algorithm to find the best feature subset, compared to other algorithms.
引用
收藏
页码:43 / 47
页数:5
相关论文
共 22 条
  • [11] Kashef S, 2013, 2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), P50, DOI 10.1109/IKT.2013.6620037
  • [12] An advanced ACO algorithm for feature subset selection
    Kashef, Shima
    Nezamabadi-pour, Hossein
    [J]. NEUROCOMPUTING, 2015, 147 : 271 - 279
  • [13] Memetic feature selection algorithm for multi-label classification
    Lee, Jaesung
    Kim, Dae-Won
    [J]. INFORMATION SCIENCES, 2015, 293 : 80 - 96
  • [14] Feature selection for multi-label classification using multivariate mutual information
    Lee, Jaesung
    Kim, Dae-Won
    [J]. PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357
  • [15] Multi-label feature selection based on max-dependency and min-redundancy
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Duan, Jie
    [J]. NEUROCOMPUTING, 2015, 168 : 92 - 103
  • [16] Read J., 2008, P 2008 NZ COMP SCI R
  • [17] Multi-label Classification using Ensembles of Pruned Sets
    Read, Jesse
    Pfahringer, Bernhard
    Holmes, Geoff
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 995 - 1000
  • [18] Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context
    Reyes, Oscar
    Morell, Carlos
    Ventura, Sebastian
    [J]. NEUROCOMPUTING, 2015, 161 : 168 - 182
  • [19] A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach
    Spolaor, Newton
    Cherman, Everton Alvares
    Monard, Maria Carolina
    Lee, Huei Diana
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 292 : 135 - 151
  • [20] ReliefF for Multi-label Feature Selection
    Spolaor, Newton
    Cherman, Everton Alvares
    Monard, Maria Carolina
    Lee, Huei Diana
    [J]. 2013 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2013, : 6 - 11