FCBF3Rules: A Feature Selection Method for Multi-Label Datasets

被引:0
作者
Kashef, Shima [1 ]
Nezamabadi-pour, Hossein [1 ]
Nikpour, Bahareh [1 ]
机构
[1] Shahid Bahonar Univ Kerman, Intelligent Data Proc Lab IDPL, Kerman, Iran
来源
2018 3RD CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC2018), VOL 3 | 2018年
关键词
Multi-label datasets; feature selection; FCBF; TRANSFORMATION; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel multi-label feature selection algorithm is introduced based on fast correlation-based filter (FCBF) feature selection method, which is a filter approach for single-label datasets. The strategy of FCBF is that first, it eliminates the features that are irrelevant to classes. Unlike many filter methods which stop on this step, FCBF finds redundant features among relevant features that are remained from the previous step and eliminates them. Therefore, this is one of the most successful single-labels methods in finding the most effective and the smallest feature subset. Extending the step of finding the relevant features in multi-label datasets is not a difficult task. However, in the step of eliminating redundant features, FCBF may removes many effective features. It should be noticed that in multi-label datasets, one feature may be able to distinguish samples that are relevant to a label while another feature is suitable for another label. Hence, these two features cannot be considered to be redundant and one of them cannot be removed. The main contribution of this paper corresponds to the step in which effective and useful features are distinguished from redundant ones in FCBF method. To do so, three rules are implemented and when even one of these rules is not fulfilled, the feature is not removed. The proposed method along with three recently proposed multi-label feature selection methods are applied on 6 standard multi-label datasets for evaluation. The obtained results indicate the strong capability of the proposed algorithm to find the best feature subset, compared to other algorithms.
引用
收藏
页码:43 / 47
页数:5
相关论文
共 22 条
  • [1] [Anonymous], 2013, IBEROAMERICAN C PATT, DOI DOI 10.1007/978-3-642-41827-3.66
  • [2] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [3] Document transformation for multi-label feature selection in text categorization
    Chen, Weizhu
    Yan, Jun
    Zhang, Benyu
    Chen, Zheng
    Yang, Qiang
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 451 - +
  • [4] Mutual information-based feature selection for multilabel classification
    Doquire, Gauthier
    Verleysen, Michel
    [J]. NEUROCOMPUTING, 2013, 122 : 148 - 155
  • [5] Doquire G, 2011, LECT NOTES COMPUT SC, V6691, P9, DOI 10.1007/978-3-642-21501-8_2
  • [6] El Kafrawy P., 2015, INT J COMPUTER APPL, V114
  • [7] FAYYAD UM, 1993, IJCAI-93, VOLS 1 AND 2, P1022
  • [8] Research on collaborative negotiation for e-commerce.
    Feng, YQ
    Lei, Y
    Li, Y
    Cao, RZ
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2085 - 2088
  • [9] Kashef S., 2017, ACCEPTED PUBLICATION
  • [10] Kashef S, 2017, 2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), P21, DOI 10.1109/CSIEC.2017.7940162