A noise resistant dependency measure for rough set-based feature selection

被引:1
作者
Javidi, Mohammad Masoud [1 ]
Eskandari, Sadegh [2 ,3 ]
机构
[1] Shahid Bahonar Univ Kerman, Dept Comp Sci, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Dept Appl Math, Kerman, Iran
[3] Univ Guilan, Dept Comp Sci, Rasht, Iran
关键词
Feature selection; rough sets theory; impurity measure; noise resistant dependency; INCREMENTAL APPROACH; ATTRIBUTE REDUCTION; KNOWLEDGE;
D O I
10.3233/JIFS-16853
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of feature selection (FS) is to select a small subset of most important and discriminative features. Many FS approaches based on rough set theory up to now, have employed reduct analysis using feature dependency measures. However the critical shortcoming for such approaches is that they are not able to manage useful information that may be destroyed by noise elements. Therefore several extensions to the original theory have been proposed. Three notable extensions are fuzzy rough set (FRS), variable precision rough set (VPRS), and tolerance rough set model (TRSM). Although successful, each of the extensions exhibits a critical shortcoming which makes that extension inapplicable in most of scenarios. For example, FRS is able to describe the existing dependencies between different attributes accurately, but its high run-times makes it inapplicable to larger datasets. As another e-ample, VPR is very fast, but requires more information than contained within the data itself, which is inaccessible for most of the applications. This paper e-amines a rough set FS technique which uses a noise resistant dependency measure to quantify information that may be hidden due to the noise elements. E-perimental results demonstrate that the use of this measure can result more discriminative reducts than those obtained using other RSFS approaches. Moreover, the proposed measure is as fast as VPRS and as accurate as FRS and TRSM, while it need no additional information other than contained within the data.
引用
收藏
页码:1613 / 1626
页数:14
相关论文
共 37 条
  • [1] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [2] Cohen WW., 1995, P 12 INT C MACH LEAR, V1995, P115, DOI [DOI 10.1016/B978-1-55860-377-6.50023-2, 10.1016/b978-1-55860-377-6.50023-2, 10.1016/B978-1-55860-377-6.50023-2]
  • [3] Entropy measures and granularity measures for set-valued information systems
    Dai, Jianhua
    Tian, Haowei
    [J]. INFORMATION SCIENCES, 2013, 240 : 72 - 82
  • [4] On the optimality of the simple Bayesian classifier under zero-one loss
    Domingos, P
    Pazzani, M
    [J]. MACHINE LEARNING, 1997, 29 (2-3) : 103 - 130
  • [5] Dubois D., 1992, INTELLIGENT DECISION, P203, DOI [10.1007/978-94-015-7975-9_14, DOI 10.1007/978-94-015-7975-9_14, 10.1007/978-94-015-7975-9 14, DOI 10.1007/978-94-015-7975-914]
  • [6] Online streaming feature selection using rough sets
    Eskandari, S.
    Javidi, M. M.
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2016, 69 : 35 - 57
  • [7] Rough sets theory for multicriteria decision analysis
    Greco, S
    Matarazzo, B
    Slowinski, R
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 129 (01) : 1 - 47
  • [8] Guyon I, 2003, J MACH LEARN RES, V3, P1157, DOI DOI 10.1162/153244303322753616
  • [9] Rough set approach for attribute reduction and rule generation: A case of patients with suspected breast cancer
    Hassanien, AE
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (11): : 954 - 962
  • [10] Tabu search for attribute reduction in rough set theory
    Hedar, Abdel-Rahman
    Wang, Jue
    Fukushima, Masao
    [J]. SOFT COMPUTING, 2008, 12 (09) : 909 - 918