Cost-sensitive feature selection based on adaptive neighborhood granularity with multi-level confidence

被引:64
作者
Zhao, Hong [1 ,2 ]
Wang, Ping [1 ,3 ]
Hu, Qinghua [4 ,5 ]
机构
[1] Tianjin Univ, Sch Comp Software, Tianjin 300072, Peoples R China
[2] Minnan Normal Univ, Lab Granular Comp, Zhangzhou 363000, Peoples R China
[3] Tianjin Univ, Sch Sci, Tianjin 300072, Peoples R China
[4] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
[5] Tianjin Key Lab Cognit Comp & Applicat, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Cost-sensitive learning; Feature selection; Granular computing; Neighborhood granularity; Neighborhood rough sets; ROUGH SETS; ATTRIBUTE REDUCTION; SYSTEMS; CLASSIFICATION;
D O I
10.1016/j.ins.2016.05.025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neighborhood rough set model is considered as one of the effective granular computing models in dealing with numerical data. This model is now widely discussed in feature selection and rule learning. However, there is no theoretical analysis on the issue of neighborhood granularity selection, the influence of sampling resolution, test and misclassification costs on modeling. In this paper, we design an adaptive neighborhood rough set model according to data precision and develop a fast backtracking algorithm for neighborhood rough sets based cost-sensitive feature selection by considering the trade-off between test costs and misclassification costs. In the proposed model, the neighborhood granularity, based on the 3 sigma rule of statistics, is adaptive to data precision that is described by the multi-level confidence of the feature subsets. Our experiments, thoroughly performed on 12 datasets, demonstrate the effectiveness of the model and the efficiency of the backtracking algorithm. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:134 / 149
页数:16
相关论文
共 52 条
  • [1] Aggarwal C.C., 2007, P IEEE 23 INT C DAT
  • [2] [Anonymous], 2013, J INF COMPUT SCI, DOI DOI 10.12733/JICS20102163
  • [3] ON MULTI-CLASS COST-SENSITIVE LEARNING
    Zhou, Zhi-Hua
    Liu, Xu-Ying
    [J]. COMPUTATIONAL INTELLIGENCE, 2010, 26 (03) : 232 - 257
  • [4] Casella G., 2002, STAT INFERENCE
  • [5] Qualitative test-cost sensitive classification
    Cebe, Mumin
    Gunduz-Demir, Cigdem
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (13) : 2043 - 2051
  • [6] Chau M, 2006, LECT NOTES ARTIF INT, V3918, P199
  • [7] A rough set approach to feature selection based on ant colony optimization
    Chen, Yumin
    Miao, Duoqian
    Wang, Ruizhi
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (03) : 226 - 233
  • [8] Fisher RA., 1922, PHILOS T R SOC A, V222, P309, DOI [10.1098/rsta.1922.0009, DOI 10.1098/RSTA.1922.0009]
  • [9] A granular description of ECG signals
    Gacek, Adam
    Pedrycz, Witold
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (10) : 1972 - 1982
  • [10] Granular modelling of signals: A framework of Granular Computing
    Gacek, Adam
    [J]. INFORMATION SCIENCES, 2013, 221 : 1 - 11