Data-Distribution-Aware Fuzzy Rough Set Model and its Application to Robust Classification

被引:60
作者
An, Shuang [1 ,2 ]
Hu, Qinghua [3 ,4 ]
Pedrycz, Witold [5 ,6 ]
Zhu, Pengfei [3 ,4 ]
Tsang, Eric C. C. [7 ]
机构
[1] Northeastern Univ, Shenyang, Peoples R China
[2] Tianjin Univ, Tianjin 150001, Peoples R China
[3] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 150001, Peoples R China
[4] Tianjin Key Lab Cognit Comp & Applicat, Tianjin 150001, Peoples R China
[5] King Abdulaziz Univ, Dept Elect & Comp Engn, Fac Engn, Jeddah 21589, Saudi Arabia
[6] Polish Acad Sci, Syst Res Inst, PL-00901 Warsaw, Poland
[7] Macau Univ Sci & Technol, Fac Informat Technol, Macau 519020, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Data distribution; fuzzy rough covering; fuzzy rough sets (FRSs); prototype selection; robust classification; SELECTION; REGRESSION; FRPS;
D O I
10.1109/TCYB.2015.2496425
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fuzzy rough sets (FRSs) are considered to be a powerful model for analyzing uncertainty in data. This model encapsulates two types of uncertainty: 1) fuzziness coming from the vagueness in human concept formation and 2) roughness rooted in the granulation coming with human cognition. The rough set theory has been widely applied to feature selection, attribute reduction, and classification. However, it is reported that the classical FRS model is sensitive to noisy information. To address this problem, several robust models have been developed in recent years. Nevertheless, these models do not consider a statistical distribution of data, which is an important type of uncertainty. Data distribution serves as crucial information for designing an optimal classification or regression model. Thus, we propose a data-distribution-aware FRS model that considers distribution information and incorporates it in computing lower and upper fuzzy approximations. The proposed model considers not only the similarity between samples, but also the probability density of classes. In order to demonstrate the effectiveness of the proposed model, we design a new sample evaluation index for prototype-based classification based on the model, and a prototype selection algorithm is developed using this index. Furthermore, a robust classification algorithm is constructed with prototype covering and nearest neighbor classification. Experimental results confirm the robustness and effectiveness of the proposed model.
引用
收藏
页码:3073 / 3085
页数:13
相关论文
共 44 条
  • [1] INSTANCE-BASED LEARNING ALGORITHMS
    AHA, DW
    KIBLER, D
    ALBERT, MK
    [J]. MACHINE LEARNING, 1991, 6 (01) : 37 - 66
  • [2] Fuzzy rough regression with application to wind speed prediction
    An, Shuang
    Shi, Hong
    Hu, Qinghua
    Li, Xiaoqi
    Dang, Jianwu
    [J]. INFORMATION SCIENCES, 2014, 282 : 388 - 400
  • [3] Soft Minimum-Enclosing-Ball Based Robust Fuzzy Rough Sets
    An, Shuang
    Hu, Qinghua
    Yu, Daren
    Liu, Jinfu
    [J]. FUNDAMENTA INFORMATICAE, 2012, 115 (2-3) : 189 - 202
  • [4] [Anonymous], 1993, MACH LEARN
  • [5] Support vector machines for histogram-based image classification
    Chapelle, O
    Haffner, P
    Vapnik, VN
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1055 - 1064
  • [6] A Novel Algorithm for Finding Reducts With Fuzzy Rough Sets
    Chen, Degang
    Zhang, Lei
    Zhao, Suyun
    Hu, Qinghua
    Zhu, Pengfei
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (02) : 385 - 389
  • [7] Cornelis C, 2007, LECT NOTES ARTIF INT, V4482, P87
  • [8] Cornelis C, 2010, LECT NOTES ARTIF INT, V6401, P78, DOI 10.1007/978-3-642-16248-0_16
  • [9] An Uncertainty Measure for Incomplete Decision Tables and Its Applications
    Dai, Jianhua
    Wang, Wentao
    Xu, Qing
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (04) : 1277 - 1289
  • [10] Feature Selection Inspired Classifier Ensemble Reduction
    Diao, Ren
    Chao, Fei
    Peng, Taoxin
    Snooke, Neal
    Shen, Qiang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (08) : 1259 - 1268