Feature subset selection based on fuzzy neighborhood rough sets

被引:211
作者
Wang, Changzhong [1 ]
Shao, Mingwen [2 ]
He, Qiang [3 ]
Qian, Yuhua [4 ]
Qi, Yali [1 ]
机构
[1] Bohai Univ, Dept Math, Jinzhou 121000, Peoples R China
[2] Chinese Univ Petr, Coll Comp & Commun Engn, Qingdao 266580, Shandong, Peoples R China
[3] Beijing Univ Civil Engn & Architecture, Coll Sci, Beijing 100044, Peoples R China
[4] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China
基金
中国国家自然科学基金;
关键词
Fuzzy neighborhood; Fuzzy decision; Feature selection; Rough set model; ATTRIBUTE REDUCTION;
D O I
10.1016/j.knosys.2016.08.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rough set theory has been extensively discussed in machine learning and pattern recognition. It provides us another important theoretical tool for feature selection. In this paper, we construct a novel rough set model for feature subset selection. First, we define the fuzzy decision of a sample by using the concept of fuzzy neighborhood. A parameterized fuzzy relation is introduced to characterize fuzzy information granules for analysis of real-valued data. Then, we use the relationship between fuzzy neighborhood and fuzzy decision to construct a new rough set model: fuzzy neighborhood rough set model. Based on this model, the definitions of upper and lower approximation, boundary region and positive region are given, and the effects of parameters on these concepts are discussed. To make the new model tolerate noises in data, we introduce a variable-precision fuzzy neighborhood rough set model. This model can decrease the possibility that a sample is classified into a wrong category. Finally, we define the dependency between fuzzy decision and condition attributes and employ the dependency to evaluate the significance of a candidate feature, using which a greedy feature subset selection algorithm is designed. The proposed algorithm is compared with some classical algorithms. The experiments show that the proposed algorithm gets higher classification performance and the numbers of selected features are relatively small. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:173 / 179
页数:7
相关论文
共 26 条
[1]  
[Anonymous], 2000, P 17 INT C MACH LEAR
[2]   On the compact computational domain of fuzzy-rough sets [J].
Bhatt, RB ;
Gopal, M .
PATTERN RECOGNITION LETTERS, 2005, 26 (11) :1632-1640
[3]   A Novel Algorithm for Finding Reducts With Fuzzy Rough Sets [J].
Chen, Degang ;
Zhang, Lei ;
Zhao, Suyun ;
Hu, Qinghua ;
Zhu, Pengfei .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (02) :385-389
[4]  
Cornelis C., 2007, Inf. Sci., V177, P3
[5]   Consistency-based search in feature selection [J].
Dash, M ;
Liu, HA .
ARTIFICIAL INTELLIGENCE, 2003, 151 (1-2) :155-176
[6]   ROUGH FUZZY-SETS AND FUZZY ROUGH SETS [J].
DUBOIS, D ;
PRADE, H .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 17 (2-3) :191-209
[7]  
Dudek W.A., 2005, INT J MATH MATH SCI, V3, P437, DOI DOI 10.1155/IJMMS.2005.437
[8]   Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].
Hu, QH ;
Yu, DR ;
Xie, ZX .
PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423
[9]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594
[10]   Kernelized Fuzzy Rough Sets and Their Applications [J].
Hu, Qinghua ;
Yu, Daren ;
Pedrycz, Witold ;
Chen, Degang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (11) :1649-1667