Fuzzy-Rough Simultaneous Attribute Selection and Feature Extraction Algorithm

Cited by: 41
Authors
Maji, Pradipta [1]
Garai, Partha [1]
Affiliations
[1] Indian Statistical Institute, Machine Intelligence Unit, Kolkata 700108, India
Keywords
Attribute selection; classification; feature extraction; pattern recognition; rough sets; REDUCTION; CLASSIFICATION; INFORMATION
DOI
10.1109/TSMCB.2012.2225832
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Among the huge number of attributes or features present in real-life data sets, only a small fraction of them are effective to represent the data set accurately. Prior to analysis of the data set, selecting or extracting relevant and significant features is an important preprocessing step used for pattern recognition, data mining, and machine learning. In this regard, a novel dimensionality reduction method, based on fuzzy-rough sets, that simultaneously selects attributes and extracts features using the concept of feature significance is presented. The method is based on maximizing both the relevance and significance of the reduced feature set, whereby redundancy therein is removed. This paper also presents classical and neighborhood rough sets for computing the relevance and significance of the feature set and compares their performances with that of fuzzy-rough sets based on the predictive accuracy of nearest neighbor rule, support vector machine, and decision tree. An important finding is that the proposed dimensionality reduction method based on fuzzy-rough sets is shown to be more effective for generating a relevant and significant feature subset. The effectiveness of the proposed fuzzy-rough-set-based dimensionality reduction method, along with a comparison with existing attribute selection and feature extraction methods, is demonstrated on real-life data sets.
Pages: 1166-1177
Number of pages: 12