Feature selection via neighborhood multi-granulation fusion

被引:75
作者
Lin, Yaojin [1 ]
Li, Jinjin [1 ,2 ]
Lin, Peirong [1 ]
Lin, Guoping [2 ]
Chen, Jinkun [2 ]
机构
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China
[2] Minnan Normal Univ, Sch Math & Stat, Zhangzhou 363000, Peoples R China
基金
中国国家自然科学基金;
关键词
Granular computing; Feature selection; Multi-granulation; Neighborhood rough sets; Granularity influence; INFORMATION GRANULATION; ATTRIBUTE REDUCTION; ROUGH; SYSTEMS; RATIO;
D O I
10.1016/j.knosys.2014.05.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an important data preprocessing technique, and has been widely studied in data mining, machine learning, and granular computing. However, very little research has considered a multi-granulation perspective. In this paper, we present a new feature selection method that selects distinguishing features by fusing neighborhood multi-granulation. We first use neighborhood rough sets as an effective granular computing tool, and analyze the influence of the granularity of neighborhood information. Then, we obtain all feature rank lists based on the significance of features in different granularities. Finally, we obtain a new feature selection algorithm by fusing all individual feature rank lists. Experimental results show that the proposed method can effectively select a discriminative feature subset, and performs as well as or better than other popular feature selection algorithms in terms of classification performance. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:162 / 168
页数:7
相关论文
共 50 条
[1]  
[Anonymous], INT REV EC FINANCE
[2]  
Blake C. L., 1998, Uci repository of machine learning databases
[3]  
Brown G, 2012, J MACH LEARN RES, V13, P27
[4]   Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification [J].
Dai, Jianhua ;
Xu, Qing .
APPLIED SOFT COMPUTING, 2013, 13 (01) :211-221
[5]   Consistency-based search in feature selection [J].
Dash, M ;
Liu, HA .
ARTIFICIAL INTELLIGENCE, 2003, 151 (1-2) :155-176
[6]   Rule learning for classification based on neighborhood covering reduction [J].
Du, Yong ;
Hu, Qinghua ;
Zhu, Pengfei ;
Ma, Peijun .
INFORMATION SCIENCES, 2011, 181 (24) :5457-5467
[7]   Comparing top k lists [J].
Fagin, R ;
Kumar, R ;
Sivakumar, D .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 2003, 17 (01) :134-160
[8]  
Guyon I., 2003, J MACH LEARN RES, V3, P1157
[9]   Mixed feature selection based on granulation and approximation [J].
Hu, Qinghua ;
Liu, Jinfu ;
Yu, Daren .
KNOWLEDGE-BASED SYSTEMS, 2008, 21 (04) :294-304
[10]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594