A Gene selection approach based on the fisher linear discriminant and the neighborhood rough set

被引:30
作者
Sun, Lin [1 ,2 ,3 ]
Zhang, Xiaoyu [1 ]
Xu, Jiucheng [1 ]
Wang, Wei [1 ,3 ]
Liu, Ruonan [1 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang, Henan, Peoples R China
[2] Henan Normal Univ, Coll Life Sci, Postdoctoral Mobile Stn Biol, Xinxiang, Henan, Peoples R China
[3] Engn Technol Res Ctr Comp Intelligence & Data Min, Xinxiang, Henan, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Gene selection; Fisher linear discriminant; neighborhood rough set; reduction; UNCERTAINTY MEASURES; ATTRIBUTE REDUCTION; FEATURE-EXTRACTION; ALGORITHM; CLASSIFICATION; RELEVANCE;
D O I
10.1080/21655979.2017.1403678
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
In recent years, tumor classification based on gene expression profiles has drawn great attention, and related research results have been widely applied to the clinical diagnosis of major gene diseases. These studies are of tremendous importance for accurate cancer diagnosis and subtype recognition. However, the microarray data of gene expression profiles have small samples, high dimensionality, large noise and data redundancy. To further improve the classification performance of microarray data, a gene selection approach based on the Fisher linear discriminant (FLD) and the neighborhood rough set (NRS) is proposed. First, the FLD method is employed to reduce the preliminarily genetic data to obtain features with a strong classification ability, which can form a candidate gene subset. Then, neighborhood precision and neighborhood roughness are defined in a neighborhood decision system, and the calculation approaches for neighborhood dependency and the significance of an attribute are given. A reduction model of neighborhood decision systems is presented. Thus, a gene selection algorithm based on FLD and NRS is proposed. Finally, four public gene datasets are used in the simulation experiments. Experimental results under the SVM classifier demonstrate that the proposed algorithm is effective, and it can select a smaller and more well-classified gene subset, as well as obtain better classification performance.
引用
收藏
页码:144 / 151
页数:8
相关论文
共 34 条
[1]   A Hamming distance based binary particle swarm optimization (HDBPSO) algorithm for high dimensional feature selection, classification and validation [J].
Banka, Haider ;
Dara, Suresh .
PATTERN RECOGNITION LETTERS, 2015, 52 :94-100
[2]   Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection [J].
Belhumeur, PN ;
Hespanha, JP ;
Kriegman, DJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) :711-720
[3]   Privacy preserving sub-feature selection in distributed data mining [J].
Bhuyan, Hemanta Kumar ;
Kamila, Narendra Kumar .
APPLIED SOFT COMPUTING, 2015, 36 :552-569
[4]   A new feature extraction technique for on-line recognition of handwritten alphanumeric characters [J].
Chakraborty, B ;
Chakraborty, G .
INFORMATION SCIENCES, 2002, 148 (1-4) :55-70
[5]   Parallel attribute reduction in dominance-based neighborhood rough set [J].
Chen, Hongmei ;
Li, Tianrui ;
Cai, Yong ;
Luo, Chuan ;
Fujita, Hamido .
INFORMATION SCIENCES, 2016, 373 :351-368
[6]   Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification [J].
Dai, Jianhua ;
Xu, Qing .
APPLIED SOFT COMPUTING, 2013, 13 (01) :211-221
[7]   Unsupervised feature selection based on maximum information and minimum redundancy for hyperspectral images [J].
Feng, Jie ;
Jiao, Licheng ;
Liu, Fang ;
Sun, Tao ;
Zhang, Xiangrong .
PATTERN RECOGNITION, 2016, 51 :295-309
[8]  
Feng Liu, 2010, Journal of Biomedical Science & Engineering, V3, P217, DOI 10.4236/jbise.2010.32029
[9]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594
[10]   Kernel Nonparametric Weighted Feature Extraction for Hyperspectral Image Classification [J].
Kuo, Bor-Chen ;
Li, Cheng-Hsuan ;
Yang, Jinn-Min .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (04) :1139-1155