A Novel Feature Selection Method for High-Dimensional Biomedical Data Based on an Improved Binary Clonal Flower Pollination Algorithm

Cited by: 27
Authors
Yan, Chaokun [1]
Ma, Jingjing [1]
Luo, Huimin [1]
Zhang, Ge [1]
Luo, Junwei [2]
Affiliations
[1] Henan Univ, Sch Comp & Informat Engn, Kaifeng 475004, Peoples R China
[2] Henan Polytech Univ, Coll Comp Sci & Technol, Jiaozuo 454000, Henan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature selection; Microarray datasets; Clonal flower pollination algorithm; Absolute balance group strategy; Adaptive Gaussian mutation; OPTIMIZATION ALGORITHM;
DOI
10.1159/000501652
Chinese Library Classification number
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
In the biomedical field, large amounts of biological and clinical data have accumulated rapidly and can be analyzed to support the assessment of at-risk patients and improve diagnosis. However, a major challenge in biomedical data analysis is the so-called "curse of dimensionality." To address this issue, a novel feature selection method based on an improved binary clonal flower pollination algorithm is proposed to eliminate unnecessary features and ensure highly accurate disease classification. The absolute balance group strategy and adaptive Gaussian mutation are adopted to increase the diversity of the population and improve search performance. A KNN classifier is used to evaluate classification accuracy. Extensive experiments on six publicly available high-dimensional biomedical datasets show that the proposed method achieves high classification accuracy and outperforms other state-of-the-art methods.
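The abstract says a KNN classifier scores candidate feature subsets. As a rough illustration only, the sketch below shows how a binary feature mask might be scored by KNN accuracy. It is not the authors' implementation and does not include the pollination search, the absolute balance group strategy, or the Gaussian mutation. The function name `knn_loo_accuracy`, the leave-one-out evaluation, the choice `k=1`, the zero score for an empty subset, and the toy data are all assumptions made for this example.

```python
import math
from collections import Counter


def knn_loo_accuracy(X, y, mask, k=1):
    """Leave-one-out k-NN accuracy using only features where mask[j] is 1.

    Hypothetical helper: the record does not specify the validation
    scheme or the value of k, so both are assumptions here.
    """
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # assumption: an empty feature subset scores zero
    correct = 0
    for i, xi in enumerate(X):
        # Euclidean distance from sample i to every other sample,
        # restricted to the selected feature columns.
        dists = []
        for m, xm in enumerate(X):
            if m == i:
                continue
            d = math.sqrt(sum((xi[j] - xm[j]) ** 2 for j in cols))
            dists.append((d, y[m]))
        dists.sort(key=lambda t: t[0])
        # Majority vote among the k nearest neighbours.
        votes = Counter(label for _, label in dists[:k])
        if votes.most_common(1)[0][0] == y[i]:
            correct += 1
    return correct / len(X)


# Toy data (made up): feature 0 separates the classes, feature 1 is noise.
X = [[0.0, 5.0], [0.1, -3.0], [0.2, 4.0], [1.0, -4.0], [1.1, 5.1], [0.9, -3.1]]
y = [0, 0, 0, 1, 1, 1]

if __name__ == "__main__":
    for mask in ([1, 0], [0, 1], [1, 1]):
        print(mask, knn_loo_accuracy(X, y, mask))
```

In a wrapper method of this kind, a search procedure would propose many such masks and keep those with the best score; here the mask that keeps only the informative feature scores higher than the mask that keeps only the noise feature.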
Pages: 34-46
Page count: 13