Hybridizing mRMR and harmony search for gene selection and classification of microarray data

被引:0
作者
Wang, Aiguo [1 ]
An, Ning [1 ]
Chen, Guilin [2 ]
Li, Lian [1 ]
机构
[1] School of Computer and Information, Hefei University of Technology, Hefei
[2] School of Computer and Information Engineering, Chuzhou University, Chuzhou
来源
Journal of Computational Information Systems | 2015年 / 11卷 / 05期
关键词
Filter; Gene selection; Harmony search; Microarray data; Wrapper;
D O I
10.12733/jcis13210
中图分类号
学科分类号
摘要
Gene selection, aimed at eliminating noisy and irrelevant genes, plays a crucial role in the analysis of microarray data. In this paper, we propose to hybridize a filter and a wrapper method in selecting discriminative genes for the classification of microarray data. First, the minimum Redundancy Maximum Relevance (mRMR) algorithm is exploited to select a subset of genes that are relevant to the disease and less redundant to each other from the original gene space. Then, harmony search algorithm combined with a classifier works on the reduced gene subset in a stochastic way to further explore the gene subset and obtain a more discriminative gene subset. To verify the effectiveness of the proposed approach, two other well-performed gene selection algorithms, ReliefF and FCBF, are involved as well, and experimental comparisons on six publicly available microarray data were conducted with 1-Nearest-Neighbor and Naive Bayes classifiers, respectively. Experimental results demonstrate that our approach greatly improves the classification accuracy and outperforms others for both two-category and multi-category problems. Copyright © 2015 Binary Information Press.
引用
收藏
页码:1563 / 1570
页数:7
相关论文
共 16 条
[1]  
Van't Veer L.J., Dai H., Van De Vijver M.J., He Y.D., Hart A.A., Et al., Gene expression profiling predicts clinical outcome of breast cancer, Nature, 415, 6871, pp. 530-536, (2002)
[2]  
Piatetsky-Shapiro G., Tamayo P., Microarray data mining: facing the challenges, ACM SIGKDD Explorations Newsletter, 5, 2, pp. 1-5, (2003)
[3]  
Jain A.K., Duin R.P.W., Mao J., Statistical pattern recognition: A review, Pattern Analysis and Machine Intelligence, IEEE Transactions on, 22, 1, pp. 4-37, (2000)
[4]  
Boln-Canedo V., Snchez-Marono N., Alonso-Betanzos A., An ensemble of filters and classifiers for microarray data classification, Pattern Recognition, 45, 1, pp. 531-539, (2012)
[5]  
Guyon I., Weston J., Barnhill S., Vapnik V., Gene selection for cancer classification using support vector machines, Machine Learning, 46, 1-3, pp. 389-422, (2002)
[6]  
Gheyas I.A., Smith L.S., Feature subset selection in large dimensionality domains, Pattern Recognition, 43, 1, pp. 5-13, (2010)
[7]  
Kohavi R., John G.H., Wrappers for feature subset selection, Artificial intelligence, 97, 1, pp. 273-324, (1997)
[8]  
Ruiz R., Riquelme J.C., Aguilar-Ruiz J.S., Incremental wrapper-based gene selection from microarray data for cancer classification, Pattern Recognition, 39, 12, pp. 2383-2392, (2006)
[9]  
Ding C., Peng H., Minimum redundancy feature selection from microarray gene expression data, Journal of Bioinformatics and Computational Biology, 3, 2, pp. 185-205, (2005)
[10]  
Geem Z.W., Kim J.H., Loganathan G.V., A new heuristic optimization algorithm: harmony search, Simulation, 76, 2, pp. 60-68, (2001)