Microarray data classification based on ensemble independent component selection

被引:15
作者
Liu, Kun-Hong [1 ]
Li, Bo [2 ]
Wu, Qing-Qiang [1 ]
Zhang, Jun [3 ]
Du, Ji-Xiang [4 ]
Liu, Guo-Yan [5 ]
机构
[1] Xiamen Univ, Software Sch, Xiamen 361005, Fujian, Peoples R China
[2] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430081, Hubei, Peoples R China
[3] Anhui Univ, Sch Elect Sci & Technol, Hefei, Anhui, Peoples R China
[4] Huaqiao Univ, Dept Comp Sci & Technol, Quanzhou 362021, Fujian, Peoples R China
[5] Xiamen Univ, Digest Dis Res Inst, Gen Surg Affiliated Zhongshan Hosp, Xiamen 361004, Fujian, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
Microarray data classification; Independent component analysis; Ensemble component selection; Genetic algorithm; SUBSPACE;
D O I
10.1016/j.compbiomed.2009.07.006
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Independent component analysis (ICA) has been widely deployed to the analysis of microarray datasets. Although it was pointed out that after ICA transformation, different independent components (ICs) are of different biological significance, the IC selection problem is still far from fully explored. In this paper, we propose a genetic algorithm (GA) based ensemble independent component selection (EICS) system. In this system, GA is applied to select a set of optimal IC subsets, which are then used to build diverse and accurate base classifiers. Finally, all base classifiers are combined with majority vote rule. To show the validity of the proposed method, we apply it to classify three DNA microarray data sets involving various human normal and tumor tissue samples. The experimental results show that our ensemble method obtains stable and satisfying classification results when compared with several existing methods. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:953 / 960
页数:8
相关论文
共 33 条
[1]  
[Anonymous], 2004, COMBINING PATTERN CL, DOI DOI 10.1002/0471660264
[2]  
[Anonymous], Journal of machine learning research
[3]   Face recognition by independent component analysis [J].
Bartlett, MS ;
Movellan, JR ;
Sejnowski, TJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (06) :1450-1464
[4]   Gene selection and classification of microarray data using random forest -: art. no. 3 [J].
Díaz-Uriarte, R ;
de Andrés, SA .
BMC BIOINFORMATICS, 2006, 7 (1)
[5]  
Duin R.P. W., 2004, PRTools4, A Matlab Toolbox for Pattern Recognition
[6]   Improvements on cross-validation: The .632+ bootstrap method [J].
Efron, B ;
Tibshirani, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) :548-560
[7]   Feature selection in the independent component subspace for face recognition [J].
Ekenel, HK ;
Sankur, B .
PATTERN RECOGNITION LETTERS, 2004, 25 (12) :1377-1388
[8]   Independent component analysis reveals new and biologically significant structures in micro array data [J].
Frigyesi, Attila ;
Veerla, Srinivas ;
Lindgren, David ;
Hoglund, Mattias .
BMC BIOINFORMATICS, 2006, 7 (1)
[9]   The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming [J].
Hong, JH ;
Cho, SB .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2006, 36 (01) :43-58
[10]   Independent component analysis-based penalized discriminant method for tumor classification using gene expression data [J].
Huang, De-Shuang ;
Zheng, Chun-Hou .
BIOINFORMATICS, 2006, 22 (15) :1855-1862