Microarray data classification based on ensemble independent component selection

被引:15
作者
Liu, Kun-Hong [1 ]
Li, Bo [2 ]
Wu, Qing-Qiang [1 ]
Zhang, Jun [3 ]
Du, Ji-Xiang [4 ]
Liu, Guo-Yan [5 ]
机构
[1] Xiamen Univ, Software Sch, Xiamen 361005, Fujian, Peoples R China
[2] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430081, Hubei, Peoples R China
[3] Anhui Univ, Sch Elect Sci & Technol, Hefei, Anhui, Peoples R China
[4] Huaqiao Univ, Dept Comp Sci & Technol, Quanzhou 362021, Fujian, Peoples R China
[5] Xiamen Univ, Digest Dis Res Inst, Gen Surg Affiliated Zhongshan Hosp, Xiamen 361004, Fujian, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
Microarray data classification; Independent component analysis; Ensemble component selection; Genetic algorithm; SUBSPACE;
D O I
10.1016/j.compbiomed.2009.07.006
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Independent component analysis (ICA) has been widely deployed to the analysis of microarray datasets. Although it was pointed out that after ICA transformation, different independent components (ICs) are of different biological significance, the IC selection problem is still far from fully explored. In this paper, we propose a genetic algorithm (GA) based ensemble independent component selection (EICS) system. In this system, GA is applied to select a set of optimal IC subsets, which are then used to build diverse and accurate base classifiers. Finally, all base classifiers are combined with majority vote rule. To show the validity of the proposed method, we apply it to classify three DNA microarray data sets involving various human normal and tumor tissue samples. The experimental results show that our ensemble method obtains stable and satisfying classification results when compared with several existing methods. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:953 / 960
页数:8
相关论文
共 33 条
[11]  
HUANG DS, 2008, BIOINFORMATICS, DOI DOI 10.1093/BIOINFORMATICS/BTN644
[12]   Fast and robust fixed-point algorithms for independent component analysis [J].
Hyvärinen, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (03) :626-634
[13]   Oligonucleotide microarray for prediction of early intrahepatic recurrence of hepatocellular carcinoma after curative resection [J].
Iizuka, N ;
Oka, M ;
Yamada-Okabe, H ;
Nishida, M ;
Maeda, Y ;
Mori, N ;
Takao, T ;
Tamesa, T ;
Tangoku, A ;
Tabuchi, H ;
Hamada, K ;
Nakayama, H ;
Ishitsuka, H ;
Miyamoto, T ;
Hirabayashi, A ;
Uchimura, S ;
Hamamoto, Y .
LANCET, 2003, 361 (9361) :923-929
[14]  
Kuncheva LI, 2000, IEEE T EVOLUT COMPUT, V4, P327, DOI 10.1109/4235.887233
[15]   Application of independent component analysis to microarrays [J].
Lee, SI ;
Batzoglou, S .
GENOME BIOLOGY, 2003, 4 (11)
[16]   An ensemble method for gene discovery based on DNA microarray data [J].
Li, X ;
Rao, SQ ;
Zhang, TW ;
Guo, Z ;
Zhang, QP ;
Moser, KL ;
Topol, EJ .
SCIENCE IN CHINA SERIES C-LIFE SCIENCES, 2004, 47 (05) :396-405
[17]   Linear modes of gene expression determined by independent component analysis [J].
Liebermeister, W .
BIOINFORMATICS, 2002, 18 (01) :51-60
[18]   Cancer classification using Rotation Forest [J].
Liu, Kun-Hong ;
Huang, De-Shuang .
COMPUTERS IN BIOLOGY AND MEDICINE, 2008, 38 (05) :601-610
[19]  
Margineantu DD, 1997, P 14 INT C MACH LEAR, P211
[20]  
Oliveira LS, 2006, STUD COMP INTELL, V16, P49