Parallelized Evolutionary Learning for Detection of Biclusters in Gene Expression Data

被引:43
作者
Huang, Qinghua [1 ]
Tao, Dacheng [2 ]
Li, Xuelong [3 ]
Liew, Alan Wee-Chung [4 ]
机构
[1] S China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Guangdong, Peoples R China
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW 2007, Australia
[3] Chinese Acad Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, State Key Lab Transient Opt & Photon, Xian Inst Opt & Precis Mech, Xian 710119, Peoples R China
[4] Griffith Univ, Sch Informat & Commun Technol, Gold Coast, Qld 4222, Australia
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Biclustering; genetic learning; subdimensional search strategy; gene expression data analysis; MICROARRAY DATA; CLUSTERING ANALYSIS;
D O I
10.1109/TCBB.2011.53
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The analysis of gene expression data obtained from microarray experiments is important for discovering the biological process of genes. Biclustering algorithms have been proven to be able to group the genes with similar expression patterns under a number of experimental conditions. In this paper, we propose a new biclustering algorithm based on evolutionary learning. By converting the biclustering problem into a common clustering problem, the algorithm can be applied in a search space constructed by the conditions. To further reduce the size of the search space, we randomly separate the full conditions into a number of condition subsets (subspaces), each of which has a smaller number of conditions. The algorithm is applied to each subspace and is able to discover bicluster seeds within a limited computing time. Finally, an expanding and merging procedure is employed to combine the bicluster seeds into larger biclusters according to a homogeneity criterion. We test the performance of the proposed algorithm using synthetic and real microarray data sets. Compared with several previously developed biclustering algorithms, our algorithm demonstrates a significant improvement in discovering additive biclusters.
引用
收藏
页码:560 / 570
页数:11
相关论文
共 36 条
  • [1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [2] [Anonymous], 2002, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, DOI DOI 10.1145/564691.564737
  • [3] BenDor A., 2002, Proceedings of the sixth annual international conference on computational biology, P49, DOI [10.1145/565196.565203, DOI 10.1145/565196.565203]
  • [4] Characterizing gene sets with FuncAssociate
    Berriz, GF
    King, OD
    Bryant, B
    Sander, C
    Roth, FP
    [J]. BIOINFORMATICS, 2003, 19 (18) : 2502 - 2504
  • [5] An EA framework for biclustering of gene expression data
    Bleuler, S
    Preli, A
    Zitzler, E
    [J]. CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, : 166 - 173
  • [6] Application of simulated annealing to the biclustering of gene expression data
    Bryan, Kenneth
    Cunningham, Padraig
    Bolshakova, Nadia
    [J]. IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2006, 10 (03): : 519 - 525
  • [7] Cheng Y, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P93
  • [8] Biclustering of expression data with evolutionary computation
    Divina, F
    Aguilar-Ruiz, JS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (05) : 590 - 602
  • [9] Genomic expression programs in the response of yeast cells to environmental changes
    Gasch, AP
    Spellman, PT
    Kao, CM
    Carmel-Harel, O
    Eisen, MB
    Storz, G
    Botstein, D
    Brown, PO
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2000, 11 (12) : 4241 - 4257
  • [10] Coupled two-way clustering analysis of gene microarray data
    Getz, G
    Levine, E
    Domany, E
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) : 12079 - 12084