Gene Expression Data Analysis Using a Novel Approach to Biclustering Combining Discrete and Continuous Data

被引:7
|
作者
Christinat, Yann [1 ]
Wachmann, Bernd [2 ]
Zhang, Lei [2 ]
机构
[1] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, Lab Computat Biol & Bioinformat, CH-1015 Lausanne, Switzerland
[2] Siemens Corp Res, Princeton, NJ 08540 USA
关键词
Data mining; biclustering algorithm; gene expression data; discrete data; simultaneous clustering; microarray analysis;
D O I
10.1109/TCBB.2007.70251
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Many different methods exist for pattern detection in gene expression data. In contrast to classical methods, biclustering has the ability to cluster a group of genes together with a group of conditions (replicates, set of patients, or drug compounds). However, since the problem is NP-complex, most algorithms use heuristic search functions and, therefore, might converge toward local maxima. By using the results of biclustering on discrete data as a starting point for a local search function on continuous data, our algorithm avoids the problem of heuristic initialization. Similar to Order-Preserving Submatrices (OPSM), our algorithm aims to detect biclusters whose rows and columns can be ordered such that row values are growing across the bicluster's columns and vice versa. Results have been generated on the yeast genome (Saccharomyces cerevisiae), a human cancer data set, and random data. Results on the yeast genome showed that 89 percent of the 100 biggest nonoverlapping biclusters were enriched with Gene Ontology annotations. A comparison with the methods OPSM and Iterative Signature Algorithm (ISA, a generalization of singular value decomposition) demonstrated a better efficiency when using gene and condition orders. We present results on random and real data sets that show the ability of our algorithm to capture statistically significant and biologically relevant biclusters.
引用
收藏
页码:583 / 593
页数:11
相关论文
共 50 条
  • [1] Using the bagging approach for biclustering of gene expression data
    Hanczar, B.
    Nadif, M.
    NEUROCOMPUTING, 2011, 74 (10) : 1595 - 1605
  • [2] A Novel Approach for Biclustering Gene Expression Data Using Modular Singular Value Decomposition
    Aradhya, V. N. Manjunath
    Masulli, Francesco
    Rovetta, Stefano
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 254 - 265
  • [3] An evolutionary approach for biclustering of gene expression data
    Sheta, Walaa
    Hany, Maha
    Mahdi, Shereef
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2010, 2 (06) : 413 - 421
  • [4] On Biclustering of Gene Expression Data
    Mukhopadhyay, Anirban
    Maulik, Ujjwal
    Bandyopadhyay, Sanghamitra
    CURRENT BIOINFORMATICS, 2010, 5 (03) : 204 - 216
  • [5] On Biclustering of Gene Expression Data
    Mounir, Mahmoud
    Hamdy, Mohamed
    2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS), 2015, : 641 - 648
  • [6] Biclustering On Gene Expression Data
    Shruthi, M. P.
    2017 INTERNATIONAL CONFERENCE ON ALGORITHMS, METHODOLOGY, MODELS AND APPLICATIONS IN EMERGING TECHNOLOGIES (ICAMMAET), 2017,
  • [7] Randomized Algorithmic Approach for Biclustering of Gene Expression Data
    Nayak, Sradhanjali
    Mishra, Debahuti
    Das, Satyabrata
    Rath, Amiya Kumar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2010, 1 (06) : 80 - 86
  • [8] A comparative analysis of biclustering algorithms for gene expression data
    Eren, Kemal
    Deveci, Mehmet
    Kucuktunc, Onur
    Catalyurek, Umit V.
    BRIEFINGS IN BIOINFORMATICS, 2013, 14 (03) : 279 - 292
  • [9] Biclustering of gene expression data using genetic algorithm
    Chakraborty, A
    Maka, H
    PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 17 - 24
  • [10] Bayesian biclustering of gene expression data
    Jiajun Gu
    Jun S Liu
    BMC Genomics, 9