Gene Expression Data Analysis Using a Novel Approach to Biclustering Combining Discrete and Continuous Data

被引:7
|
作者
Christinat, Yann [1 ]
Wachmann, Bernd [2 ]
Zhang, Lei [2 ]
机构
[1] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, Lab Computat Biol & Bioinformat, CH-1015 Lausanne, Switzerland
[2] Siemens Corp Res, Princeton, NJ 08540 USA
关键词
Data mining; biclustering algorithm; gene expression data; discrete data; simultaneous clustering; microarray analysis;
D O I
10.1109/TCBB.2007.70251
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Many different methods exist for pattern detection in gene expression data. In contrast to classical methods, biclustering has the ability to cluster a group of genes together with a group of conditions (replicates, set of patients, or drug compounds). However, since the problem is NP-complex, most algorithms use heuristic search functions and, therefore, might converge toward local maxima. By using the results of biclustering on discrete data as a starting point for a local search function on continuous data, our algorithm avoids the problem of heuristic initialization. Similar to Order-Preserving Submatrices (OPSM), our algorithm aims to detect biclusters whose rows and columns can be ordered such that row values are growing across the bicluster's columns and vice versa. Results have been generated on the yeast genome (Saccharomyces cerevisiae), a human cancer data set, and random data. Results on the yeast genome showed that 89 percent of the 100 biggest nonoverlapping biclusters were enriched with Gene Ontology annotations. A comparison with the methods OPSM and Iterative Signature Algorithm (ISA, a generalization of singular value decomposition) demonstrated a better efficiency when using gene and condition orders. We present results on random and real data sets that show the ability of our algorithm to capture statistically significant and biologically relevant biclusters.
引用
收藏
页码:583 / 593
页数:11
相关论文
共 50 条
  • [31] An improved biclustering algorithm for gene expression data
    Jin, Sheng-Hua
    Hua, Li
    Open Cybernetics and Systemics Journal, 2014, 8 (01): : 1141 - 1144
  • [32] Biclustering of gene expression data by simulated annealing
    Chakraborty, Anupam
    EIGHTH INTERNATIONAL CONFERENCE ON HIGH-PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION, PROCEEDINGS, 2005, : 627 - 632
  • [33] On Evolutionary Algorithms for Biclustering of Gene Expression Data
    Carballido Jessica, A.
    Gallo Cristian, A.
    Dussaut Julieta, S.
    Ignacio, Ponzoni
    CURRENT BIOINFORMATICS, 2015, 10 (03) : 259 - 267
  • [34] Evolutionary fuzzy biclustering of gene expression data
    Mitra, Sushmita
    Banka, Haider
    Paik, Jiaul Hoque
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2007, 4481 : 284 - +
  • [35] Biclustering of Gene Expression Data Using Cuckoo Search and Genetic Algorithm
    Yin, Lu
    Qiu, Junlin
    Gao, Shangbing
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (11)
  • [36] Efficient Biclustering Algorithms for Time Series Gene Expression Data Analysis
    Madeira, Sara C.
    Oliveira, Arlindo L.
    DISTRIBUTED COMPUTING, ARTIFICIAL INTELLIGENCE, BIOINFORMATICS, SOFT COMPUTING, AND AMBIENT ASSISTED LIVING, PT II, PROCEEDINGS, 2009, 5518 : 1013 - 1019
  • [37] Exact biclustering algorithm for the analysis of large gene expression data sets
    Oliver Voggenreiter
    Stefan Bleuler
    Wilhelm Gruissem
    BMC Bioinformatics, 13 (Suppl 18)
  • [38] Pattern-Based Biclustering with Constraints for Gene Expression Data Analysis
    Henriques, Rui
    Madeira, Sara C.
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 326 - 339
  • [39] RN+: A Novel Biclustering Algorithm for Analysis of Gene Expression Data Using Protein-Protein Interaction Network
    Ahn, Jaegyoon
    Choi, Junhyeok
    Kim, Harrim
    Kim, Jibum
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (05) : 432 - 441
  • [40] Biclustering of gene expression data using EDA-GA hybrid
    Liu, Feng
    Zhou, Huaibei
    Liu, Juan
    He, Guoliang
    2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 1583 - +