A graphical model method for integrating multiple sources of genome-scale data

被引:5
|
作者
Dvorkin, Daniel [1 ]
Biehs, Brian [2 ,3 ]
Kechris, Katerina [1 ,4 ]
机构
[1] Univ Colorado, Sch Med, Computat Biosci Program, Aurora, CO 80045 USA
[2] Univ Calif San Francisco, Cardiovasc Res Inst, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Dept Biochem & Biophys, San Francisco, CA 94143 USA
[4] Colorado Sch Publ Hlth, Dept Biostat & Informat, Aurora, CO 80045 USA
关键词
data integration; genomics; graphical models; mixture models; GENE-EXPRESSION DATA; MIXTURE-MODELS; DNA-BINDING; CHIP-CHIP; DISCOVERY; IDENTIFICATION; TRANSCRIPTION; TARGETS; SAMPLES; DORSAL;
D O I
10.1515/sagmb-2012-0051
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Making effective use of multiple data sources is a major challenge in modern bioinformatics. Genome-wide data such as measures of transcription factor binding, gene expression, and sequence conservation, which are used to identify binding regions and genes that are important to major biological processes such as development and disease, can be difficult to use together due to the different biological meanings and statistical distributions of the heterogeneous data types, but each can provide valuable information for understanding the processes under study. Here we present methods for integrating multiple data sources to gain a more complete picture of gene regulation and expression. Our goal is to identify genes and cis-regulatory regions which play specific biological roles. We describe a graphical mixture model approach for data integration, examine the effect of using different model topologies, and discuss methods for evaluating the effectiveness of the models. Model fitting is computationally efficient and produces results which have clear biological and statistical interpretations. The Hedgehog and Dorsal signaling pathways in Drosophila, which are critical in embryonic development, are used as examples.
引用
收藏
页码:469 / 487
页数:19
相关论文
共 50 条
  • [1] Integrating genome-scale data for gene essentiality prediction
    Roberts, Seth B.
    Mazurie, Aurelien L.
    Buck, Gregory A.
    CHEMISTRY & BIODIVERSITY, 2007, 4 (11) : 2618 - 2630
  • [2] Genome-scale protein function prediction in yeast Saccharomyces cerevisiae through integrating multiple sources of high-throughput data
    Chen, Y
    Xu, D
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, 2005, : 471 - 482
  • [3] Comparative analysis of multiple genome-scale data sets
    Werner-Washburne, M
    Wylie, B
    Boyack, K
    Fuge, E
    Galbraith, J
    Weber, J
    Davidson, G
    GENOME RESEARCH, 2002, 12 (10) : 1564 - 1573
  • [4] Integrating quantitative proteomics and metabolomics with a genome-scale metabolic network model
    Yizhak, Keren
    Benyamini, Tomer
    Liebermeister, Wolfram
    Ruppin, Eytan
    Shlomi, Tomer
    BIOINFORMATICS, 2010, 26 (12) : i255 - i260
  • [5] Integrating gene expression data into a genome-scale metabolic model to identify reprogramming during adaptive evolution
    Yazdanpanah, Shaghayegh
    Motamedian, Ehsan
    Shojaosadati, Seyed Abbas
    PLOS ONE, 2023, 18 (10):
  • [6] Integrating -omics data into genome-scale metabolic network models: principles and challenges
    Ramon, Charlotte
    Gollub, Mattia G.
    Stelling, Jorg
    SYSTEMS BIOLOGY, 2018, 62 (04): : 563 - 574
  • [7] TCLUST: A Fast Method for Clustering Genome-Scale Expression Data
    Dost, Banu
    Wu, Chunlei
    Su, Andrew
    Bafna, Vineet
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 808 - 818
  • [8] Integrating transcriptional activity in genome-scale models of metabolism
    Banos, Daniel Trejo
    Trebulle, Pauline
    Elati, Mohamed
    BMC SYSTEMS BIOLOGY, 2017, 11
  • [9] Integrating Omics Data in Genome-Scale Metabolic Modeling: A Methodological Perspective for Precision Medicine
    Sen, Partho
    Oresic, Matej
    METABOLITES, 2023, 13 (07)
  • [10] PROBABILISTIC METHOD FOR INTEGRATING MULTIPLE SOURCES OF RANGE DATA
    BOVE, VM
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1990, 7 (12): : 2193 - 2198