Identification of Functional Modules by Integration of Multiple Data Sources Using a Bayesian Network Classifier

Cited by: 3
Authors
Wang, Jinlian [1]
Zuo, Yiming [1, 2]
Liu, Lun [3]
Man, Yangao [4]
Tadesse, Mahlet G. [5]
Ressom, Habtom W. [1]
Affiliations
[1] Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Arlington, VA USA
[3] Beijing Acad Agr & Forestry Sci, Beijing Res Ctr Informat Technol, Beijing, Peoples R China
[4] Henry Jackson Fdn, Diagnost & Translat Res Ctr, Gaithersburg, MD USA
[5] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
Keywords
genomics; systems biology; models, statistical; computational biology; gene expression; genetics; protein interaction domains and motifs; PROTEIN-PROTEIN INTERACTIONS; GENE NETWORKS; DOMAIN-DOMAIN; PATHWAYS
DOI
10.1161/CIRCGENETICS.113.000087
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: Prediction of functional modules is indispensable for detecting protein deregulation in complex human diseases such as cancer. The Bayesian network is one of the most commonly used models for integrating heterogeneous data from multiple sources such as protein domains, the interactome, functional annotation, genome-wide gene expression, and the literature.
Methods and Results: In this article, we present a Bayesian network classifier that is customized to (1) increase the ability to integrate diverse information from different sources, (2) effectively predict protein-protein interactions, (3) infer aberrant networks with scale-free and small-world properties, and (4) group molecules into functional modules or pathways based on their primary function and biological features. Applying this model to the discovery of protein biomarkers of hepatocellular carcinoma identifies functional modules that provide insights into the mechanism of the development and progression of hepatocellular carcinoma. These functional modules include cell cycle deregulation, increased angiogenesis (eg, vascular endothelial growth factor, blood vessel morphogenesis), oxidative metabolic alterations, and aberrant activation of signaling pathways involved in cellular proliferation, survival, and differentiation.
Conclusions: The discoveries and conclusions derived from our customized Bayesian network classifier are consistent with previously published results. The proposed approach for determining Bayesian network structure facilitates the integration of heterogeneous data from multiple sources to elucidate the mechanisms of complex diseases.
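To make the idea of evidence integration concrete, the following is a minimal sketch of the simplest Bayesian network classifier, a naive Bayes model, scoring one protein pair from several binary evidence sources. It is not the customized classifier described in the abstract, whose structure is not given in this record. The function name, the evidence source names, and all probabilities below are hypothetical and chosen only for illustration.

```python
import math


def posterior_interaction(prior, likelihoods, evidence):
    """Posterior probability that a protein pair interacts (naive Bayes).

    prior:       P(interaction) before any evidence is seen.
    likelihoods: {source: (P(present | interaction), P(present | no interaction))}
    evidence:    {source: True/False}, the observed evidence for this pair.
    Sources are assumed conditionally independent given the class.
    """
    # Work in log-odds so each source contributes an additive term.
    log_odds = math.log(prior / (1.0 - prior))
    for source, present in evidence.items():
        p_pos, p_neg = likelihoods[source]
        if present:
            log_odds += math.log(p_pos / p_neg)
        else:
            log_odds += math.log((1.0 - p_pos) / (1.0 - p_neg))
    # Convert log-odds back to a probability.
    return 1.0 / (1.0 + math.exp(-log_odds))


# Hypothetical conditional probabilities, for illustration only.
LIKELIHOODS = {
    "shared_domain": (0.6, 0.1),
    "coexpression": (0.7, 0.3),
    "shared_annotation": (0.8, 0.2),
}

if __name__ == "__main__":
    observed = {"shared_domain": True, "coexpression": True, "shared_annotation": True}
    print(posterior_interaction(0.01, LIKELIHOODS, observed))
```

Each source multiplies the prior odds by its likelihood ratio, so agreeing evidence from independent sources raises the posterior well above what any single source supports. A full Bayesian network would additionally model dependencies between sources, which this sketch ignores.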
Pages: 206-217
Number of pages: 12