Identification of Functional Modules by Integration of Multiple Data Sources Using a Bayesian Network Classifier

被引:3
|
作者
Wang, Jinlian [1 ]
Zuo, Yiming [1 ,2 ]
Liu, Lun [3 ]
Man, Yangao [4 ]
Tadesse, Mahlet G. [5 ]
Ressom, Habtom W. [1 ]
机构
[1] Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Arlington, VA USA
[3] Beijing Acad Agr & Forestry Sci, Beijing Res Ctr Informat Technol, Beijing, Peoples R China
[4] Henry Jackson Fdn, Diagnost & Translat Res Ctr, Gaithersburg, MD USA
[5] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
关键词
genomics; systems biology; models; statistical; computational biology; gene expression; genetics; protein interaction domains and motifs; PROTEIN-PROTEIN INTERACTIONS; GENE NETWORKS; DOMAIN-DOMAIN; PATHWAYS;
D O I
10.1161/CIRCGENETICS.113.000087
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background- Prediction of functional modules is indispensable for detecting protein deregulation in human complex diseases such as cancer. Bayesian network is one of the most commonly used models to integrate heterogeneous data from multiple sources such as protein domain, interactome, functional annotation, genome-wide gene expression, and the literature. Methods and Results- In this article, we present a Bayesian network classifier that is customized to (1) increase the ability to integrate diverse information from different sources, (2) effectively predict protein-protein interactions, (3) infer aberrant networks with scale-free and small-world properties, and (4) group molecules into functional modules or pathways based on the primary function and biological features. Application of this model in discovering protein biomarkers of hepatocellular carcinoma leads to the identification of functional modules that provide insights into the mechanism of the development and progression of hepatocellular carcinoma. These functional modules include cell cycle deregulation, increased angiogenesis (eg, vascular endothelial growth factor, blood vessel morphogenesis), oxidative metabolic alterations, and aberrant activation of signaling pathways involved in cellular proliferation, survival, and differentiation. Conclusions- The discoveries and conclusions derived from our customized Bayesian network classifier are consistent with previously published results. The proposed approach for determining Bayesian network structure facilitates the integration of heterogeneous data from multiple sources to elucidate the mechanisms of complex diseases.
引用
收藏
页码:206 / 217
页数:12
相关论文
共 50 条
  • [1] Integration of Multiple Data Sources for Identifying Functional Modules Using Bayesian Network
    Wang, Jinlian
    Yuan, Hongyan
    Tadesse, Mahlet G.
    Ressom, Habtom W.
    2012 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS), 2012, : 13 - 17
  • [2] Systematic Identification of Functional Plant Modules through the Integration of Complementary Data Sources
    Heyndrickx, Ken S.
    Vandepoele, Klaas
    PLANT PHYSIOLOGY, 2012, 159 (03) : 884 - 901
  • [3] Integration of Multiple Data Sources for Gene Network Inference Using Genetic Perturbation Data
    Liang, Xiao
    Young, William Chad
    Hung, Ling-Hong
    Raftery, Adrian E.
    Yeung, Ka Yee
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (10) : 1113 - 1129
  • [4] Integration of Multiple Data Sources for Gene Network Inference using Genetic Perturbation Data
    Liang, Xiao
    Young, William Chad
    Hung, Ling-Hong
    Raftery, Adrian E.
    Yeung, Ka Yee
    ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 601 - 602
  • [5] Bayesian Calibration of a Stochastic Kinetic Computer Model Using Multiple Data Sources
    Henderson, D. A.
    Boys, R. J.
    Wilkinson, D. J.
    BIOMETRICS, 2010, 66 (01) : 249 - 256
  • [6] Bayesian Demographic Accounts: Subnational Population Estimation Using Multiple Data Sources
    Bryant, John R.
    Graham, Patrick J.
    BAYESIAN ANALYSIS, 2013, 8 (03): : 591 - 622
  • [7] Multilevel functional genomics data integration as a tool for understanding physiology: a network biology perspective
    Davidsen, Peter K.
    Turan, Nil
    Egginton, Stuart
    Falciani, Francesco
    JOURNAL OF APPLIED PHYSIOLOGY, 2016, 120 (03) : 297 - 309
  • [8] Integration of Epigenetic Data in Bayesian Network Modeling of Gene Regulatory Network
    Zheng, Jie
    Chaturvedi, Iti
    Rajapakse, Jagath C.
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 87 - 96
  • [9] A graph-based integrative method of detecting consistent protein functional modules from multiple data sources
    Zhang, Yuan
    Cheng, Yue
    Ge, Liang
    Du, Nan
    Jia, Kebin
    Zhang, Aidong
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (02) : 122 - 140
  • [10] Identification of functional modules that correlate with phenotypic difference: the influence of network topology
    Hung, Jui-Hung
    Whitfield, Troy W.
    Yang, Tun-Hsiang
    Hu, Zhenjun
    Weng, Zhiping
    DeLisi, Charles
    GENOME BIOLOGY, 2010, 11 (02):