Identification of Functional Modules by Integration of Multiple Data Sources Using a Bayesian Network Classifier

被引:3
|
作者
Wang, Jinlian [1 ]
Zuo, Yiming [1 ,2 ]
Liu, Lun [3 ]
Man, Yangao [4 ]
Tadesse, Mahlet G. [5 ]
Ressom, Habtom W. [1 ]
机构
[1] Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Arlington, VA USA
[3] Beijing Acad Agr & Forestry Sci, Beijing Res Ctr Informat Technol, Beijing, Peoples R China
[4] Henry Jackson Fdn, Diagnost & Translat Res Ctr, Gaithersburg, MD USA
[5] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
关键词
genomics; systems biology; models; statistical; computational biology; gene expression; genetics; protein interaction domains and motifs; PROTEIN-PROTEIN INTERACTIONS; GENE NETWORKS; DOMAIN-DOMAIN; PATHWAYS;
D O I
10.1161/CIRCGENETICS.113.000087
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background- Prediction of functional modules is indispensable for detecting protein deregulation in human complex diseases such as cancer. Bayesian network is one of the most commonly used models to integrate heterogeneous data from multiple sources such as protein domain, interactome, functional annotation, genome-wide gene expression, and the literature. Methods and Results- In this article, we present a Bayesian network classifier that is customized to (1) increase the ability to integrate diverse information from different sources, (2) effectively predict protein-protein interactions, (3) infer aberrant networks with scale-free and small-world properties, and (4) group molecules into functional modules or pathways based on the primary function and biological features. Application of this model in discovering protein biomarkers of hepatocellular carcinoma leads to the identification of functional modules that provide insights into the mechanism of the development and progression of hepatocellular carcinoma. These functional modules include cell cycle deregulation, increased angiogenesis (eg, vascular endothelial growth factor, blood vessel morphogenesis), oxidative metabolic alterations, and aberrant activation of signaling pathways involved in cellular proliferation, survival, and differentiation. Conclusions- The discoveries and conclusions derived from our customized Bayesian network classifier are consistent with previously published results. The proposed approach for determining Bayesian network structure facilitates the integration of heterogeneous data from multiple sources to elucidate the mechanisms of complex diseases.
引用
收藏
页码:206 / 217
页数:12
相关论文
共 50 条
  • [31] Data integration and network reconstruction with ∼omics data using Random Forest regression in potato
    Acharjee, Animesh
    Kloosterman, Bjorn
    de Vos, Ric C. H.
    Werij, Jeroen S.
    Bachem, Christian W. B.
    Visser, Richard G. F.
    Maliepaard, Chris
    ANALYTICA CHIMICA ACTA, 2011, 705 (1-2) : 56 - 63
  • [32] Comparing and combining data across multiple sources via integration of paired-sample data to correct for measurement error
    Huang, Yunda
    Huang, Ying
    Moodie, Zoe
    Li, Sue
    Self, Steve
    STATISTICS IN MEDICINE, 2012, 31 (28) : 3748 - 3759
  • [33] Bayesian Network Reconstruction Using Systems Genetics Data: Comparison of MCMC Methods
    Tasaki, Shinya
    Ben Sauerwine
    Hoff, Bruce
    Toyoshiba, Hiroyoshi
    Gaiteri, Chris
    Chaibub Neto, Elias
    GENETICS, 2015, 199 (04) : 973 - U128
  • [34] The Five-Gene-Network Data Analysis with Local Causal Discovery Algorithm Using Causal Bayesian Networks
    Yoo, Changwon
    Brilz, Erik M.
    CHALLENGES OF SYSTEMS BIOLOGY: COMMUNITY EFFORTS TO HARNESS BIOLOGICAL COMPLEXITY, 2009, 1158 : 93 - 101
  • [35] Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data
    Maity, Arnab Kumar
    Bhattacharya, Anirban
    Mallick, Bani K.
    Baladandayuthapani, Veerabhadran
    BIOMETRICS, 2020, 76 (01) : 316 - 325
  • [36] Identification of rifampin-regulated functional modules and related microRNAs in human hepatocytes based on the protein interaction network
    Li, Jin
    Wang, Ying
    Wang, Lei
    Dai, Xuefeng
    Cong, Wang
    Feng, Weixing
    Xu, Chengzhen
    Deng, Yulin
    Wang, Yue
    Skaar, Todd C.
    Liang, Hong
    Liu, Yunlong
    BMC GENOMICS, 2016, 17
  • [37] Estimating contact network properties by integrating multiple data sources associated with infectious diseases
    Goyal, Ravi
    Carnegie, Nicole
    Slipher, Sally
    Turk, Philip
    Little, Susan J.
    De Gruttola, Victor
    STATISTICS IN MEDICINE, 2023, 42 (20) : 3593 - 3615
  • [38] Prediction of Groundwater Quality Index and Identification of Key Variables Using Bayesian Neural Network
    Maiti, Saumen
    Gupta, Surabhi
    Gupta, Praveen Kumar
    WATER AIR AND SOIL POLLUTION, 2024, 235 (10)
  • [39] META-ANALYSIS OF FUNCTIONAL NEUROIMAGING DATA USING BAYESIAN NONPARAMETRIC BINARY REGRESSION
    Yue, Yu Ryan
    Lindquist, Martin A.
    Loh, Ji Meng
    ANNALS OF APPLIED STATISTICS, 2012, 6 (02) : 697 - 718
  • [40] Identification of Risk Pathways and Functional Modules for Coronary Artery Disease Based on Genome-wide SNP Data
    Zhao, Xiang
    Luan, Yi-Zhao
    Zuo, Xiaoyu
    Chen, Ye-Da
    Qin, Jiheng
    Jin, Lv
    Tan, Yiqing
    Lin, Meihua
    Zhang, Naizun
    Liang, Yan
    Rao, Shao-Qi
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2016, 14 (06) : 349 - 356