Sampling-based Subnetwork Identification from Microarray Data and Protein-protein Interaction Network

被引:1
作者
Wang, Xiao [1 ]
Gu, Jinghua [1 ]
Xuan, Jianhua [1 ]
Chen, Li [2 ]
Shajahan, Ayesha N. [3 ,4 ]
Clarke, Robert [3 ,4 ]
机构
[1] Virginia Tech, Bradley Dept Elect & Comp Engn, Arlington, VA 22203 USA
[2] Johns Hopkins Med Inst, Dept Pathol, Baltimore, MD 21231 USA
[3] Georgetown Univ, Dept Physiol, Washington, DC 20057 USA
[4] Georgetown Univ, Dept Biophys, Washington, DC 20057 USA
来源
2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2 | 2012年
基金
美国国家卫生研究院;
关键词
Gene expression; Protein-protein interaction (PPI); Markov random field (MRF); Markov Chain Monte Carlo (MCMC); Subnetwork identification; Breast cancer; BREAST-CANCER;
D O I
10.1109/ICMLA.2012.221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of condition-specific protein interaction subnetworks has emerged as an attractive research field to reveal molecular mechanisms of diseases and provide reliable network biomarkers for disease diagnosis. Several methods have been proposed, which integrate gene expression and protein-protein interaction (PPI) data to identify subnetworks. However, existing methods treat differential expression of genes and network topology independently, which is an oversimplified assumption to model real biological systems. In this paper, we propose a sampling-based subnetwork identification approach to take into account the dependency between gene expression and network topology. Specifically, we apply Markov random field (MRF) theory to model the dependency of genes in PPI network using a Bayesian framework, followed by a Markov Chain Monte Carlo (MCMC) approach to identify significant subnetworks. The MCMC approach estimates the posterior distribution of genes' significant scores and network structure iteratively. Experimental results on both synthetic data and real breast cancer data demonstrated the effectiveness of the proposed method in identifying subnetworks, especially several functionally important, aberrant subnetworks associated with pathways involved in the development and recurrence of breast cancer.
引用
收藏
页码:158 / 163
页数:6
相关论文
共 23 条
  • [1] [Anonymous], 1996, Tools for Statistical Inference
  • [2] [Anonymous], 2011, THESIS
  • [3] Chen L, 2012, BIOCOMPUT-PAC SYM, P31
  • [4] UNDERSTANDING THE METROPOLIS-HASTINGS ALGORITHM
    CHIB, S
    GREENBERG, E
    [J]. AMERICAN STATISTICIAN, 1995, 49 (04) : 327 - 335
  • [5] Network-based classification of breast cancer metastasis
    Chuang, Han-Yu
    Lee, Eunjung
    Liu, Yu-Tsueng
    Lee, Doheon
    Ideker, Trey
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
  • [6] Identifying functional modules in protein-protein interaction networks:: an integrated exact approach
    Dittrich, Marcus T.
    Klau, Gunnar W.
    Rosenwald, Andreas
    Dandekar, Thomas
    Mueller, Tobias
    [J]. BIOINFORMATICS, 2008, 24 (13) : I223 - I231
  • [7] Methods for approximating integrals in statistics with special emphasis on Bayesian integration problems
    Evans, M
    Swartz, T
    [J]. STATISTICAL SCIENCE, 1995, 10 (03) : 254 - 272
  • [8] Crosstalk Between IGF1R and Estrogen Receptor Signaling in Breast Cancer
    Fagan, Dedra H.
    Yee, Douglas
    [J]. JOURNAL OF MAMMARY GLAND BIOLOGY AND NEOPLASIA, 2008, 13 (04) : 423 - 429
  • [9] STOCHASTIC RELAXATION, GIBBS DISTRIBUTIONS, AND THE BAYESIAN RESTORATION OF IMAGES
    GEMAN, S
    GEMAN, D
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (06) : 721 - 741
  • [10] Grb2 signaling in cell motility and cancer
    Giubellino, Alessio
    Burke, Terrence R., Jr.
    Bottaro, Donald P.
    [J]. EXPERT OPINION ON THERAPEUTIC TARGETS, 2008, 12 (08) : 1021 - 1033