A HIERARCHICAL BAYESIAN MODEL FOR INFERENCE OF COPY NUMBER VARIANTS AND THEIR ASSOCIATION TO GENE EXPRESSION

被引:10
|
作者
Cassese, Alberto [1 ]
Guindani, Michele [2 ]
Tadesse, Mahlet G. [3 ]
Falciani, Francesco [4 ]
Vannucci, Marina [1 ]
机构
[1] Rice Univ, Dept Stat, Houston, TX 77005 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[3] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
[4] Univ Liverpool, Inst Integrat Biol, Ctr Computat Biol & Modelling CCMB, Liverpool L69 3BX, Merseyside, England
来源
ANNALS OF APPLIED STATISTICS | 2014年 / 8卷 / 01期
基金
美国国家科学基金会;
关键词
Bayesian hierarchical models; comparative genomic hybridization arrays; gene expression; hidden Markov models; measurement error; variable selection; HIDDEN-MARKOV MODEL; VARIABLE SELECTION; MIXTURE MODEL; SIGNATURES; ARGININE; DETECT;
D O I
10.1214/13-AOAS705
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A number of statistical models have been successfully developed for the analysis of high-throughput data from a single source, but few methods are available for integrating data from different sources. Here we focus on integrating gene expression levels with comparative genomic hybridization (CGH) array measurements collected on the same subjects. We specify a measurement error model that relates the gene expression levels to latent copy number states which, in turn, are related to the observed surrogate CGH measurements via a hidden Markov model. We employ selection priors that exploit the dependencies across adjacent copy number states and investigate MCMC stochastic search techniques for posterior inference. Our approach results in a unified modeling framework for simultaneously inferring copy number variants (CNV) and identifying their significant associations with mRNA transcripts abundance. We show performance on simulated data and illustrate an application to data from a genomic study on human cancer cell lines.
引用
收藏
页码:148 / 175
页数:28
相关论文
共 50 条
  • [1] Copy number variants, diseases and gene expression
    Henrichsen, Charlotte N.
    Chaignat, Evelyne
    Reymond, Alexandre
    HUMAN MOLECULAR GENETICS, 2009, 18 : R1 - R8
  • [2] Estimating the effects of copy-number variants on intelligence using hierarchical Bayesian models
    Jiang, Lai
    Huguet, Guillaume
    Schramm, Catherine
    Ciampi, Antonio
    Main, Antoine
    Passo, Claudine
    Jean-Louis, Martineau
    Auger, Maude
    Schumann, Gunter
    Porteous, David
    Jacquemont, Sebastien
    Greenwood, Celia M. T.
    GENETIC EPIDEMIOLOGY, 2020, 44 (08) : 825 - 840
  • [3] Estimating the effects of copy number variants on intelligence quotient using hierarchical Bayesian models
    Jiang, Lai
    Huguet, Guillaume
    Schramm, Catherine
    Jacquemont, Sebastien
    Paus, Tomas
    Schumann, Gunter
    Conrod, Patricia
    Pausova, Zdenka
    Greenwood, Celia M. T.
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 707 - 708
  • [4] Fast Bayesian Inference of Copy Number Variants using Hidden Markov Models with Wavelet Compression
    Wiedenhoeft, John
    Brugel, Eric
    Schliep, Alexander
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (05)
  • [5] Copy number variants associated with epilepsy from gene expression microarrays
    Wang, Dong
    Li, Xia
    Jia, Shanshan
    Wang, Yan
    Wang, Zhijing
    Song, Xixiao
    Liu, Liang
    JOURNAL OF CLINICAL NEUROSCIENCE, 2015, 22 (12) : 1907 - 1910
  • [6] GENERAL BAYESIAN MODEL FOR HIERARCHICAL INFERENCE
    KELLY, CW
    BARCLAY, S
    ORGANIZATIONAL BEHAVIOR AND HUMAN PERFORMANCE, 1973, 10 (03): : 388 - 403
  • [7] Bayesian inference for intratumour heterogeneity in mutations and copy number variation
    Lee, Juhee
    Muller, Peter
    Sengupta, Subhajit
    Gulukota, Kamalakar
    Ji, Yuan
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2016, 65 (04) : 547 - 563
  • [8] Hierarchical Bayesian model for analysis of gene expression data
    Rekaya, R.
    Zhang, W.
    JOURNAL OF DAIRY SCIENCE, 2005, 88 : 104 - 105
  • [9] Hierarchical Bayesian model for analysis of gene expression data
    Rekaya, R.
    Zhang, W.
    JOURNAL OF ANIMAL SCIENCE, 2005, 83 : 104 - 105
  • [10] Copy Number Variants in the Kallikrein Gene Cluster
    Lindahl, Pernilla
    Sall, Torbjorn
    Bjartell, Anders
    Johansson, Anna M.
    Lilja, Hans
    Hallden, Christer
    PLOS ONE, 2013, 8 (07):