Joint estimation of multiple dependent Gaussian graphical models with applications to mouse genomics

被引:20
作者
Xie, Yuying [1 ]
Liu, Yufeng [2 ]
Valdar, William [3 ]
机构
[1] Michigan State Univ, Dept Computat Math Sci & Engn, E Lansing, MI 48824 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[3] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
EM algorithm; Gaussian graphical model; Mouse genomics; Shrinkage; Sparsity; Variable selection; INVERSE COVARIANCE ESTIMATION; MAXIMUM-LIKELIHOOD; VARIABLE SELECTION; MATRIX ESTIMATION; SPARSE; LASSO; OBESITY;
D O I
10.1093/biomet/asw035
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gaussian graphical models are widely used to represent conditional dependencies among random variables. In this paper, we propose a novel estimator for data arising from a group of Gaussian graphical models that are themselves dependent. A motivating example is that of modelling gene expression collected on multiple tissues from the same individual: here the multivariate outcome is affected by dependencies acting not only at the level of the specific tissues, but also at the level of the whole body; existing methods that assume independence among graphs are not applicable in this case. To estimate multiple dependent graphs, we decompose the problem into two graphical layers: the systemic layer, which affects all outcomes and thereby induces cross-graph dependence, and the category-specific layer, which represents graph-specific variation. We propose a graphical EM technique that estimates both layers jointly, establish estimation consistency and selection sparsistency of the proposed estimator, and confirm by simulation that the EM method is superior to a simpler one-step method. We apply our technique to mouse genomics data and obtain biologically plausible results.
引用
收藏
页码:493 / 511
页数:19
相关论文
共 32 条
  • [1] Inferring sparse Gaussian graphical models with latent structure
    Ambroise, Christophe
    Chiquet, Julien
    Matias, Catherine
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2009, 3 : 205 - 238
  • [2] [Anonymous], ADV NEURAL INFORM PR
  • [3] Banerjee O, 2008, J MACH LEARN RES, V9, P485
  • [4] Adaptive Thresholding for Sparse Covariance Matrix Estimation
    Cai, Tony
    Liu, Weidong
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) : 672 - 684
  • [5] Extended Bayesian information criteria for model selection with large model spaces
    Chen, Jiahua
    Chen, Zehua
    [J]. BIOMETRIKA, 2008, 95 (03) : 759 - 771
  • [6] Analyses of allele-specific gene expression in highly divergent mouse crosses identifies pervasive allelic imbalance
    Crowley, James J.
    Zhabotynsky, Vasyl
    Sun, Wei
    Huang, Shunping
    Pakatci, Isa Kemal
    Kim, Yunjung
    Wang, Jeremy R.
    Morgan, Andrew P.
    Calaway, John D.
    Aylor, David L.
    Yun, Zaining
    Bell, Timothy A.
    Buus, Ryan J.
    Calaway, Mark E.
    Didion, John P.
    Gooch, Terry J.
    Hansen, Stephanie D.
    Robinson, Nashiya N.
    Shaw, Ginger D.
    Spence, Jason S.
    Quackenbush, Corey R.
    Barrick, Cordelia J.
    Nonneman, Randal J.
    Kim, Kyungsu
    Xenakis, James
    Xie, Yuying
    Valdar, William
    Lenarcic, Alan B.
    Wang, Wei
    Welsh, Catherine E.
    Fu, Chen-Ping
    Zhang, Zhaojun
    Holt, James
    Guo, Zhishan
    Threadgill, David W.
    Tarantino, Lisa M.
    Miller, Darla R.
    Zou, Fei
    McMillan, Leonard
    Sullivan, Patrick F.
    de Villena, Fernando Pardo-Manuel
    [J]. NATURE GENETICS, 2015, 47 (04) : 353 - U102
  • [7] First-order methods for sparse covariance selection
    D'Aspremont, Alexandre
    Banerjee, Onureena
    El Ghaoui, Laurent
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2008, 30 (01) : 56 - 66
  • [8] The joint graphical lasso for inverse covariance estimation across multiple classes
    Danaher, Patrick
    Wang, Pei
    Witten, Daniela M.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (02) : 373 - 397
  • [9] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [10] Multi-tissue coexpression networks reveal unexpected subnetworks associated with disease
    Dobrin, Radu
    Zhu, Jun
    Molony, Cliona
    Argman, Carmen
    Parrish, Mark L.
    Carlson, Sonia
    Allan, Mark F.
    Pomp, Daniel
    Schadt, Eric E.
    [J]. GENOME BIOLOGY, 2009, 10 (05):