SCALABLE MULTIPLE NETWORK INFERENCE WITH THE JOINT GRAPHICAL HORSESHOE

被引:3
作者
Lingjaerde, Camilla [1 ]
Fairfax, Benjamin P. [2 ]
Richardson, Sylvia [1 ]
Ruffieux, Helene [1 ]
机构
[1] Univ Cambridge, MRC Biostat Unit, Cambridge, England
[2] Univ Oxford, MRC Weatherall Inst Mol Med, Dept Oncol, Oxford, England
基金
英国医学研究理事会;
关键词
Bayesian graphical models; cancer genomics; expectation conditional maximisation; gene networks; genomics; graphical horseshoe; horseshoe prior; high-dimensional inference; integrative analysis; multiomics; network models; INVERSE COVARIANCE ESTIMATION; MAXIMUM-LIKELIHOOD; VARIABLE-SELECTION; GENE-EXPRESSION; LASSO; ESTIMATOR; AIRE; ARYLFORMAMIDASE; HOTSPOTS;
D O I
10.1214/23-AOAS1863
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Network models are useful tools for modelling complex associations. In statistical omics such models are increasingly popular for identifying and assessing functional relationships and pathways. If a Gaussian graphical model is assumed, conditional independence is determined by the nonzero entries of the inverse covariance (precision) matrix of the data. The Bayesian graphical horseshoe estimator provides a robust and flexible framework for precision matrix inference, as it introduces local, edge-specific parameters which prevent over-shrinkage of nonzero off-diagonal elements. However, its applicability is currently limited in statistical omics settings, which often involve high-dimensional data from multiple conditions that might share common structures. We propose: (i) a scalable expectation conditional maximisation (ECM) algorithm for the original graphical horseshoe and (ii) a novel joint graphical horseshoe estimator, which borrows information across multiple related networks to improve estimation. We show numerically that our single-network ECM approach is more scalable than the existing graphical horseshoe Gibbs implementation, while achieving the same level of accuracy. We also show that our joint-network proposal successfully leverages shared edge-specific information between networks while still retaining differences, outperforming state-of-the-art methods at any level of network similarity. Finally, we leverage our approach to clarify gene regulation activity within and across immune stimulation conditions in monocytes, and formulate hypotheses on the pathogenesis of immune-mediated diseases.
引用
收藏
页码:1899 / 1923
页数:25
相关论文
共 62 条
[1]  
Akaike H., 1973, International Symposium on Information Theory, P267, DOI [DOI 10.1007/978-1-4612-0919-5_38, 10.1007/978-1-4612-0919-5_38]
[2]   The role of AIRE in human autoimmune disease [J].
Akirav, Eitan M. ;
Ruddle, Nancy H. ;
Herold, Kevan C. .
NATURE REVIEWS ENDOCRINOLOGY, 2011, 7 (01) :25-33
[3]   PHD fingers in human diseases: Disorders arising from misinterpreting epigenetic marks [J].
Baker, Lindsey A. ;
Allis, C. David ;
Wang, Gang G. .
MUTATION RESEARCH-FUNDAMENTAL AND MOLECULAR MECHANISMS OF MUTAGENESIS, 2008, 647 (1-2) :3-12
[4]   Lasso Meets Horseshoe: A Survey [J].
Bhadra, Anindya ;
Datta, Jyotishka ;
Polson, Nicholas G. ;
Willard, Brandon .
STATISTICAL SCIENCE, 2019, 34 (03) :405-427
[5]   The Horseshoe plus Estimator of Ultra-Sparse Signals [J].
Bhadra, Anindya ;
Datta, Jyotishka ;
Polson, Nicholas G. ;
Willard, Brandon .
BAYESIAN ANALYSIS, 2017, 12 (04) :1105-1131
[6]   Macrophage plasticity and interaction with lymphocyte subsets: cancer as a paradigm [J].
Biswas, Subhra K. ;
Mantovani, Alberto .
NATURE IMMUNOLOGY, 2010, 11 (10) :889-896
[7]  
BUSATTO C., 2023, arXiv
[8]  
Carvalho C.M., 2009, International Conference on Artificial Intelligence and Statistics, P73
[9]   The horseshoe estimator for sparse signals [J].
Carvalho, Carlos M. ;
Polson, Nicholas G. ;
Scott, James G. .
BIOMETRIKA, 2010, 97 (02) :465-480
[10]   Content-rich biological network constructed by mining PubMed abstracts [J].
Chen, H ;
Sharp, BM .
BMC BIOINFORMATICS, 2004, 5 (1)