l1-Penalized censored Gaussian graphical model

被引:8
作者
Augugliaro, Luigi [1 ]
Abbruzzo, Antonino [1 ]
Vinciotti, Veronica [2 ]
机构
[1] Univ Palermo, Dept Econ Business & Stat, Bldg 13,Viale Sci, I-90128 Palermo, Italy
[2] Brunel Univ London, Dept Math, Kingston Lane, Uxbridge UB8 3PH, Middx, England
关键词
Censored data; Expectation-Maximization algorithm; Gaussian graphical model; Graphical Lasso; High-dimensional data; MAXIMUM-LIKELIHOOD; MULTIVARIATE; SELECTION;
D O I
10.1093/biostatistics/kxy043
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Graphical lasso is one of the most used estimators for inferring genetic networks. Despite its diffusion, there are several fields in applied research where the limits of detection of modern measurement technologies make the use of this estimator theoretically unfounded, even when the assumption of a multivariate Gaussian distribution is satisfied. Typical examples are data generated by polymerase chain reactions and flow cytometer. The combination of censoring and high-dimensionality make inference of the underlying genetic networks from these data very challenging. In this article, we propose an l(1)-penalized Gaussian graphical model for censored data and derive two EM-like algorithms for inference. We evaluate the computational efficiency of the proposed algorithms by an extensive simulation study and show that, when censored data are available, our proposal is superior to existing competitors both in terms of network recovery and parameter estimation. We apply the proposed method to gene expression data generated by microfluidic Reverse Transcription quantitative Polymerase Chain Reaction technology in order to make inference on the regulatory mechanisms of blood development. A software implementation of our method is available on github (https://github.com/LuigiAugugliaro/cglasso).
引用
收藏
页码:E1 / E16
页数:16
相关论文
共 39 条
  • [1] Multivariate truncated moments
    Arismendi, J. C.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 117 : 41 - 75
  • [2] Augugliaro L., 2016, COMPUTATIONAL NETWOR, P201
  • [3] Regularized estimation of large covariance matrices
    Bickel, Peter J.
    Levina, Elizaveta
    [J]. ANNALS OF STATISTICS, 2008, 36 (01) : 199 - 227
  • [4] Brown M, 2000, CLIN CHEM, V46, P1221
  • [5] Gene Regulatory Network Inference from Single-Cell Data Using Multivariate Information Measures
    Chan, Thalia E.
    Stumpf, Michael P. H.
    Babtie, Ann C.
    [J]. CELL SYSTEMS, 2017, 5 (03) : 251 - +
  • [6] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [7] How to do successful gene expression analysis using real-time PCR
    Derveaux, Stefaan
    Vandesompele, Jo
    Hellemans, Jan
    [J]. METHODS, 2010, 50 (04) : 227 - 230
  • [8] Foygel R., 2010, ADV NEURAL INFORM PR, P604
  • [9] Friedman J. H., 2018, GLASSO GRAPHICAL LAS
  • [10] Sparse inverse covariance estimation with the graphical lasso
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Robert
    [J]. BIOSTATISTICS, 2008, 9 (03) : 432 - 441