High-dimensional joint estimation of multiple directed Gaussian graphical models

被引:11
作者
Wang, Yuhao [1 ,5 ]
Segarra, Santiago [2 ,5 ]
Uhler, Caroline [3 ,4 ]
机构
[1] Univ Cambridge, Stat Lab, Cambridge, England
[2] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77251 USA
[3] MIT, Lab Informat & Decis Syst, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] MIT, Inst Data Syst & Soc, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Causal inference; linear structural equation model; high-dimensional statistics; graphical model; MARKOV EQUIVALENCE CLASSES; INVERSE COVARIANCE ESTIMATION; GENE-EXPRESSION PROFILES; OVARIAN-CANCER; BAYESIAN NETWORKS; IDENTIFICATION;
D O I
10.1214/20-EJS1724
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of jointly estimating multiple related directed acyclic graph (DAG) models based on high-dimensional data from each graph. This problem is motivated by the task of learning gene regulatory networks based on gene expression data from different tissues, developmental stages or disease states. We prove that under certain regularity conditions, the proposed l(0)-penalized maximum likelihood estimator converges in Frobenius norm to the adjacency matrices consistent with the data-generating distributions and has the correct sparsity. In particular, we show that this joint estimation procedure leads to a faster convergence rate than estimating each DAG model separately. As a corollary, we also obtain high-dimensional consistency results for causal inference from a mix of observational and interventional data. For practical purposes, we propose jointGES consisting of Greedy Equivalence Search (GES) to estimate the union of all DAG models followed by variable selection using lasso to obtain the different DAGs, and we analyze its consistency guarantees. The proposed method is illustrated through an analysis of simulated data as well as epithelial ovarian cancer gene expression data.
引用
收藏
页码:2439 / 2483
页数:45
相关论文
共 50 条
[21]   Direct estimation of differential networks under high-dimensional nonparanormal graphical models [J].
Zhang, Qingyang .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2020, 48 (02) :187-203
[22]   On Joint Estimation of Gaussian Graphical Models for Spatial and Temporal Data [J].
Lin, Zhixiang ;
Wang, Tao ;
Yang, Can ;
Zhao, Hongyu .
BIOMETRICS, 2017, 73 (03) :769-779
[23]   Jewel: A Novel Method for Joint Estimation of Gaussian Graphical Models [J].
Angelini, Claudia ;
De Canditiis, Daniela ;
Plaksienko, Anna .
MATHEMATICS, 2021, 9 (17)
[24]   Joint Learning of Multiple Sparse Matrix Gaussian Graphical Models [J].
Huang, Feihu ;
Chen, Songcan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (11) :2606-2620
[25]   Missing Values in Multiple Joint Inference of Gaussian Graphical Models [J].
Tozzo, Veronica ;
Garbarino, Davide ;
Barla, Annalisa .
INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 :497-508
[26]   Symmetries in directed Gaussian graphical models [J].
Makam, Visu ;
Reichenbach, Philipp ;
Seigal, Anna .
ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02) :3969-4010
[27]   Transfer learning for high-dimensional transelliptical graphical models [J].
Zhang, Yuhao ;
Yu, Yang ;
Sheng, Danshu ;
Liang, Wanfeng .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2025, 69 (03)
[28]   Transfer learning for high-dimensional transelliptical graphical models [J].
Zhang, Yuhao ;
Yu, Yang ;
Sheng, Danshu ;
Liang, Wanfeng .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2025,
[29]   Consistent multiple changepoint estimation with fused Gaussian graphical models [J].
Gibberd, A. ;
Roy, S. .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (02) :283-309
[30]   Consistent multiple changepoint estimation with fused Gaussian graphical models [J].
A. Gibberd ;
S. Roy .
Annals of the Institute of Statistical Mathematics, 2021, 73 :283-309