High-dimensional joint estimation of multiple directed Gaussian graphical models

被引:11
作者
Wang, Yuhao [1 ,5 ]
Segarra, Santiago [2 ,5 ]
Uhler, Caroline [3 ,4 ]
机构
[1] Univ Cambridge, Stat Lab, Cambridge, England
[2] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77251 USA
[3] MIT, Lab Informat & Decis Syst, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] MIT, Inst Data Syst & Soc, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Causal inference; linear structural equation model; high-dimensional statistics; graphical model; MARKOV EQUIVALENCE CLASSES; INVERSE COVARIANCE ESTIMATION; GENE-EXPRESSION PROFILES; OVARIAN-CANCER; BAYESIAN NETWORKS; IDENTIFICATION;
D O I
10.1214/20-EJS1724
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of jointly estimating multiple related directed acyclic graph (DAG) models based on high-dimensional data from each graph. This problem is motivated by the task of learning gene regulatory networks based on gene expression data from different tissues, developmental stages or disease states. We prove that under certain regularity conditions, the proposed l(0)-penalized maximum likelihood estimator converges in Frobenius norm to the adjacency matrices consistent with the data-generating distributions and has the correct sparsity. In particular, we show that this joint estimation procedure leads to a faster convergence rate than estimating each DAG model separately. As a corollary, we also obtain high-dimensional consistency results for causal inference from a mix of observational and interventional data. For practical purposes, we propose jointGES consisting of Greedy Equivalence Search (GES) to estimate the union of all DAG models followed by variable selection using lasso to obtain the different DAGs, and we analyze its consistency guarantees. The proposed method is illustrated through an analysis of simulated data as well as epithelial ovarian cancer gene expression data.
引用
收藏
页码:2439 / 2483
页数:45
相关论文
共 50 条
[41]   Graphical Local Genetic Algorithm for High-Dimensional Log-Linear Models [J].
Roach, Lyndsay ;
Gao, Xin .
MATHEMATICS, 2023, 11 (11)
[42]   A FOCUSED INFORMATION CRITERION FOR GRAPHICAL MODELS IN FMRI CONNECTIVITY WITH HIGH-DIMENSIONAL DATA [J].
Pircalabelu, Eugen ;
Claeskens, Gerda ;
Jahfari, Sara ;
Waldorp, Lourens J. .
ANNALS OF APPLIED STATISTICS, 2015, 9 (04) :2179-2214
[43]   Estimation of Linear Functionals in High-Dimensional Linear Models: From Sparsity to Nonsparsity [J].
Zhao, Junlong ;
Zhou, Yang ;
Liu, Yufeng .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) :1579-1591
[44]   Graphical Modeling of High-Dimensional Time Series [J].
Tugnait, Jitendra K. .
2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, :840-844
[45]   Estimation and inference of change points in high-dimensional factor models [J].
Bai, Jushan ;
Han, Xu ;
Shi, Yutang .
JOURNAL OF ECONOMETRICS, 2020, 219 (01) :66-100
[46]   Joint estimation of multiple mixed graphical models for pan-cancer network analysis [J].
Jia, Bochao ;
Liang, Faming .
STAT, 2020, 9 (01)
[47]   ASYMPTOTIC NORMALITY AND OPTIMALITIES IN ESTIMATION OF LARGE GAUSSIAN GRAPHICAL MODELS [J].
Ren, Zhao ;
Sun, Tingni ;
Zhang, Cun-Hui ;
Zhou, Harrison H. .
ANNALS OF STATISTICS, 2015, 43 (03) :991-1026
[48]   Learning Sparse High-Dimensional Matrix-Valued Graphical Models From Dependent Data [J].
Tugnait, Jitendra K. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 :3363-3379
[49]   Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models [J].
Mei, Song ;
Wu, Yuchen .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2025, 71 (04) :2930-2954
[50]   A projection-based conditional dependence measure with applications to high-dimensional undirected graphical models [J].
Fan, Jianqing ;
Feng, Yang ;
Xia, Lucy .
JOURNAL OF ECONOMETRICS, 2020, 218 (01) :119-139