INNOVATED SCALABLE EFFICIENT ESTIMATION IN ULTRA-LARGE GAUSSIAN GRAPHICAL MODELS

被引:37
作者
Fan, Yingying [1 ]
Lv, Jinchi [1 ]
机构
[1] Univ Southern Calif, Marshall Sch Business, Dept Data Sci & Operat, Los Angeles, CA 90089 USA
关键词
Gaussian graphical model; precision matrix; big data; scalability; efficiency; sparsity; COVARIANCE-MATRIX ESTIMATION; NONCONCAVE PENALIZED LIKELIHOOD; FALSE DISCOVERY RATE; VARIABLE SELECTION; ORACLE PROPERTIES; DANTZIG SELECTOR; ADAPTIVE LASSO; REGRESSION; CLASSIFICATION;
D O I
10.1214/15-AOS1416
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale precision matrix estimation is of fundamental importance yet challenging in many contemporary applications for recovering Gaussian graphical models. In this paper, we suggest a new approach of innovated scalable efficient estimation (ISEE) for estimating large precision matrix. Motivated by the innovated transformation, we convert the original problem into that of large covariance matrix estimation. The suggested method combines the strengths of recent advances in high-dimensional sparse modeling and large covariance matrix estimation. Compared to existing approaches, our method is scalable and can deal with much larger precision matrices with simple tuning. Under mild regularity conditions, we establish that this procedure can recover the underlying graphical structure with significant probability and provide efficient estimation of link strengths. Both computational and theoretical advantages of the procedure are evidenced through simulation and real data examples.
引用
收藏
页码:2098 / 2126
页数:29
相关论文
共 48 条
[21]   OPTIMAL CLASSIFICATION IN SPARSE GAUSSIAN GRAPHIC MODEL [J].
Fan, Yingying ;
Jin, Jiashun ;
Yao, Zhigang .
ANNALS OF STATISTICS, 2013, 41 (05) :2537-2571
[22]   Asymptotic Equivalence of Regularization Methods in Thresholded Parameter Space [J].
Fan, Yingying ;
Lv, Jinchi .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2013, 108 (503) :1044-1061
[23]   Sparse inverse covariance estimation with the graphical lasso [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Robert .
BIOSTATISTICS, 2008, 9 (03) :432-441
[24]   ON ALMOST LINEARITY OF LOW-DIMENSIONAL PROJECTIONS FROM HIGH-DIMENSIONAL DATA [J].
HALL, P ;
LI, KC .
ANNALS OF STATISTICS, 1993, 21 (02) :867-889
[25]   INNOVATED HIGHER CRITICISM FOR DETECTING SPARSE SIGNALS IN CORRELATED NOISE [J].
Hall, Peter ;
Jini, Jiashun .
ANNALS OF STATISTICS, 2010, 38 (03) :1686-1732
[26]   Tilting methods for assessing the influence of components in a classifier [J].
Hall, Peter ;
Titterington, D. M. ;
Xue, Jing-Hao .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 :783-803
[27]   Pharmacogenomic predictor of sensitivity to preoperative chemotherapy with paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide in breast cancer [J].
Hess, Kenneth R. ;
Anderson, Keith ;
Symmans, W. Fraser ;
Valero, Vicente ;
Ibrahim, Nuhad ;
Mejia, Jaime A. ;
Booser, Daniel ;
Theriault, Richard L. ;
Buzdar, Aman U. ;
Dempsey, Peter J. ;
Rouzier, Roman ;
Sneige, Nour ;
Ross, Jeffrey S. ;
Vidaurre, Tatiana ;
Gomez, Henry L. ;
Hortobagyi, Gabriel N. ;
Pusztai, Lajos .
JOURNAL OF CLINICAL ONCOLOGY, 2006, 24 (26) :4236-4244
[28]   FDP vs FDR and the Effect of Conditioning Comment [J].
Jin, Jiashun .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (499) :1042-1045
[29]   SPARSISTENCY AND RATES OF CONVERGENCE IN LARGE COVARIANCE MATRIX ESTIMATION [J].
Lam, Clifford ;
Fan, Jianqing .
ANNALS OF STATISTICS, 2009, 37 (6B) :4254-4278