REGULARIZED RANK-BASED ESTIMATION OF HIGH-DIMENSIONAL NONPARANORMAL GRAPHICAL MODELS

Citations: 162
Authors
Xue, Lingzhou [1]
Zou, Hui [1]
Affiliations
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
Funding
National Science Foundation (USA)
Keywords
CLIME; Dantzig selector; graphical lasso; nonparanormal graphical model; rate of convergence; variable transformation; COVARIANCE ESTIMATION; VARIABLE SELECTION; DANTZIG SELECTOR; LASSO; BIOSYNTHESIS; LIKELIHOOD; PATHWAYS
DOI
10.1214/12-AOS1041
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract
A sparse precision matrix can be directly translated into a sparse Gaussian graphical model under the assumption that the data follow a joint normal distribution. This neat property makes high-dimensional precision matrix estimation very appealing in many applications. However, in practice we often face nonnormal data, and variable transformation is often used to achieve normality. In this paper we consider the nonparanormal model, which assumes that the variables follow a joint normal distribution after a set of unknown monotone transformations. The nonparanormal model is much more flexible than the normal model while retaining the good interpretability of the latter, in that each zero entry in the sparse precision matrix of the nonparanormal model corresponds to a pair of conditionally independent variables. We show that the nonparanormal graphical model can be efficiently estimated by using a rank-based estimation scheme that does not require estimating these unknown transformation functions. In particular, we study the rank-based graphical lasso, the rank-based neighborhood Dantzig selector and the rank-based CLIME. We establish their theoretical properties in the setting where the dimension is nearly exponentially large relative to the sample size. It is shown that the proposed rank-based estimators work as well as their oracle counterparts defined with the oracle data. Furthermore, the theory motivates us to consider the adaptive versions of the rank-based neighborhood Dantzig selector and the rank-based CLIME, which are shown to enjoy graphical model selection consistency without assuming the irrepresentable condition required by the oracle and rank-based graphical lasso. Simulated and real data are used to demonstrate the finite-sample performance of the rank-based estimators.
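The abstract's central point, that a rank-based scheme avoids estimating the unknown monotone transformations, can be illustrated with a minimal sketch. It assumes a Spearman-type rank correlation mapped to the latent scale with the standard Gaussian-copula identity 2·sin(π/6·ρ); the function name `rank_correlation`, the simulated covariance, and the chosen transformations are hypothetical illustrations and are not taken from the record. The resulting matrix is the kind of input one would pass to a regularized precision-matrix estimator such as the graphical lasso, which is not implemented here.

```python
import numpy as np


def rank_correlation(X):
    """Rank-based correlation estimate (hypothetical helper).

    Computes Spearman's rho between columns and maps it to the latent
    normal correlation scale via 2 * sin(pi / 6 * rho).
    Assumes continuous data, i.e. no ties within a column.
    """
    # Double argsort gives the rank (1..n) of each entry within its column.
    ranks = np.argsort(np.argsort(X, axis=0), axis=0) + 1.0
    # Spearman's rho is the Pearson correlation of the ranks.
    rho = np.corrcoef(ranks, rowvar=False)
    R = 2.0 * np.sin(np.pi / 6.0 * rho)
    np.fill_diagonal(R, 1.0)
    return R


# Simulated latent Gaussian data (illustrative covariance, not from the paper).
rng = np.random.default_rng(0)
cov = np.array([[1.0, 0.5, 0.0],
                [0.5, 1.0, 0.3],
                [0.0, 0.3, 1.0]])
Z = rng.multivariate_normal(np.zeros(3), cov, size=500)

# Observed data: strictly increasing transformations of the latent variables.
X = np.column_stack([np.exp(Z[:, 0]), Z[:, 1] ** 3, Z[:, 2]])

R_latent = rank_correlation(Z)
R_observed = rank_correlation(X)
```

Because ranks are unchanged by strictly increasing transformations, `R_observed` coincides with `R_latent` even though the transformations were never estimated.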
Pages: 2541-2571
Page count: 31