PENALIZED ESTIMATION IN HIGH-DIMENSIONAL HIDDEN MARKOV MODELS WITH STATE-SPECIFIC GRAPHICAL MODELS

被引:16
|
作者
Stadler, Nicolas [1 ]
Mukherjee, Sach [1 ]
机构
[1] Netherlands Canc Inst, Dept Biochem, NL-1066 CX Amsterdam, Netherlands
关键词
HMM; Graphical Lasso; universal regularization; model selection; MMDL; greedy backward pruning; genome biology; chromatin modeling; VARIABLE SELECTION; MIXTURE;
D O I
10.1214/13-AOAS662
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider penalized estimation in hidden Markov models (HMMs) with multivariate Normal observations. In the moderate-to-large dimensional setting, estimation for HMMs remains challenging in practice, due to several concerns arising from the hidden nature of the states. We address these concerns by l(1)-penalization of state-specific inverse covariance matrices. Penalized estimation leads to sparse inverse covariance matrices which can be interpreted as state-specific conditional independence graphs. Penalization is nontrivial in this latent variable setting; we propose a penalty that automatically adapts to the number of states K and the state-specific sample sizes and can cope with scaling issues arising from the unknown states. The methodology is adaptive and very general, applying in particular to both low- and high-dimensional settings without requiring hand tuning. Furthermore, our approach facilitates exploration of the number of states K by coupling estimation for successive candidate values K. Empirical results on simulated examples demonstrate the effectiveness of the proposed approach. In a challenging real data example from genome biology, we demonstrate the ability of our approach to yield gains in predictive power and to deliver richer estimates than existing methods.
引用
收藏
页码:2157 / 2179
页数:23
相关论文
共 50 条
  • [31] Double penalized variable selection for high-dimensional partial linear mixed effects models
    Yang, Yiping
    Luo, Chuanqin
    Yang, Weiming
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 204
  • [32] Efficient Computation of High-Dimensional Penalized Piecewise Constant Hazard Random Effects Models
    Heiling, Hillary M.
    Rashid, Naim U.
    Li, Quefeng
    Peng, Xianlu L.
    Yeh, Jen Jen
    Ibrahim, Joseph G.
    STATISTICS IN MEDICINE, 2025, 44 (06)
  • [33] A two-step method for estimating high-dimensional Gaussian graphical models
    Yang, Yuehan
    Zhu, Ji
    SCIENCE CHINA-MATHEMATICS, 2020, 63 (06) : 1203 - 1218
  • [34] Strong oracle guarantees for partial penalized tests of high-dimensional generalized linear models
    Jacobson, Tate
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2025,
  • [35] Variable selection and estimation for high-dimensional spatial autoregressive models
    Cai, Liqian
    Maiti, Tapabrata
    SCANDINAVIAN JOURNAL OF STATISTICS, 2020, 47 (02) : 587 - 607
  • [36] REGULARIZED ESTIMATION IN SPARSE HIGH-DIMENSIONAL TIME SERIES MODELS
    Basu, Sumanta
    Michailidis, George
    ANNALS OF STATISTICS, 2015, 43 (04) : 1535 - 1567
  • [37] ESTIMATION IN HIGH-DIMENSIONAL LINEAR MODELS WITH DETERMINISTIC DESIGN MATRICES
    Shao, Jun
    Deng, Xinwei
    ANNALS OF STATISTICS, 2012, 40 (02) : 812 - 831
  • [38] Shrinkage Estimation of High-Dimensional Factor Models with Structural Instabilities
    Cheng, Xu
    Liao, Zhipeng
    Schorfheide, Frank
    REVIEW OF ECONOMIC STUDIES, 2016, 83 (04) : 1511 - 1543
  • [39] Penalized estimation in finite mixture of ultra-high dimensional regression models
    Tang, Shiyi
    Zheng, Jiali
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (17) : 5971 - 5992
  • [40] Minimax Adaptive Estimation of Nonparametric Hidden Markov Models
    De Castro, Yohann
    Gassiat, Elisabeth
    Lacour, Claire
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17