PENALIZED ESTIMATION IN HIGH-DIMENSIONAL HIDDEN MARKOV MODELS WITH STATE-SPECIFIC GRAPHICAL MODELS

被引：16

作者：

Stadler, Nicolas ^{[1
]}

Mukherjee, Sach ^{[1
]}

机构：

[1] Netherlands Canc Inst, Dept Biochem, NL-1066 CX Amsterdam, Netherlands

来源：

ANNALS OF APPLIED STATISTICS | 2013年 / 7卷 / 04期

关键词：

HMM; Graphical Lasso; universal regularization; model selection; MMDL; greedy backward pruning; genome biology; chromatin modeling; VARIABLE SELECTION; MIXTURE;

D O I：

10.1214/13-AOAS662

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We consider penalized estimation in hidden Markov models (HMMs) with multivariate Normal observations. In the moderate-to-large dimensional setting, estimation for HMMs remains challenging in practice, due to several concerns arising from the hidden nature of the states. We address these concerns by l(1)-penalization of state-specific inverse covariance matrices. Penalized estimation leads to sparse inverse covariance matrices which can be interpreted as state-specific conditional independence graphs. Penalization is nontrivial in this latent variable setting; we propose a penalty that automatically adapts to the number of states K and the state-specific sample sizes and can cope with scaling issues arising from the unknown states. The methodology is adaptive and very general, applying in particular to both low- and high-dimensional settings without requiring hand tuning. Furthermore, our approach facilitates exploration of the number of states K by coupling estimation for successive candidate values K. Empirical results on simulated examples demonstrate the effectiveness of the proposed approach. In a challenging real data example from genome biology, we demonstrate the ability of our approach to yield gains in predictive power and to deliver richer estimates than existing methods.

引用

页码：2157 / 2179

页数：23

共 50 条

[31] Double penalized variable selection for high-dimensional partial linear mixed effects models
Yang, Yiping
Luo, Chuanqin
Yang, Weiming
JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 204
[32] Efficient Computation of High-Dimensional Penalized Piecewise Constant Hazard Random Effects Models
Heiling, Hillary M.
Rashid, Naim U.
Li, Quefeng
Peng, Xianlu L.
Yeh, Jen Jen
Ibrahim, Joseph G.
STATISTICS IN MEDICINE, 2025, 44 (06)
[33] A two-step method for estimating high-dimensional Gaussian graphical models
Yang, Yuehan
Zhu, Ji
SCIENCE CHINA-MATHEMATICS, 2020, 63 (06) : 1203 - 1218
[34] Strong oracle guarantees for partial penalized tests of high-dimensional generalized linear models
Jacobson, Tate
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2025,
[35] Variable selection and estimation for high-dimensional spatial autoregressive models
Cai, Liqian
Maiti, Tapabrata
SCANDINAVIAN JOURNAL OF STATISTICS, 2020, 47 (02) : 587 - 607
[36] REGULARIZED ESTIMATION IN SPARSE HIGH-DIMENSIONAL TIME SERIES MODELS
Basu, Sumanta
Michailidis, George
ANNALS OF STATISTICS, 2015, 43 (04) : 1535 - 1567
[37] ESTIMATION IN HIGH-DIMENSIONAL LINEAR MODELS WITH DETERMINISTIC DESIGN MATRICES
Shao, Jun
Deng, Xinwei
ANNALS OF STATISTICS, 2012, 40 (02) : 812 - 831
[38] Shrinkage Estimation of High-Dimensional Factor Models with Structural Instabilities
Cheng, Xu
Liao, Zhipeng
Schorfheide, Frank
REVIEW OF ECONOMIC STUDIES, 2016, 83 (04) : 1511 - 1543
[39] Penalized estimation in finite mixture of ultra-high dimensional regression models
Tang, Shiyi
Zheng, Jiali
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (17) : 5971 - 5992
[40] Minimax Adaptive Estimation of Nonparametric Hidden Markov Models
De Castro, Yohann
Gassiat, Elisabeth
Lacour, Claire
JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17

← 1 2 3 4 5 →