Order selection for regression-based hidden Markov model

被引:8
作者
Lin, Yiqi [1 ]
Song, Xinyuan [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Stat, Hong Kong, Peoples R China
关键词
ECM-ITD algorithm; Group-Sort-Fuse procedure; Hidden Markov model; Longitudinal data; Order selection; NONCONCAVE PENALIZED LIKELIHOOD; FINITE MIXTURE-MODELS; ALZHEIMERS-DISEASE; VARIABLE SELECTION; SHRINKAGE; ALGORITHM; NUMBER;
D O I
10.1016/j.jmva.2022.105061
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Hidden Markov models (HMMs) describe the relationship between two stochastic processes: an observed process and an unobservable finite-state transition process. Owing to their modeling dynamic heterogeneity, HMMs are widely used to analyze heterogeneous longitudinal data. Traditional HMMs frequently assume that the number of hidden states (i.e., the order of HMM) is a constant and should be specified prior to analysis. This assumption is unrealistic and restrictive in many applications. In this study, we consider regression-based hidden Markov model (RHMM) while allowing the number of hidden states to be unknown and determined by the data. We propose a novel likelihood-based double penalized method, along with an efficient expectation-conditional maximization with iterative thresholding-based descent (ECM-ITD) algorithm, to perform order selection in the context of RHMM. An extended Group-Sort-Fuse procedure is proposed to rank the regression coefficients and impose penalties on the discrepancy of adjacent coefficients. The order selection consistency and convergence of the ECM-ITD algorithm are established under mild conditions. Simulation studies are conducted to evaluate the empirical performance of the proposed method. An application of the proposed methodology to a real-life study on Alzheimer's disease is presented. (C) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页数:20
相关论文
共 46 条
[1]  
Agresti A., 2002, Categorical data analysis., VSecond, DOI DOI 10.1002/0471249688
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[4]  
[Anonymous], 1997, Hidden Markov and other models for discrete-valued time series
[5]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[6]  
Beal M.J., 2003, Variational Algorithms for Approximate Bayesian Inference
[7]  
Bickel PJ, 1998, ANN STAT, V26, P1614
[8]   OPTIMAL RATE OF CONVERGENCE FOR FINITE MIXTURE-MODELS [J].
CHEN, JH .
ANNALS OF STATISTICS, 1995, 23 (01) :221-233
[9]   Order Selection in Finite Mixture Models With a Nonsmooth Penalty [J].
Chen, Jiahua ;
Khalili, Abbas .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (485) :187-196
[10]  
Dacunha-Castelle D, 1999, ANN STAT, V27, P1178