Learning Hidden Markov Models with Structured Transition Dynamics

Cited by: 0
Authors
Ma, Simin [1]
Dehghanian, Amin [1]
Garcia, Gian-Gabriel [1]
Serban, Nicoleta [1]
Affiliations
[1] Georgia Inst Technol, H Milton Stewart Sch Ind & Syst Engn, Atlanta, GA 30332, USA
Funding
National Institutes of Health (USA)
Keywords
expectation-maximization algorithm; hidden Markov model; convex optimization; accelerated gradient method; statistical learning; LIKELIHOOD RATIO TEST; PATIENTS PRICE; CONCUSSION; ALGORITHM; STATEMENT; SOLVERS; PRIVACY; NUMBER;
DOI
10.1287/ijoc.2022.0342
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The hidden Markov model (HMM) provides a natural framework for modeling the dynamic evolution of latent diseases. The unknown probability matrices of HMMs can be learned through the well-known Baum-Welch algorithm, a special case of the expectation-maximization algorithm. In many disease models, the probability matrices possess nontrivial properties that may be represented through a set of linear constraints. In these cases, the traditional Baum-Welch algorithm is no longer applicable because the maximization step cannot be solved by an explicit formula. In this paper, we propose a novel approach to efficiently solve the maximization step problem under linear constraints by providing a Lagrangian dual reformulation that we solve by an accelerated gradient method. The performance of this approach critically depends on devising a fast method to compute the gradient in each iteration. For this purpose, we employ dual decomposition and derive Karush-Kuhn-Tucker conditions to reduce our problem to a set of single-variable equations, which we solve using a simple bisection method. We apply this method to a case study on sports-related concussion and provide an extensive numerical study using simulation. We show that our approach is orders of magnitude faster computationally, and more accurate, than alternative approaches. Moreover, compared with other methods, our approach is far less sensitive to increases in problem size. Overall, our contribution lies in accurate and efficient HMM parameter estimation under linear constraints, which covers a wide range of applications in disease modeling and beyond.
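The reduction to single-variable equations described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and the setup is an assumption made for illustration: for one row of a probability matrix with expected counts `c` from the E-step, suppose the linear constraints have been moved into the objective with fixed dual multipliers, giving a linear penalty vector `shift`. Maximizing `sum_j c[j] * log(p[j]) - shift @ p` subject to `sum(p) == 1` then has the stationarity condition `p[j] = c[j] / (nu + shift[j])`, where the scalar `nu` is found by bisection. The names `row_update`, `c`, `shift`, and `nu` are hypothetical.

```python
import numpy as np


def row_update(c, shift, tol=1e-12, max_iter=200):
    """Solve max_p sum_j c[j]*log(p[j]) - shift @ p  s.t. sum(p) == 1, p > 0.

    Assumes every entry of c is strictly positive. The stationarity
    condition gives p[j] = c[j] / (nu + shift[j]); nu is the root of the
    strictly decreasing function g(nu) = sum_j c[j] / (nu + shift[j]) - 1.
    """
    c = np.asarray(c, dtype=float)
    shift = np.asarray(shift, dtype=float)

    # g(nu) -> +inf as nu approaches -min(shift) from above, and
    # g(nu) <= 0 at nu = -min(shift) + sum(c), so the root is bracketed.
    lo = -shift.min()
    hi = lo + c.sum()

    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if np.sum(c / (mid + shift)) > 1.0:
            lo = mid  # probabilities sum to more than one: increase nu
        else:
            hi = mid  # probabilities sum to at most one: decrease nu
        if hi - lo < tol:
            break

    # hi always keeps every denominator strictly positive.
    return c / (hi + shift)


# With zero multipliers the update reduces to the usual normalized counts.
p_plain = row_update([2.0, 3.0, 5.0], [0.0, 0.0, 0.0])

# A positive multiplier on the first entry pushes its probability down.
p_penalized = row_update([2.0, 3.0, 5.0], [5.0, 0.0, 0.0])
```

In a full method of the kind the abstract outlines, an outer loop would update the multipliers (there, by an accelerated gradient method) and call a row update like this one each time the dual gradient is needed; that outer loop is omitted here.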
Pages: 27