Learning Large Graph-Based MDPs With Historical Data

被引:1
作者
Haksar, Ravi N. [1 ]
Schwager, Mac [2 ]
机构
[1] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
来源
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS | 2022年 / 9卷 / 03期
关键词
Learning; Markov processes; optimization; social networks; stochastic/uncertain systems; HIDDEN MARKOV-MODELS;
D O I
10.1109/TCNS.2021.3128530
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider learning the dynamics and measurement model parameters of a graph-based Markov decision process (GMDP) given a history of measurements. Graph-based models have been used in modeling many data-based applications, such as recognition tasks, disease epidemics, forest wildfires, freeway traffic, and social networks. We leverage the expectation-maximization framework and develop an algorithm that optimizes the measurement likelihood and has favorable complexity for large models. In contrast to prior work, we directly consider GMDPs with significantly large discrete state spaces, arbitrary coupling structure, and long measurement sequences. We also consider a special structural property called Anonymous Influence, which we use to test hypotheses and gain insights into the data. We demonstrate the effectiveness of our learning algorithm by considering two real-world data sets on the 2020 Novel Coronavirus (COVID-19) pandemic in California and on user interactions on Twitter. Our results show that the learned GMDP models better explain the data compared to an uncoupled model assumption.
引用
收藏
页码:1447 / 1458
页数:12
相关论文
共 33 条
[1]  
[Anonymous], 2001, A new formulation of coupled hidden Markov models
[2]   Coupled hidden Markov models for complex action recognition [J].
Brand, M ;
Oliver, N ;
Pentland, A .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :994-999
[3]  
California Department of Public Health, COVID 19 CAS
[4]   Efficient approximate linear programming for factored MDPs [J].
Chen, Feng ;
Cheng, Qiang ;
Dong, Jianwu ;
Yu, Zhaofei ;
Wang, Guojun ;
Xu, Wenli .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2015, 63 :101-121
[5]  
Chis Tiberiu, 2014, 2014 22nd Annual IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS). Proceedings, P168, DOI 10.1109/MASCOTS.2014.29
[6]  
Chu StephenM., 2000, P IEEE INT C SPOKEN, P747
[7]  
De Abir, 2016, Advances in Neural Information Processing Systems
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]  
Dong W., 2012, UNCERT ARTIFICIAL IN, P227
[10]  
Dunmur AP, 1997, ADV NEUR IN, V9, P431