Improving Deep Reinforcement Learning With Transitional Variational Autoencoders: A Healthcare Application

Cited by: 21
Authors
Baucum, Matthew [1]
Khojandi, Anahita [1]
Vasudevan, Rama [2]
Affiliations
[1] Univ Tennessee, Dept Ind & Syst Engn, Knoxville, TN 37996 USA
[2] Oak Ridge Natl Lab, Ctr Nanophase Mat Sci, Oak Ridge, TN 37830 USA
Keywords
Hidden Markov models; Data models; Neural networks; Training; Trajectory; Biomedical measurement; Reinforcement learning; variational autoencoders; generative adversarial networks; long short-term memory networks
DOI
10.1109/JBHI.2020.3027443
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Reinforcement learning is a powerful tool for developing personalized treatment regimens from healthcare data. Yet training reinforcement learning agents through direct interactions with patients is often impractical for ethical reasons. One solution is to train reinforcement learning agents using an 'environment model,' which is learned from retrospective patient data and can simulate realistic patient trajectories. In this study, we propose transitional variational autoencoders (tVAE), a generative neural network architecture that learns a direct mapping between distributions over clinical measurements at adjacent time points. Unlike other models, the tVAE requires few distributional assumptions and benefits from identical training and testing architectures. This model produces more realistic patient trajectories than state-of-the-art sequential decision-making models and generative neural networks, and it can be used to learn effective treatment policies.
Pages: 2273-2280
Number of pages: 8