Electronic health records based reinforcement learning for treatment optimizing

被引:37
作者
Li, Tianhao [1 ]
Wang, Zhishun [1 ]
Lu, Wei [1 ]
Zhang, Qian [1 ]
Li, Dengfeng [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Management & Econ, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Electronic health records; Deep reinforcement learning; Glucose control; Cooperative learning; DIABETIC-KETOACIDOSIS; GLUCOSE REGULATION;
D O I
10.1016/j.is.2021.101878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Electronic Health Records (EHRs) have become one of the main sources of evidence to evaluate clinical actions, improve medical quality, predict disease-risk, and optimize treatment effects. However, EHRs present several modeling challenges, including heterogeneous data types and dynamic characteristics. Reinforcement learning provides an efficient way for sequential decision-making. Powered by model-based reinforcement learning approach, we propose an EHRs-based reinforcement learning algorithm to optimize sequential treatment strategies for diseases, such as sepsis, diabetes, and their complications. We conduct our experiments with this algorithm to optimize physicians' historical treatment strategies and achieve better glucose control for diabetic ketoacidosis (DKA) patients, which is one serious complication of diabetes. The research includes the modeling process and reinforcement learning process. During the EHRs modeling process, besides considering the necessary physiological variables, we also consider the major disease factors to enhance the interpretability of the model. In the reinforcement learning process, a deep Q network is employed to explore the optimal insulin dose for patients. Moreover, inspired by the real medical scenes, we extend the algorithm to cooperative learning environment. We use the joint policy of the two agents to simulate doctor consultations, and achieve better treatment performances in terms of policy and blood glucose control than single agent and clinicians. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 63 条
[1]  
Anand Rajsavi S, 2018, AMIA Jt Summits Transl Sci Proc, V2017, P310
[2]  
[Anonymous], 2017, ARXIV171200654
[3]   An active inference approach to on-line agent monitoring in safety-critical systems [J].
Avila, Luis ;
Martinez, Ernesto .
ADVANCED ENGINEERING INFORMATICS, 2015, 29 (04) :1083-1095
[4]   Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection [J].
Avila, Luis ;
Martinez, Ernesto .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (14) :6327-6345
[5]   The use of reinforcement learning algorithms to meet the challenges of an artificial pancreas [J].
Bothe, Melanie K. ;
Dickens, Luke ;
Reichel, Katrin ;
Tellmann, Arn ;
Ellger, Bjoern ;
Westphal, Martin ;
Faisal, Ahmed A. .
EXPERT REVIEW OF MEDICAL DEVICES, 2013, 10 (05) :661-673
[6]   ISPAD Clinical Practice Consensus Guidelines 2018: Diabetes in adolescence [J].
Cameron, Fergus J. ;
Garvey, Katharine ;
Hood, Korey K. ;
Acerini, Carlo L. ;
Codner, Ethel .
PEDIATRIC DIABETES, 2018, 19 :250-261
[7]  
Castellini J, 2019, AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, P1862
[8]   Letter to the editor: Unexpected high mortality in COVID-19 and diabetic ketoacidosis [J].
Chamorro-Pareja, Natalia ;
Parthasarathy, Sahana ;
Annam, Jayabhargav ;
Hoffman, Julie ;
Coyle, Christine ;
Kishore, Preeti .
METABOLISM-CLINICAL AND EXPERIMENTAL, 2020, 110
[9]   Model-Free Machine Learning in Biomedicine: Feasibility Study in Type 1 Diabetes [J].
Daskalaki, Elena ;
Diem, Peter ;
Mougiakakou, Stavroula G. .
PLOS ONE, 2016, 11 (07)
[10]   An Actor-Critic based controller for glucose regulation in type 1 diabetes [J].
Daskalaki, Elena ;
Diem, Peter ;
Mougiakakou, Stavroula G. .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2013, 109 (02) :116-125