Offline reinforcement learning for safer blood glucose control in people with type 1 diabetes

被引:21
作者
Emerson, Harry [1 ]
Guy, Matthew [1 ,2 ]
McConville, Ryan [1 ]
机构
[1] Univ Bristol, 1 Cathedral Sq, Bristol BS1 5TS, England
[2] Univ Hosp Southampton, Tremona Rd, Southampton SO16 6YD, England
基金
英国工程与自然科学研究理事会;
关键词
Reinforcement learning; Type; 1; diabetes; Glucose control; Artificial pancreas; HYBRID CLOSED-LOOP; INSULIN DELIVERY; IN-SILICO; ADULTS; MODEL;
D O I
10.1016/j.jbi.2023.104376
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The widespread adoption of effective hybrid closed loop systems would represent an important milestone of care for people living with type 1 diabetes (T1D). These devices typically utilise simple control algorithms to select the optimal insulin dose for maintaining blood glucose levels within a healthy range. Online reinforcement learning (RL) has been utilised as a method for further enhancing glucose control in these devices. Previous approaches have been shown to reduce patient risk and improve time spent in the target range when compared to classical control algorithms, but are prone to instability in the learning process, often resulting in the selection of unsafe actions. This work presents an evaluation of offline RL for developing effective dosing policies without the need for potentially dangerous patient interaction during training. This paper examines the utility of BCQ, CQL and TD3-BC in managing the blood glucose of the 30 virtual patients available within the FDA-approved UVA/Padova glucose dynamics simulator. When trained on less than a tenth of the total training samples required by online RL to achieve stable performance, this work shows that offline RL can significantly increase time in the healthy blood glucose range from 61.6 +/- 0.3% to 65.3 +/- 0.5% when compared to the strongest state-of-art baseline (p < 0.001). This is achieved without any associated increase in low blood glucose events. Offline RL is also shown to be able to correct for common and challenging control scenarios such as incorrect bolus dosing, irregular meal timings and compression errors. The code for this work is available at: https://github.com/hemerson1/offline- glucose.
引用
收藏
页数:11
相关论文
共 70 条
  • [1] Effect of a Hybrid Closed-Loop System on Glycemic and Psychosocial Outcomes in Children and Adolescents With Type 1 Diabetes A Randomized Clinical Trial
    Abraham, Mary B.
    de Bock, Martin
    Smith, Grant J.
    Dart, Julie
    Fairchild, Janice M.
    King, Bruce R.
    Ambler, Geoffrey R.
    Cameron, Fergus J.
    McAuley, Sybil A.
    Keech, Anthony C.
    Jenkins, Alicia
    Davis, Elizabeth A.
    O'Neal, David N.
    Jones, Timothy W.
    [J]. JAMA PEDIATRICS, 2021, 175 (12) : 1227 - 1235
  • [2] Meal timing, meal frequency, and breakfast skipping in adult individuals with type 1 diabetes - associations with glycaemic control
    Ahola, Aila J.
    Mutter, Stefan
    Forsblom, Carol
    Harjutsalo, Valma
    Groop, Per-Henrik
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [3] Blood glucose regulation using a neural network predictor with a fuzzy logic controller
    Allam, Fayrouz
    Nossair, Zaki
    Gomma, Hesham
    Ibrahim, Ibrahim
    Abdelsalam, Mona
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2013, 25 (02) : 403 - 413
  • [4] Neural network-based model predictive control for type 1 diabetic rats on artificial pancreas system
    Bahremand, Saeid
    Ko, Hoo Sang
    Balouchzadeh, Ramin
    Lee, H. Felix
    Park, Sarah
    Kwon, Guim
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2019, 57 (01) : 177 - 191
  • [5] Clinical Targets for Continuous Glucose Monitoring Data Interpretation: Recommendations From the International Consensus on Time in Range
    Battelino, Tadej
    Danne, Thomas
    Bergenstal, Richard M.
    Amiel, Stephanie A.
    Beck, Roy
    Biester, Torben
    Bosi, Emanuele
    Buckingham, Bruce A.
    Cefalu, William T.
    Close, Kelly L.
    Cobelli, Claudio
    Dassau, Eyal
    DeVries, J. Hans
    Donaghue, Kim C.
    Dovc, Klemen
    Doyle, Francis J.
    Garg, Satish
    Grunberger, George
    Heller, Simon
    Heinemann, Lutz
    Hirsch, Irl B.
    Hovorka, Roman
    Jia, Weiping
    Kordonouri, Olga
    Kovatchev, Boris
    Kowalski, Aaron
    Laffel, Lori
    Levine, Brian
    Mayorov, Alexander
    Mathieu, Chantal
    Murphy, Helen R.
    Nimri, Revital
    Norgaard, Kirsten
    Parkin, Christopher G.
    Renard, Eric
    Rodbard, David
    Saboo, Banshi
    Schatz, Desmond
    Stoner, Keaton
    Urakami, Tatsuiko
    Weinzimer, Stuart A.
    Phillip, Moshe
    [J]. DIABETES CARE, 2019, 42 (08) : 1593 - 1603
  • [6] Batuhan KirilmazO., 2022, Journal of Diabetes and Clinical Research, V4, P1
  • [7] Validation of Time in Range as an Outcome Measure for Diabetes Clinical Trials
    Beck, Roy W.
    Bergenstal, Richard M.
    Riddlesworth, Tonya D.
    Kollman, Craig
    Li, Zhaomian
    Brown, Adam S.
    Close, Kelly L.
    [J]. DIABETES CARE, 2019, 42 (03) : 400 - 405
  • [8] Bergenstal RM., 2018, Role of Continuous Glucose Monitoring in Diabetes Treatment
  • [9] The Ornstein-Uhlenbeck process as a model of a low pass filtered white noise
    Bibbona, Enrico
    Panfilo, Gianna
    Tavella, Patrizia
    [J]. METROLOGIA, 2008, 45 (06) : S117 - S126
  • [10] One Year Real-World Use of the Control-IQ Advanced Hybrid Closed-Loop Technology
    Breton, Marc D.
    Kovatchev, Boris P.
    [J]. DIABETES TECHNOLOGY & THERAPEUTICS, 2021, 23 (09) : 601 - 608