A goal-oriented reinforcement learning for optimal drug dosage control

被引:0
|
作者
Zhang, Qian [1 ]
Li, Tianhao [1 ]
Li, Dengfeng [1 ]
Lu, Wei [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Management & Econ, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Goal-oriented; Reinforcement learning; Hierarchical decision; Multi-agent; Drug dosage control; SEPTIC SHOCK; SEPSIS; MORTALITY; LEVEL; CARE;
D O I
10.1007/s10479-024-06029-x
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The dosage control of therapeutic drugs is a concern for clinicians. Whether the clinician's dosing decision is correct and efficient determines patient's life. In intensive care units (ICU), medication decision is a dynamic and continuous process, which is difficult to solve by traditional intelligent technologies. while reinforcement learning (RL) has an advantage in handling sequential decision making, it faces challenges in multi-level problems because of the delayed rewards and complex states. Hierarchical reinforcement learning (HRL) is a layered algorithm based on RL. HRL has been proved to be effective in delayed sparse reward issues and reduce the learning difficulty by dividing the long-term goal into stages. Inspired by this, we propose a goal-oriented reinforcement learning (GORL) approach to optimize the drug dosage control for sepsis patients. Specifically, GORL employs two agents to make dosage decisions cooperatively by simulating the behaviors of clinicians. GORL decompose a long-term goal into several short-term goals to reduce the exploration space. In the long-term goal, the concept of the goal-oriented is introduced to solve the sparse reward. A goal-oriented hierarchical structure can help agents to interact and cooperate to achieve the short-term goal. In addition, we design a hindsight intrinsic reward to balance the long-term and short-term goals, and are thus able to learn an optimal policy of drug dosage control. We conduct our experiments on MIMIC-IV, which is one of the biggest medical datasets. The experimental results show that our model outperforms other baseline algorithms and can learn a more robust treatment policy than clinicians, with reducing the patient's mortality by 10.23%.
引用
收藏
页码:1403 / 1423
页数:21
相关论文
共 50 条
  • [1] A Goal-Oriented Specification Language for Reinforcement Learning
    Schwan, Simon
    Kloes, Verena
    Glesner, Sabine
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2023, 2023, 13890 : 169 - 180
  • [2] GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
    Liu, Jianfeng
    Pan, Feiyang
    Luo, Ling
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1793 - 1796
  • [3] Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments
    Chen, Liyu
    Luo, Haipeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [4] OPTIMAL CONTROL MODELS OF GOAL-ORIENTED HUMAN LOCOMOTION
    Chitour, Yacine
    Jean, Frederic
    Mason, Paolo
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2012, 50 (01) : 147 - 170
  • [5] On goal-oriented adaptivity for elliptic optimal control problems
    Weiser, Martin
    OPTIMIZATION METHODS & SOFTWARE, 2013, 28 (05): : 969 - 992
  • [6] Graph Enhanced Hierarchical Reinforcement Learning for Goal-oriented Learning Path Recommendation
    Li, Qingyao
    Xia, Wei
    Yin, Li'ang
    Shen, Jian
    Rui, Renting
    Zhang, Weinan
    Chen, Xianyu
    Tang, Ruiming
    Yu, Yong
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1318 - 1327
  • [7] Multi-objective reinforcement learning in process control: A goal-oriented approach with adaptive thresholds
    Li, Dazi
    Gu, Wentao
    Song, Tianheng
    JOURNAL OF PROCESS CONTROL, 2023, 129
  • [8] Optimizing passengers' experience: A goal-oriented reinforcement learning speed control approach for urban railway trains
    Liu, Wangyang
    Feng, Qingsheng
    Li, Hong
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART F-JOURNAL OF RAIL AND RAPID TRANSIT, 2024, 238 (10) : 1283 - 1295
  • [9] Learning of Subgoals for Goal-Oriented Behavior Control of Mobile Robots
    Lee, Sang Hyoung
    Lee, Sanghoon
    Suh, Il Hong
    Chung, Wan Kyun
    ADVANCES IN NEURO-INFORMATION PROCESSING, PT I, 2009, 5506 : 64 - +
  • [10] Goal-Oriented Adaptivity in Control Constrained Optimal Control of Partial Differential Equations
    Hintermueller, Michael
    Hoppe, Ronald H. W.
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 6454 - 6459