A self-supervised method for treatment recommendation in sepsis

被引:4
作者
Zhu, Sihan [1 ]
Pu, Jian [1 ,2 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Treatment recommendation; Sepsis; Self-supervised learning; Reinforcement learning; Electronic health records; TP391; 4; SYSTEM;
D O I
10.1631/FITEE.2000127
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sepsis treatment is a highly challenging effort to reduce mortality in hospital intensive care units since the treatment response may vary for each patient. Tailored treatment recommendations are desired to assist doctors in making decisions efficiently and accurately. In this work, we apply a self-supervised method based on reinforcement learning (RL) for treatment recommendation on individuals. An uncertainty evaluation method is proposed to separate patient samples into two domains according to their responses to treatments and the state value of the chosen policy. Examples of two domains are then reconstructed with an auxiliary transfer learning task. A distillation method of privilege learning is tied to a variational auto-encoder framework for the transfer learning task between the low- and high-quality domains. Combined with the self-supervised way for better state and action representations, we propose a deep RL method called high-risk uncertainty (HRU) control to provide flexibility on the trade-off between the effectiveness and accuracy of ambiguous samples and to reduce the expected mortality. Experiments on the large-scale publicly available real-world dataset MIMIC-III demonstrate that our model reduces the estimated mortality rate by up to 2.3% in total, and that the estimated mortality rate in the majority of cases is reduced to 9.5%.
引用
收藏
页码:926 / 939
页数:14
相关论文
共 37 条
  • [1] Designing a pilot sequential multiple assignment randomized trial for developing an adaptive treatment strategy
    Almirall, Daniel
    Compton, Scott N.
    Gunlicks-Stoessel, Meredith
    Duan, Naihua
    Murphy, Susan A.
    [J]. STATISTICS IN MEDICINE, 2012, 31 (17) : 1887 - 1902
  • [2] A Reinforcement Learning Approach for Solving the Mean Variance Customer Portfolio in Partially Observable Models
    Asiain, Erick
    Clempner, Julio B.
    Poznyak, Alexander S.
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (08)
  • [3] Bajor JacekM., 2017, ICLR
  • [4] A disease diagnosis and treatment recommendation system based on big data mining and cloud computing
    Chen, Jianguo
    Li, Kenli
    Rong, Huigui
    Bilal, Kashif
    Yang, Nan
    Li, Keqin
    [J]. INFORMATION SCIENCES, 2018, 435 : 124 - 149
  • [5] A Physician Advisory System for Chronic Heart Failure management based on knowledge patterns
    Chen, Zhuo
    Marple, Kyle
    Salazar, Elmer
    Gupta, Gopal
    Tamil, Lakshman
    [J]. THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2016, 16 : 604 - 618
  • [6] Futoma J, 2017, P 2 MACH LEARN HEALT PMLR, P243
  • [7] A Pilot SMART for Developing an Adaptive Treatment Strategy for Adolescent Depression
    Gunlicks-Stoessel, Meredith
    Mufson, Laura
    Westervelt, Ana
    Almirall, Daniel
    Murphy, Susan
    [J]. JOURNAL OF CLINICAL CHILD AND ADOLESCENT PSYCHOLOGY, 2016, 45 (04) : 480 - 494
  • [8] Hendrycks D, 2019, P INT C LEARN REPR
  • [9] Hinton G., 2015, COMPUT SCI, DOI DOI 10.4140/TCP.N.2015.249
  • [10] Jiang N, 2016, PR MACH LEARN RES, V48