A self-supervised method for treatment recommendation in sepsis

Cited by: 4

Authors:

Zhu, Sihan [1]

Pu, Jian [1,2]

Affiliations:

[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China

[2] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200433, Peoples R China

Source:

FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING | 2021, Volume 22, Issue 7

Funding:

National Natural Science Foundation of China

Keywords:

Treatment recommendation; Sepsis; Self-supervised learning; Reinforcement learning; Electronic health records; TP391; 4; SYSTEM;

DOI:

10.1631/FITEE.2000127

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Sepsis treatment is a highly challenging effort to reduce mortality in hospital intensive care units since the treatment response may vary for each patient. Tailored treatment recommendations are desired to assist doctors in making decisions efficiently and accurately. In this work, we apply a self-supervised method based on reinforcement learning (RL) for treatment recommendation on individuals. An uncertainty evaluation method is proposed to separate patient samples into two domains according to their responses to treatments and the state value of the chosen policy. Examples of two domains are then reconstructed with an auxiliary transfer learning task. A distillation method of privilege learning is tied to a variational auto-encoder framework for the transfer learning task between the low- and high-quality domains. Combined with the self-supervised way for better state and action representations, we propose a deep RL method called high-risk uncertainty (HRU) control to provide flexibility on the trade-off between the effectiveness and accuracy of ambiguous samples and to reduce the expected mortality. Experiments on the large-scale publicly available real-world dataset MIMIC-III demonstrate that our model reduces the estimated mortality rate by up to 2.3% in total, and that the estimated mortality rate in the majority of cases is reduced to 9.5%.

Pages: 926-939

Page count: 14

