Learning to search efficiently for causally near-optimal treatments

被引：0

作者：

Hakansson, Samuel ^{[1
,2
]}

Lindblom, Viktor ^{[2
]}

Gottesman, Omer ^{[3
,4
]}

Johansson, Fredrik D. ^{[2
]}

机构：

[1] Gothenburg Univ, Gothenburg, Sweden

[2] Chalmers Univ Technol, Gothenburg, Sweden

[3] Brown Univ, Providence, RI USA

[4] Harvard Univ, Cambridge, MA USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Finding an effective medical treatment often requires a search by trial and error. Making this search more efficient by minimizing the number of unnecessary trials could lower both costs and patient suffering. We formalize this problem as learning a policy for finding a near-optimal treatment in a minimum number of trials using a causal inference framework. We give a model-based dynamic programming algorithm which learns from observational data while being robust to unmeasured confounding. To reduce time complexity, we suggest a greedy algorithm which bounds the near-optimality constraint. The methods are evaluated on synthetic and real-world healthcare data and compared to model-free reinforcement learning. We find that our methods compare favorably to the model-free baseline while offering a more transparent trade-off between search time and treatment efficacy.

引用

页数：12

共 37 条

[1] Reinforcement learning with immediate rewards and linear hypotheses [J].

Abe, N ;

Biermann, AW ;

Long, PM .

ALGORITHMICA, 2003, 37 (04) :263-293

[2]

[Anonymous], 2010, Depression: the treatment and management of depression in adults

[3]

[Anonymous], 2016, INT C MACH LEARN

[4]

Arevalo-Rodriguez I., 2015, COCHRANE DATABASE SY, V2015

[5]

Boominathan S., 2020, ARXIV200600927

[6]

Chakaravarthy V. T., 2007, P 26 ACM SIGMOD SIGA, P53

[7]

Chu Wei, 2011, JMLR WORKSHOP C P, P208

[8]

Dellinger RP, 2013, INTENS CARE MED, V39, P165, DOI [10.1007/s00134-012-2769-8, 10.1097/CCM.0b013e31827e83af]

[9]

Garcia F., 1998, P 15 INT C MACHINE L

[10]

Ghosh D., 2019, MACHINE LEARNING BAS

← 1 2 3 4 →