Learning to search efficiently for causally near-optimal treatments

被引:0
作者
Hakansson, Samuel [1 ,2 ]
Lindblom, Viktor [2 ]
Gottesman, Omer [3 ,4 ]
Johansson, Fredrik D. [2 ]
机构
[1] Gothenburg Univ, Gothenburg, Sweden
[2] Chalmers Univ Technol, Gothenburg, Sweden
[3] Brown Univ, Providence, RI USA
[4] Harvard Univ, Cambridge, MA USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding an effective medical treatment often requires a search by trial and error. Making this search more efficient by minimizing the number of unnecessary trials could lower both costs and patient suffering. We formalize this problem as learning a policy for finding a near-optimal treatment in a minimum number of trials using a causal inference framework. We give a model-based dynamic programming algorithm which learns from observational data while being robust to unmeasured confounding. To reduce time complexity, we suggest a greedy algorithm which bounds the near-optimality constraint. The methods are evaluated on synthetic and real-world healthcare data and compared to model-free reinforcement learning. We find that our methods compare favorably to the model-free baseline while offering a more transparent trade-off between search time and treatment efficacy.
引用
收藏
页数:12
相关论文
共 37 条
[1]   Reinforcement learning with immediate rewards and linear hypotheses [J].
Abe, N ;
Biermann, AW ;
Long, PM .
ALGORITHMICA, 2003, 37 (04) :263-293
[2]  
[Anonymous], 2010, Depression: the treatment and management of depression in adults
[3]  
[Anonymous], 2016, INT C MACH LEARN
[4]  
Arevalo-Rodriguez I., 2015, COCHRANE DATABASE SY, V2015
[5]  
Boominathan S., 2020, ARXIV200600927
[6]  
Chakaravarthy V. T., 2007, P 26 ACM SIGMOD SIGA, P53
[7]  
Chu Wei, 2011, JMLR WORKSHOP C P, P208
[8]  
Dellinger RP, 2013, INTENS CARE MED, V39, P165, DOI [10.1007/s00134-012-2769-8, 10.1097/CCM.0b013e31827e83af]
[9]  
Garcia F., 1998, P 15 INT C MACHINE L
[10]  
Ghosh D., 2019, MACHINE LEARNING BAS