C-learning: A new classification framework to estimate optimal dynamic treatment regimes

被引：20

作者：

Zhang, Baqun ^{[1
]}

Zhang, Min ^{[2
]}

机构：

[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China

[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA

来源：

BIOMETRICS | 2018年 / 74卷 / 03期

基金：

中国国家自然科学基金;

关键词：

A-learning; Augmented inverse probability weighted estimator; CART; Dynamic treatment regime; Precision medicine; Q-learning; INDIVIDUALIZED TREATMENT RULES; DECISIONS; DESIGN;

D O I：

10.1111/biom.12836

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

A dynamic treatment regime is a sequence of decision rules, each corresponding to a decision point, that determine that next treatment based on each individual's own available characteristics and treatment history up to that point. We show that identifying the optimal dynamic treatment regime can be recast as a sequential optimization problem and propose a direct sequential optimization method to estimate the optimal treatment regimes. In particular, at each decision point, the optimization is equivalent to sequentially minimizing a weighted expected misclassification error. Based on this classification perspective, we propose a powerful and flexible C-learning algorithm to learn the optimal dynamic treatment regimes backward sequentially from the last stage until the first stage. C-learning is a direct optimization method that directly targets optimizing decision rules by exploiting powerful optimization/classification techniques and it allows incorporation of patient's characteristics and treatment history to improve performance, hence enjoying advantages of both the traditional outcome regression-based methods (Q- and A-learning) and the more recent direct optimization methods. The superior performance and flexibility of the proposed methods are illustrated through extensive simulation studies.

引用

页码：891 / 899

页数：9

共 23 条

[1] A general statistical framework for subgroup identification and comparative treatment scoring [J].

Chen, Shuai ;

Tian, Lu ;

Cai, Tianxi ;

Yu, Menggang .

BIOMETRICS, 2017, 73 (04) :1199-1209

[2] On optimal treatment regimes selection for mean survival time [J].

Geng, Yuan ;

Zhang, Hao Helen ;

Lu, Wenbin .

STATISTICS IN MEDICINE, 2015, 34 (07) :1169-1184

[3]

Goldberg DE., 1989, Genetic algorithms in search, optimization and machine learning

[4]

Kang C, 2014, BIOMETRICS, V70, P695, DOI 10.1111/biom.12191

[5]

Mebane WR, 2011, J STAT SOFTW, V42, P1

[6] Demystifying optimal dynamic treatment regimes [J].

Moodie, Erica E. M. ;

Richardson, Thomas S. ;

Stephens, David A. .

BIOMETRICS, 2007, 63 (02) :447-455

[7] An experimental design for the development of adaptive treatment strategies [J].

Murphy, SA .

STATISTICS IN MEDICINE, 2005, 24 (10) :1455-1481

[8] Optimal dynamic treatment regimes [J].

Murphy, SA .

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 :331-355

[9] PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES [J].

Qian, Min ;

Murphy, Susan A. .

ANNALS OF STATISTICS, 2011, 39 (02) :1180-1210

[10]

Robins JM, 2004, LECT NOTES STAT, V179, P189

← 1 2 3 →