Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引：1

作者：

Charvadeh, Yasin Khadem ^{[1
]}

Yi, Grace Y. ^{[1
,2
,3
]}

机构：

[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada

[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada

[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada

来源：

STATISTICS IN MEDICINE | 2024年 / 43卷 / 03期

基金：

加拿大自然科学与工程研究理事会;

关键词：

dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;

D O I：

10.1002/sim.9973

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.

引用

页码：578 / 605

页数：28

共 50 条

[21] ENTROPY LEARNING FOR DYNAMIC TREATMENT REGIMES
Jiang, Binyan
Song, Rui
Li, Jialiang
Zeng, Donglin
Lu, Wenbin
He, Xin
Xu, Shirong
Wang, Junhui
Qian, Min
Cheng, Bin
Qiu, Hongxiang
Luedtke, Alex
van der Laan, Mark
Wager, Stefan
Zhang, Yichi
Laber, Eric B.
Kallus, Nathan
STATISTICA SINICA, 2019, 29 (04) : 1633 - 1710
[22] Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
Bertsekas, Dimitri P.
Yu, Huizhen
MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (01) : 66 - 94
[23] Dynamic feature selection algorithm based on Q-learning mechanism
Xu, Ruohao
Li, Mengmeng
Yang, Zhongliang
Yang, Lifang
Qiao, Kangjia
Shang, Zhigang
APPLIED INTELLIGENCE, 2021, 51 (10) : 7233 - 7244
[24] Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access
Nguyen, Ha Q.
Nguyen, Binh T.
Dong, Trung Q.
Ngo, Dat T.
Nguyen, Tuan A.
2018 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2018,
[25] Dynamic Pricing Decision for Perishable Goods: A Q-learning Approach
Cheng, Yan
2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11965 - 11969
[26] Optimizing the Impact of Musical Education on Mental Health of Students using Q-Learning
Bing, Yang
MOBILE NETWORKS & APPLICATIONS, 2024,
[27] Optimizing Handover Parameters by Q-Learning for Heterogeneous Radio-Optical Networks
Shao, Sihua
Liu, Guanxiong
Khreishah, Abdallah
Ayyash, Moussa
Elgala, Hany
Little, Thomas D. C.
Rahaim, Michael
IEEE PHOTONICS JOURNAL, 2020, 12 (01):
[28] A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning
Hao, Bing
Du, He
Yan, Zheping
OCEAN ENGINEERING, 2023, 270
[29] Mounting of auction agent under dynamic environment by Q-learning and SARSA learning
Katou, T
Nagasaka, K
7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 472 - 475
[30] Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments
Gupta, Anmol
Roy, Partha Pratim
Dutt, Varun
IEEE ACCESS, 2021, 9 : 138775 - 138790

← 1 2 3 4 5 →