Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

Cited by: 1

Authors:

Charvadeh, Yasin Khadem ^{[1]}

Yi, Grace Y. ^{[1,2,3]}

Affiliations:

[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada

[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada

[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada

Source:

STATISTICS IN MEDICINE | 2024 / Vol. 43 / Issue 03

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;

DOI:

10.1002/sim.9973

Chinese Library Classification (CLC):

Q [Biological Sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Research on dynamic treatment regimes has attracted extensive interest. Many methods have been proposed in the literature; however, they are vulnerable to misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.


Pages: 578-605

Page count: 28

