Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引:1
|
作者
Charvadeh, Yasin Khadem [1 ]
Yi, Grace Y. [1 ,2 ,3 ]
机构
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada
[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;
D O I
10.1002/sim.9973
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.
引用
收藏
页码:578 / 605
页数:28
相关论文
共 50 条
  • [31] Personalized Treatment Policies with the Novel Buckley-James Q-Learning Algorithm
    Lee, Jeongjin
    Kim, Jong-Min
    AXIOMS, 2024, 13 (04)
  • [32] A Q-learning approach based on human reasoning for navigation in a dynamic environment
    Yuan, Rupeng
    Zhang, Fuhai
    Wang, Yu
    Fu, Yili
    Wang, Shuguo
    ROBOTICA, 2019, 37 (03) : 445 - 468
  • [33] Q-Learning Based Dynamic Channel Assignment Algorithm in Cognitive Radio
    Wang, Huahua
    Wei, Yang
    Long, Yin
    ELECTRONIC INFORMATION AND ELECTRICAL ENGINEERING, 2012, 19 : 127 - 131
  • [34] Improved Q-Learning Applied to Dynamic Obstacle Avoidance and Path Planning
    Wang, Chunlei
    Yang, Xiao
    Li, He
    IEEE ACCESS, 2022, 10 : 92879 - 92888
  • [35] Q-learning Based Dynamic Optimal Relax Automatic Generation Control
    Yu, Tao
    Yuan, Ye
    Liang, Haihua
    POWER AND ENERGY ENGINEERING CONFERENCE 2010, 2010, : 797 - 800
  • [36] Dynamic Path Planning of a Mobile Robot with Improved Q-Learning algorithm
    Li, Siding
    Xu, Xin
    Zuo, Lei
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 409 - 414
  • [37] Dynamic Q-Learning for Intersection Traffic Flow Control Based on Agents
    Vista, Felipe P.
    Zhou, Xuan
    Ryu, Ji Hyoung
    Chong, Kil To
    ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 120 - 123
  • [38] Smart home's wireless sensor networks lifetime optimizing using Q-learning
    Jrhilifa, Ismael
    Ouadi, Hamid
    Jilbab, Abdelilah
    IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
  • [39] Q-learning model of insight problem solving and the effects of learning traits on creativity
    Harada, Tsutomu
    FRONTIERS IN PSYCHOLOGY, 2024, 14
  • [40] A Novel Q-Learning Assisted Dynamic Power Sharing for Dual Connectivity Scenario
    Chaudhari, Anup
    Kumar, Naveen
    Rao, Prakash
    2020 IEEE 17TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC 2020), 2020,