Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引:1
|
作者
Charvadeh, Yasin Khadem [1 ]
Yi, Grace Y. [1 ,2 ,3 ]
机构
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada
[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;
D O I
10.1002/sim.9973
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.
引用
收藏
页码:578 / 605
页数:28
相关论文
共 50 条
  • [21] ENTROPY LEARNING FOR DYNAMIC TREATMENT REGIMES
    Jiang, Binyan
    Song, Rui
    Li, Jialiang
    Zeng, Donglin
    Lu, Wenbin
    He, Xin
    Xu, Shirong
    Wang, Junhui
    Qian, Min
    Cheng, Bin
    Qiu, Hongxiang
    Luedtke, Alex
    van der Laan, Mark
    Wager, Stefan
    Zhang, Yichi
    Laber, Eric B.
    Kallus, Nathan
    STATISTICA SINICA, 2019, 29 (04) : 1633 - 1710
  • [22] Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
    Bertsekas, Dimitri P.
    Yu, Huizhen
    MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (01) : 66 - 94
  • [23] Dynamic feature selection algorithm based on Q-learning mechanism
    Xu, Ruohao
    Li, Mengmeng
    Yang, Zhongliang
    Yang, Lifang
    Qiao, Kangjia
    Shang, Zhigang
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7233 - 7244
  • [24] Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access
    Nguyen, Ha Q.
    Nguyen, Binh T.
    Dong, Trung Q.
    Ngo, Dat T.
    Nguyen, Tuan A.
    2018 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2018,
  • [25] Dynamic Pricing Decision for Perishable Goods: A Q-learning Approach
    Cheng, Yan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11965 - 11969
  • [26] Optimizing the Impact of Musical Education on Mental Health of Students using Q-Learning
    Bing, Yang
    MOBILE NETWORKS & APPLICATIONS, 2024,
  • [27] Optimizing Handover Parameters by Q-Learning for Heterogeneous Radio-Optical Networks
    Shao, Sihua
    Liu, Guanxiong
    Khreishah, Abdallah
    Ayyash, Moussa
    Elgala, Hany
    Little, Thomas D. C.
    Rahaim, Michael
    IEEE PHOTONICS JOURNAL, 2020, 12 (01):
  • [28] A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning
    Hao, Bing
    Du, He
    Yan, Zheping
    OCEAN ENGINEERING, 2023, 270
  • [29] Mounting of auction agent under dynamic environment by Q-learning and SARSA learning
    Katou, T
    Nagasaka, K
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 472 - 475
  • [30] Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments
    Gupta, Anmol
    Roy, Partha Pratim
    Dutt, Varun
    IEEE ACCESS, 2021, 9 : 138775 - 138790