Personalized Treatment Policies with the Novel Buckley-James Q-Learning Algorithm

Cited by: 3
Authors
Lee, Jeongjin [1]
Kim, Jong-Min [2]
Affiliations
[1] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[2] Univ Minnesota Morris, Stat Discipline, Morris, MN 56267 USA
Keywords
Q-learning; reinforcement learning; precision medicine; Buckley-James method; survival analysis; dynamic treatment regimes; linear regression; survival
DOI
10.3390/axioms13040212
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
This research paper presents the Buckley-James Q-learning (BJ-Q) algorithm, a cutting-edge method designed to optimize personalized treatment strategies, especially in the presence of right censoring. We critically assess the algorithm's effectiveness in improving patient outcomes and its resilience across various scenarios. Central to our approach is the innovative use of the survival time to impute the reward in Q-learning, employing the Buckley-James method for enhanced accuracy and reliability. Our findings highlight the significant potential of personalized treatment regimens and introduce the BJ-Q learning algorithm as a viable and promising approach. This work marks a substantial advancement in our comprehension of treatment dynamics and offers valuable insights for augmenting patient care in the ever-evolving clinical landscape.
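The idea of imputing a censored survival-time reward before fitting a Q-function can be illustrated with a minimal single-stage sketch. This is not the authors' implementation: the linear Q-function, the use of log survival time as the reward, the simulated data, the convention of treating the largest residual as an event, and the helper names `km_jump_masses`, `bj_impute`, `bj_fit`, and `recommend` are all assumptions made for illustration.

```python
import numpy as np

def km_jump_masses(resid, delta):
    """Kaplan-Meier probability mass at each residual (0 at censored points)."""
    order = np.argsort(resid, kind="stable")
    d = delta[order].astype(float)
    d[-1] = 1.0  # assumption: treat the largest residual as an observed event
    n = len(resid)
    at_risk = n - np.arange(n)             # number still at risk at each sorted point
    surv = np.cumprod(1.0 - d / at_risk)   # KM survival just after each point
    surv_prev = np.concatenate(([1.0], surv[:-1]))
    mass = np.empty(n)
    mass[order] = surv_prev - surv         # size of the KM jump at each point
    return mass

def bj_impute(X, y, delta, beta):
    """Replace each censored response by its estimated conditional mean."""
    resid = y - X @ beta
    mass = km_jump_masses(resid, delta)
    y_star = y.copy()
    for i in np.where(delta == 0)[0]:
        later = resid > resid[i]
        tail = mass[later].sum()
        if tail > 0:  # otherwise keep the censored value unchanged
            y_star[i] = X[i] @ beta + (mass[later] * resid[later]).sum() / tail
    return y_star

def bj_fit(X, y, delta, n_iter=50, tol=1e-6):
    """Alternate imputation and least squares until beta stabilises or n_iter is hit."""
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    for _ in range(n_iter):
        y_star = bj_impute(X, y, delta, beta)
        new_beta = np.linalg.lstsq(X, y_star, rcond=None)[0]
        converged = np.max(np.abs(new_beta - beta)) < tol
        beta = new_beta
        if converged:
            break
    return beta

def recommend(beta, x):
    """Treatment (+1 or -1) maximising the linear Q-function at covariate x."""
    return 1 if beta[2] + beta[3] * x > 0 else -1

# Simulated data (hypothetical): the better treatment has the same sign as x.
rng = np.random.default_rng(0)
n = 500
x = rng.uniform(-1.0, 1.0, n)
a = rng.choice([-1.0, 1.0], size=n)
log_t = 1.0 + 0.5 * x + 0.8 * a * x + rng.normal(0.0, 0.3, n)  # log survival time
log_c = rng.normal(1.5, 0.5, n)                                 # log censoring time
y = np.minimum(log_t, log_c)
delta = (log_t <= log_c).astype(int)  # 1 = event observed, 0 = right-censored

# Q(x, a) = b0 + b1*x + b2*a + b3*a*x, fitted on the imputed reward.
X = np.column_stack([np.ones(n), x, a, a * x])
beta = bj_fit(X, y, delta)
print("coefficients:", beta)
print("recommended treatment at x=0.5:", recommend(beta, 0.5))
```

In this sketch the Buckley-James iteration serves as both the imputation step and the Q-function regression; a multi-stage regime would repeat the fit backward through the stages, which is omitted here.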
Pages: 12