Personalized Treatment Policies with the Novel Buckley-James Q-Learning Algorithm

Cited by: 3

Authors:

Lee, Jeongjin [1]

Kim, Jong-Min [2]

Affiliations:

[1] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA

[2] Univ Minnesota Morris, Stat Discipline, Morris, MN 56267 USA

Source:

AXIOMS | 2024, Vol. 13, No. 04

Keywords:

Q-learning; reinforcement learning; precision medicine; Buckley-James method; survival analysis; dynamic treatment regimes; linear regression; survival

DOI:

10.3390/axioms13040212

Chinese Library Classification (CLC) number:

O29 [Applied Mathematics]

Subject classification code:

070104

Abstract:

This research paper presents the Buckley-James Q-learning (BJ-Q) algorithm, a cutting-edge method designed to optimize personalized treatment strategies, especially in the presence of right censoring. We critically assess the algorithm's effectiveness in improving patient outcomes and its resilience across various scenarios. Central to our approach is the innovative use of the survival time to impute the reward in Q-learning, employing the Buckley-James method for enhanced accuracy and reliability. Our findings highlight the significant potential of personalized treatment regimens and introduce the BJ-Q learning algorithm as a viable and promising approach. This work marks a substantial advancement in our comprehension of treatment dynamics and offers valuable insights for augmenting patient care in the ever-evolving clinical landscape.
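The idea of imputing a censored survival reward before a Q-learning fit can be illustrated with a minimal single-stage sketch. This is not the authors' implementation: the linear Q-function, the simulated data, the fixed iteration count, and the helper names `km_mass`, `bj_impute`, `fit_bj`, and `design` are all assumptions made for illustration. The imputation step follows the standard Buckley-James form, in which a censored log-time is replaced by its fitted value plus the Kaplan-Meier conditional mean of the residuals beyond the censoring point.

```python
import numpy as np

def km_mass(resid, delta):
    """Kaplan-Meier survival and jump sizes for residuals (assumes no ties)."""
    order = np.argsort(resid, kind="stable")
    e = resid[order]
    d = delta[order].astype(float)
    d[-1] = 1.0  # common convention: treat the largest residual as uncensored
    n = len(e)
    at_risk = n - np.arange(n)            # n, n-1, ..., 1
    surv = np.cumprod(1.0 - d / at_risk)  # S(e_i) just after each residual
    surv_prev = np.concatenate(([1.0], surv[:-1]))
    return order, e, surv_prev - surv, surv

def bj_impute(y_obs, delta, X, beta):
    """Replace censored responses by X beta + E[eps | eps > observed residual]."""
    resid = y_obs - X @ beta
    order, e, mass, surv = km_mass(resid, delta)
    weighted = e * mass
    rev_cumsum = np.cumsum(weighted[::-1])[::-1]
    tail = np.append(rev_cumsum[1:], 0.0)  # sum over residuals strictly larger
    y_star = y_obs.astype(float).copy()
    for rank, idx in enumerate(order):
        if delta[idx] == 0 and surv[rank] > 0:
            y_star[idx] = X[idx] @ beta + tail[rank] / surv[rank]
    return y_star

def fit_bj(y_obs, delta, X, n_iter=20):
    """Alternate imputation and least squares for a fixed number of rounds."""
    beta = np.linalg.lstsq(X, y_obs, rcond=None)[0]
    for _ in range(n_iter):
        y_star = bj_impute(y_obs, delta, X, beta)
        beta = np.linalg.lstsq(X, y_star, rcond=None)[0]
    return beta

def design(x, a):
    """Hypothetical linear Q-function features: intercept, x, a, a*x."""
    return np.column_stack([np.ones_like(x), x, a, a * x])

# Simulated single-stage data (illustrative only): treatment helps when x > 0.
rng = np.random.default_rng(0)
n = 500
x = rng.normal(size=n)
a = rng.integers(0, 2, size=n).astype(float)
log_t = 1.0 + 0.5 * x + a * x + rng.normal(scale=0.3, size=n)
log_c = rng.normal(loc=2.0, scale=1.0, size=n)   # right-censoring times
y_obs = np.minimum(log_t, log_c)
delta = (log_t <= log_c).astype(int)             # 1 = event observed

X = design(x, a)
beta = fit_bj(y_obs, delta, X)
y_star = bj_impute(y_obs, delta, X, beta)

# Greedy policy: pick the treatment with the larger estimated Q-value.
q1 = design(x, np.ones(n)) @ beta
q0 = design(x, np.zeros(n)) @ beta
policy = (q1 > q0).astype(int)
```

A multi-stage version would repeat the same fit backward in time, using the maximized next-stage Q-value as part of the response. The sketch stops at one stage to keep the imputation step visible.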

Pages: 12

50 items in total

[31] A distributed Q-learning algorithm for multi-agent team coordination
Huang, J
Yang, B
Liu, DY
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005: 108 - 113
[32] QLLog: A log anomaly detection method based on Q-learning algorithm
Duan, Xiaoyu
Ying, Shi
Yuan, Wanli
Cheng, Hailong
Yin, Xiang
INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (03)
[33] Accelerated multi-objective task learning using modified Q-learning algorithm
Rajamohan, Varun Prakash
Jagatheesaperumal, Senthil Kumar
INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2024, 47 (01) : 28 - 37
[34] A novel dynamic integration approach for multiple load forecasts based on Q-learning algorithm
Ma, Minhua
Jin, Bingjie
Luo, Shuxin
Guo, Shaoqing
Huang, Hongwei
INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2020, 30 (07)
[35] Controlling Sequential Hybrid Evolutionary Algorithm by Q-Learning
Zhang, Haotian
Sun, Jianyong
Back, Thomas
Zhang, Qingfu
Xu, Zongben
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2023, 18 (01) : 84 - 103
[36] Anomaly Detection using Fuzzy Q-learning Algorithm
Shamshirband, Shahaboddin
Anuar, Nor Badrul
Kiah, Miss Laiha Mat
Misra, Sanjay
ACTA POLYTECHNICA HUNGARICA, 2014, 11 (08) : 5 - 28
[37] Application of Q-Learning algorithm for Traveling Salesman Problem
Hasegawa, N
Li, L
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND MANAGEMENT SCIENCES, 2002, 2 : 134 - 138
[38] A Task Scheduling Algorithm Based on Q-Learning for WSNs
Zhang, Benhong
Wu, Wensheng
Bi, Xiang
Wang, Yiming
COMMUNICATIONS AND NETWORKING, CHINACOM 2018, 2019, 262 : 521 - 530
[39] Fundamental Q-learning Algorithm in Finding Optimal Policy
Sun, Canyu
2017 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2017: 243 - 246
[40] An algorithm that excavates suboptimal states and improves Q-learning
Zhu, Canxin
Yang, Jingmin
Zhang, Wenjie
Zheng, Yifeng
ENGINEERING RESEARCH EXPRESS, 2024, 6 (04)
