Generalisations of a Bayesian decision-theoretic randomisation procedure and the impact of delayed responses

被引:8
作者
Williamson, S. Faye [1 ,2 ]
Jacko, Peter [3 ]
Jaki, Thomas [1 ,4 ]
机构
[1] Univ Lancaster, Dept Math & Stat, Lancaster, England
[2] Newcastle Univ, Populat Hlth Sci Inst, Biostat Res Grp, Newcastle Upon Tyne, England
[3] Univ Lancaster, Dept Management Sci, Lancaster, England
[4] Univ Cambridge, MRC Biostat Unit, Cambridge, England
基金
英国医学研究理事会; 英国工程与自然科学研究理事会;
关键词
Bayesian decision-theoretic model; Clinical trials; Delayed responses; Dynamic programming; Response-adaptive randomisation; BIASED COIN DESIGNS; CLINICAL-TRIALS; ADAPTIVE RANDOMIZATION; GITTINS INDEX; ASYMPTOTIC PROPERTIES; ISCHEMIC-STROKE; WINNER RULE; BANDIT; TESTS; MODEL;
D O I
10.1016/j.csda.2021.107407
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The design of sequential experiments and, in particular, randomised controlled trials involves a trade-off between operational characteristics such as statistical power, estimation bias and patient benefit. The family of randomisation procedures referred to as Constrained Randomised Dynamic Programming (CRDP), which is set in the Bayesian decision-theoretic framework, can be used to balance these competing objectives. A generalisation and novel interpretation of CRDP is proposed to highlight its inherent flexibility to adapt to a variety of practicalities and align with individual trial objectives. CRDP, as with most responseadaptive randomisation procedures, hinges on the limiting assumption of patient responses being available before allocation of the next patient. This forms one of the greatest barriers to their implementation in practice which, despite being an important research question, has not received a thorough treatment. Therefore, motivated by the existing gap between the theory of response-adaptive randomisation (which is abundant with proposed methods in the immediate response setting) and clinical practice (in which responses are typically delayed), the performance of CRDP in the presence of fixed and random delays is evaluated. Simulation results show that CRDP continues to offer patient benefit gains over alternative procedures and is relatively robust to delayed responses. To compensate for a fixed delay, a method which adjusts the time horizon used in the optimisation objective is proposed and its performance illustrated. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:26
相关论文
共 81 条
[1]   An Approximation Approach for Response-Adaptive Clinical Trial Design [J].
Ahuja, Vishal ;
Birge, John R. .
INFORMS JOURNAL ON COMPUTING, 2020, 32 (04) :877-894
[2]  
Alban A, 2018, WINT SIMUL C PROC, P2459, DOI 10.1109/WSC.2018.8632500
[3]  
[Anonymous], 1989, Learning from delayed rewards: A foundation of reinforcement learning
[4]   THE SEARCH FOR OPTIMALITY IN CLINICAL-TRIALS [J].
ARMITAGE, P .
INTERNATIONAL STATISTICAL REVIEW, 1985, 53 (01) :15-24
[5]  
Atkinson AC., 2014, RANDOMISED RESPONSE
[6]  
Bai ZD, 2002, ANN STAT, V30, P122
[7]  
Bandyopadhyay U., 1996, Calcutta Statistical Association Bulletin, V46, P69, DOI [10.1177/0008068319960107, DOI 10.1177/0008068319960107]
[8]  
Bellman R, 1956, SANKHYA, V16, P221
[9]  
Berry D.A., 1996, BAYESIAN BIOSTATISTI, P3
[10]   Generalized delayed response in randomized play-the-winner rule [J].
Biswas, A .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2003, 32 (01) :259-274