Robust optimal designs using a model misspecification term

被引:1
作者
Tsirpitzi, Renata Eirini [1 ]
Miller, Frank [1 ,2 ]
Burman, Carl-Fredrik [3 ,4 ]
机构
[1] Stockholm Univ, Dept Stat, SE-10691 Stockholm, Sweden
[2] Linkoping Univ, Dept Comp & Informat Sci, SE-58183 Linkoping, Sweden
[3] AstraZeneca, Early Biometr & Stat Innovat Data Sci & Artificial, Gothenburg, Sweden
[4] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
关键词
Fedorov algorithm; Gaussian process; Mixed-effects model; Optimal experimental design; Statistical modelling; SAMPLING DESIGNS; REGRESSION;
D O I
10.1007/s00184-023-00893-6
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Much of classical optimal design theory relies on specifying a model with only a small number of parameters. In many applications, such models will give reasonable approximations. However, they will often be found not to be entirely correct when enough data are at hand. A property of classical optimal design methodology is that the amount of data does not influence the design when a fixed model is used. However, it is reasonable that a low dimensional model is satisfactory only if limited data is available. With more data available, more aspects of the underlying relationship can be assessed. We consider a simple model that is not thought to be fully correct. The model misspecification, that is, the difference between the true mean and the simple model, is explicitly modeled with a stochastic process. This gives a unified approach to handle situations with both limited and rich data. Our objective is to estimate the combined model, which is the sum of the simple model and the assumed misspecification process. In our situation, the low-dimensional model can be viewed as a fixed effect and the misspecification term as a random effect in a mixed-effects model. Our aim is to predict within this model. We describe how we minimize the prediction error using an optimal design. We compute optimal designs for the full model in different cases. The results confirm that the optimal design depends strongly on the sample size. In low-information situations, traditional optimal designs for models with a small number of parameters are sufficient, while the inclusion of the misspecification term lead to very different designs in data-rich cases.
引用
收藏
页码:781 / 804
页数:24
相关论文
共 42 条
[1]   SOME EXACT OPTIMAL DESIGNS FOR LINEAR COVARIANCE FUNCTIONS IN ONE DIMENSION [J].
ABT, M .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1992, 21 (07) :2059-2069
[2]  
[Anonymous], 1996, Stochastic Processes
[3]  
ATKINSON A. C., 2007, Oxford Statistical Science Series, V34
[4]   Optimal designs for testing the functional form of a regression via nonparametric estimation techniques [J].
Biedermann, S ;
Dette, H .
STATISTICS & PROBABILITY LETTERS, 2001, 52 (02) :215-224
[5]   Exact asymptotics for boundary crossings of the Brownian bridge with trend with application to the Kolmogorov test [J].
Bischoff, W ;
Hashorva, E ;
Hüsler, J ;
Miller, F .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2003, 55 (04) :849-864
[6]   Optimal designs which are efficient for lack of fit tests [J].
Bischoff, Wolfgang ;
Miller, Frank .
ANNALS OF STATISTICS, 2006, 34 (04) :2015-2025
[7]   BAYESIAN APPROACH TO MODEL INADEQUACY FOR POLYNOMIAL REGRESSION [J].
BLIGHT, BJN ;
OTT, L .
BIOMETRIKA, 1975, 62 (01) :79-88
[8]   Brownian bridge [J].
Chow, Winston C. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2009, 1 (03) :325-332
[9]  
Christensen R., 2002, Plane Answers to Complex Questions: The Theory of Linear Models, DOI [10.1007/978-0-387-21544-0, DOI 10.1007/978-0-387-21544-0]
[10]   Active learning with statistical models [J].
Cohn, DA ;
Ghahramani, Z ;
Jordan, MI .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145