Sample size recommendations for studies on reliability and measurement error: an online application based on simulation studies

Cited by: 45
Authors
Mokkink, Lidwine B. [1, 2]
de Vet, Henrica [1, 2]
Diemeer, Susanne [1]
Eekhout, Iris [1, 2, 3]
Affiliations
[1] Vrije Univ Amsterdam, Dept Epidemiol & Data Sci, Amsterdam UMC, Amsterdam, Netherlands
[2] Amsterdam Publ Hlth Res Inst, Amsterdam, Netherlands
[3] Netherlands Org Appl Sci Res, Child Hlth, Leiden, Netherlands
Keywords
Sample size recommendations; Simulation study; Reliability; Measurement error; Repeated measurements; Outcome measurement instruments; INTRACLASS; REQUIREMENTS; INTERVAL; DESIGN;
DOI
10.1007/s10742-022-00293-9
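Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)];
Subject classification code
Abstract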
Simulation studies were performed to investigate under which conditions of patient sample size (n) and number of repeated measurements (k) (e.g., raters) optimal (i.e., balancing precision and efficiency) estimates of intraclass correlation coefficients (ICCs) and standard errors of measurement (SEMs) can be achieved. Subsequently, we developed an online application that shows the implications for decisions about sample sizes in reliability studies. We simulated scores for repeated measurements of patients under different conditions of n, k, the correlation between scores on repeated measurements (r), the variance between patients' test scores (v), and the presence of systematic differences within k. The performance of the reliability parameters (based on one-way and two-way effects models) was assessed by calculating bias, mean squared error (MSE), and the coverage and width of the confidence intervals (CIs). We showed that the gain in precision (i.e., largest change in MSE) of the ICC and SEM parameters diminishes at larger values of n or k. Next, we showed that the correlation and the presence of systematic differences have the most influence on the MSE values, the coverage, and the CI width. This influence differed between the models. As measurements can be expensive and burdensome for patients and professionals, we recommend using an efficient design, in terms of the sample size and number of repeated measurements, to arrive at precise ICC and SEM estimates. Using these results, we developed a user-friendly online application to help decide on the optimal design, as 'one size fits all' does not hold.
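The kind of simulation described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code or their online application. It covers only the one-way random-effects model without systematic differences, and it assumes that v is the variance of the patients' true scores and that r equals the true one-way ICC, so the error variance is set to v(1 - r)/r. The function names simulate_scores, icc_sem_oneway, and performance are hypothetical.

```python
import numpy as np


def simulate_scores(n, k, r, v, rng):
    """Draw an n-by-k score matrix: patient effect plus independent error.

    Assumption (not from the source): v is the true-score variance and
    r is the true one-way ICC, so error variance = v * (1 - r) / r.
    """
    error_var = v * (1.0 - r) / r
    patient = rng.normal(0.0, np.sqrt(v), size=(n, 1))
    error = rng.normal(0.0, np.sqrt(error_var), size=(n, k))
    return patient + error


def icc_sem_oneway(scores):
    """One-way random-effects ICC and SEM from ANOVA mean squares."""
    scores = np.asarray(scores, dtype=float)
    n, k = scores.shape
    row_means = scores.mean(axis=1)
    # Between-patient and within-patient mean squares.
    msb = k * row_means.var(ddof=1)
    msw = ((scores - row_means[:, None]) ** 2).sum() / (n * (k - 1))
    icc = (msb - msw) / (msb + (k - 1) * msw)
    return icc, np.sqrt(msw)


def performance(n, k, r, v, n_rep=500, seed=1):
    """Bias and MSE of the ICC and SEM estimates over n_rep replications."""
    rng = np.random.default_rng(seed)
    est = np.array(
        [icc_sem_oneway(simulate_scores(n, k, r, v, rng)) for _ in range(n_rep)]
    )
    truth = np.array([r, np.sqrt(v * (1.0 - r) / r)])
    bias = est.mean(axis=0) - truth
    mse = ((est - truth) ** 2).mean(axis=0)
    return {
        "icc_bias": bias[0],
        "icc_mse": mse[0],
        "sem_bias": bias[1],
        "sem_mse": mse[1],
    }


if __name__ == "__main__":
    # Compare the MSE of the ICC estimate across a few sample sizes.
    for n in (10, 25, 50, 100):
        result = performance(n=n, k=3, r=0.7, v=1.0)
        print(n, round(result["icc_mse"], 4))
```

Running performance over a grid of n and k shows how quickly the MSE shrinks as either grows. Coverage and width of confidence intervals, two-way models, and systematic differences between repeated measurements would need additional code that is not sketched here.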
Pages: 241-265
Number of pages: 25
Related papers
28 in total
[1]  
Bland J M., 2004, How Can I Decide the Sample Size for a Study of Agreement between Two Methods of Measurement?
[2]   The design of simulation studies in medical statistics [J].
Burton, Andrea ;
Altman, Douglas G. ;
Royston, Patrick ;
Holder, Roger L. .
STATISTICS IN MEDICINE, 2006, 25 (24) :4279-4292
[3]  
De Vet H. C. W., 2011, MEASUREMENT MED
[4]   When to use agreement versus reliability measures [J].
de Vet, Henrica C. W. ;
Terwee, Caroline B. ;
Knol, Dirk L. ;
Bouter, Lex M. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (10) :1033-1039
[5]  
Dikmans REG, 2017, PRS-GLOB OPEN, V5, DOI 10.1097/GOX.0000000000001254
[6]   SAMPLE-SIZE REQUIREMENTS FOR RELIABILITY STUDIES [J].
DONNER, A ;
ELIASZIW, M .
STATISTICS IN MEDICINE, 1987, 6 (04) :441-448
[7]  
Eekhout I., 2022, ICC POWER SHINY APPI
[8]  
Eekhout I., 2022, AGREE AGREEMENT RELI
[9]  
Eekhout I., 2022, ESTIMATING ICCS SEMS
[10]   Analyzing Incomplete Item Scores in Longitudinal Data by Including Item Score Information as Auxiliary Variables [J].
Eekhout, Iris ;
Enders, Craig K. ;
Twisk, Jos W. R. ;
de Boer, Michiel R. ;
de Vet, Henrica C. W. ;
Heymans, Martijn W. .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2015, 22 (04) :588-602