Confidence interval width contours: Sample size planning for linear mixed-effects models

被引:1
作者
Liu Yue [1 ]
Xu Lei [1 ]
Liu Hongyun [2 ,3 ]
Han Yuting [4 ]
You Xiaofeng [5 ]
Wan Zhilin [1 ]
机构
[1] Sichuan Normal Univ, Inst Brain & Psychol Sci, Chengdu 610066, Peoples R China
[2] Beijing Normal Univ, Beijing Key Lab Appl Expt Psychol, Beijing 100875, Peoples R China
[3] Beijing Normal Univ, Fac Psychol, Beijing 100875, Peoples R China
[4] Beijing Language & Culture Univ, Sch Psychol, Beijing 100083, Peoples R China
[5] Nanchang Normal Univ, Sch Math & Informat Sci, Nanchang 360111, Jiangxi, Peoples R China
关键词
linear mixed-effects models; multilevel models; power analysis; effect size; confidence interval width; STATISTICAL POWER; ACCURACY;
D O I
10.3724/SP.J.1041.2024.00124
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Hierarchical data, which is observed frequently in psychological experiments, is usually analyzed with the linear mixed-effects models (LMEMs), as it can account for multiple sources of random effects due to participants, items, and/or predictors simultaneously. However, it is still unclear of how to determine the sample size and number of trials in LMEMs. In history, sample size planning was conducted based purely on power analysis. Later, the influential article of Maxwell et al. (2008) has made clear that sample size planning should consider statistical power and accuracy in parameter estimation (AIPE) simultaneously. In this paper, we derive a confidence interval width contours plot with the codes to generate it, providing power and AIPE information simultaneously. With this plot, sample size requirements in LMEMs based on power and AIPE criteria can be decided. We also demonstrated how to run sensitivity analysis to assess the impact of the magnitude of experiment effect size and the magnitude of random slope variance on statistical power, AIPE and the results of sample size planning. There were two sets of sensitivity analysis based on different LMEMs. Sensitivity analysis. investigated how the experiment effect size influenced power, AIPE and the requirement of sample size for within-subject experiment design, while sensitivity analysis. investigated the impact of random slope variance on optimal sample size based on power and AIPE analysis for the cross-level interaction effect. The results for binary and continuous between-subject variables were compared. In these sensitivity analysis, two factors regarding sample size varied: number of subjects (I = 10, 30, 50, 70, 100, 200, 400, 600, 800), number of trials (J = 10, 20, 30, 50, 70, 100, 150, 200, 250, 300). The additional manipulated factor was the effect size of experiment effect standard coefficient of experiment condition = 0.2, 0.5, 0.8, in sensitivity analysis I) and the magnitude of random slope variance (0.01, 0.09 and 0.25, in sensitivity analysis.). A random slope model was used in sensitivity analysis., while a random slope model with level-2 independent variable was used in sensitivity analysis II. Data-generating model and fitted model were the same. Estimation performance was evaluated in terms of convergence rate, power, AIPE for the fixed effect, AIPE for the standard error of the fixed effect, and AIPE for the random effect. The results are as following. First, there were no convergence problems under all the conditions, except that when the variance of random slope was small and a maximal model was used to fit the data. Second, power increased as sample size, number of trials or effect size increased. However, the number of trials played a key role for the power of within-subject effect, while sample size was more important for the power of cross-level effect. Power was larger for continuous between-subject variable than for binary between-subject variable. Third, although the fixed effect was accurately estimated under all the simulation conditions, the width 95% confidence interval (95% width) was extremely large under some conditions. Lastly, AIPE for the random effect increased as sample size and/or number of trials increased. The variance of residual was estimated accurately. As the variance of random slope increased, the accuracy of the estimates of variances of random intercept decreased, and the accuracy of the estimates of random slope increased. In conclusion, if sample size planning was conducted solely based on power analysis, the chosen sample size might not be large enough to obtain accurate estimates of effects size. Therefore, the rational for considering statistical power and AIPE during sample size planning was adopted. To shed light on this issue, this article provided a standard procedure based on a confidence interval width contours plot to recommend sample size and number of trials for using LMEMs. This plot visualizes the combined effect of sample size and number of trials per participant on 95% width, power and AIPE for random effects. Based on this tool and other empirical considerations, practitioners can make informed choices about how many participants to test, and how many trials to test each one for.
引用
收藏
页码:124 / +
页数:57
相关论文
共 31 条
  • [11] Sample size recommendations for continuous-time models: compensating shorter time series with larger numbers of persons and vice versa
    Hecht, Martin
    Zitzmann, Steffen
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2021, 28 (02) : 229 - 236
  • [12] Hox J.J., 2010, Multilevel analysis: Techniques and applications, V2nd ed.
  • [13] Effects of emotion on intertemporal decision-making: Explanation from the single dimension priority model
    Jiang Yuanping
    Jiang Chengming
    Hu Tianyi
    Sun Hongyue
    [J]. ACTA PSYCHOLOGICA SINICA, 2022, 54 (02) : 122 - 140
  • [14] Experiments with More Than One Random Factor: Designs, Analytic Models, and Statistical Power
    Judd, Charles M.
    Westfall, Jacob
    Kenny, David A.
    [J]. ANNUAL REVIEW OF PSYCHOLOGY, VOL 68, 2017, 68 : 601 - 625
  • [15] Accuracy in Parameter Estimation for a General Class of Effect Sizes: A Sequential Approach
    Kelley, Ken
    Darku, Francis Bilson
    Chattopadhyay, Bhargab
    [J]. PSYCHOLOGICAL METHODS, 2018, 23 (02) : 226 - 243
  • [16] Sample size planning for the standardized mean difference: Accuracy in parameter estimation via narrow confidence intervals
    Kelley, Ken
    Rausch, Joseph R.
    [J]. PSYCHOLOGICAL METHODS, 2006, 11 (04) : 363 - 385
  • [17] A longitudinal multilevel CFA-MTMM model for interchangeable and structurally different methods
    Koch, Tobias
    Schultze, Martin
    Eid, Michael
    Geiser, Christian
    [J]. FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [18] Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R
    Kumle, Leah
    Vo, Melissa L. -H.
    Draschkow, Dejan
    [J]. BEHAVIOR RESEARCH METHODS, 2021, 53 (06) : 2528 - 2543
  • [19] Sample size planning for statistical power and accuracy in parameter estimation
    Maxwell, Scott E.
    Kelley, Ken
    Rausch, Joseph R.
    [J]. ANNUAL REVIEW OF PSYCHOLOGY, 2008, 59 : 537 - 563
  • [20] Summary-Statistics-Based Power Analysis: A New and Practical Method to Determine Sample Size for Mixed-Effects Modeling
    Murayama, Kou
    Usami, Satoshi
    Sakaki, Michiko
    [J]. PSYCHOLOGICAL METHODS, 2022, 27 (06) : 1014 - 1038