A simulation study of sample size for multilevel logistic regression models

被引:289
|
作者
Moineddin, Rahim
Matheson, Flora I.
Glazier, Richard H.
机构
[1] Univ Toronto, Dept Publ Hlth Sci, Toronto, ON, Canada
[2] St Michaels Hosp, Ctr Res Inner City Hlth, Toronto, ON M5B 1W8, Canada
[3] Univ Toronto, Dept Family & Community Med, Toronto, ON, Canada
[4] Inst Clin Evaluat Sci, Toronto, ON, Canada
关键词
ISSUES;
D O I
10.1186/1471-2288-7-34
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Many studies conducted in health and social sciences collect individual level data as outcome measures. Usually, such data have a hierarchical structure, with patients clustered within physicians, and physicians clustered within practices. Large survey data, including national surveys, have a hierarchical or clustered structure; respondents are naturally clustered in geographical units ( e. g., health regions) and may be grouped into smaller units. Outcomes of interest in many fields not only reflect continuous measures, but also binary outcomes such as depression, presence or absence of a disease, and self-reported general health. In the framework of multilevel studies an important problem is calculating an adequate sample size that generates unbiased and accurate estimates. Methods: In this paper simulation studies are used to assess the effect of varying sample size at both the individual and group level on the accuracy of the estimates of the parameters and variance components of multilevel logistic regression models. In addition, the influence of prevalence of the outcome and the intra-class correlation coefficient (ICC) is examined. Results: The results show that the estimates of the fixed effect parameters are unbiased for 100 groups with group size of 50 or higher. The estimates of the variance covariance components are slightly biased even with 100 groups and group size of 50. The biases for both fixed and random effects are severe for group size of 5. The standard errors for fixed effect parameters are unbiased while for variance covariance components are underestimated. Results suggest that low prevalent events require larger sample sizes with at least a minimum of 100 groups and 50 individuals per group. Conclusion: We recommend using a minimum group size of 50 with at least 50 groups to produce valid estimates for multi-level logistic regression models. Group size should be adjusted under conditions where the prevalence of events is low such that the expected number of events in each group should be greater than one.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A simulation study of sample size for multilevel logistic regression models
    Rahim Moineddin
    Flora I Matheson
    Richard H Glazier
    BMC Medical Research Methodology, 7
  • [2] Sample size issues in multilevel logistic regression models
    Ali, Amjad
    Ali, Sabz
    Khan, Sajjad Ahmad
    Khan, Dost Muhammad
    Abbas, Kamran
    Khalil, Alamgir
    Manzoor, Sadaf
    Khalil, Umair
    PLOS ONE, 2019, 14 (11):
  • [3] Sufficient Sample Size and Power in Multilevel Ordinal Logistic Regression Models
    Ali, Sabz
    Ali, Amjad
    Khan, Sajjad Ahmad
    Hussain, Sundas
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2016, 2016
  • [4] Sample size determination for logistic regression: A simulation study
    School of Mathematical Sciences, University of Technology Sydney, Broadway, Australia
    Commun. Stat. Simul. Comput., 2 (360-373):
  • [5] Sample Size Determination for Logistic Regression: A Simulation Study
    Bush, Stephen
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2015, 44 (02) : 360 - 373
  • [6] Sample size calculations for logistic and Poisson regression models
    Shieh, G
    BIOMETRIKA, 2001, 88 (04) : 1193 - 1199
  • [7] Validation and updating of predictive logistic regression models: a study on sample size and shrinkage
    Steyerberg, EW
    Borsboom, GJJM
    van Houwelingen, HC
    Eijkemans, MJC
    Habbema, JDF
    STATISTICS IN MEDICINE, 2004, 23 (16) : 2567 - 2586
  • [8] Sample size determination in logistic regression
    Alam, M. Khorshed
    Rao, M. Bhaskara
    Cheng, Fu-Chih
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2010, 72 (01): : 58 - 75
  • [9] Sample size determination for logistic regression
    Motrenko, Anastasiya
    Strijov, Vadim
    Weber, Gerhard-Wilhelm
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2014, 255 : 743 - 752
  • [10] Sample size determination in logistic regression
    Khorshed Alam M.
    Bhaskara Rao M.
    Cheng F.-C.
    Sankhya B, 2010, 72 (1) : 58 - 75