Intermediate and advanced topics in multilevel logistic regression analysis

被引:439
作者
Austin, Peter C. [1 ,2 ,3 ]
Merlo, Juan [4 ,5 ]
机构
[1] Inst Clin Evaluat Sci, G106,2075 Bayview Ave, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Inst Hlth Management Policy & Evaluat, Toronto, ON, Canada
[3] Sunnybrook Res Inst, Schulich Heart Res Program, Toronto, ON, Canada
[4] Lund Univ, Unit Social Epidemiol, Fac Med, Malmo, Sweden
[5] Ctr Primary Hlth Care Res, Malmo, Region Skane, Sweden
基金
加拿大健康研究院; 瑞典研究理事会;
关键词
multilevel analysis; hierarchical models; logistic regression; multilevel models; clustered data; BRIEF CONCEPTUAL TUTORIAL; LINEAR MIXED MODELS; SOCIAL EPIDEMIOLOGY; CONTEXTUAL PHENOMENA; EXPLAINED VARIATION; INDIVIDUAL HEALTH; MORTALITY; VARIANCE; CLUSTER;
D O I
10.1002/sim.7336
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R-2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. (C) 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
引用
收藏
页码:3257 / 3277
页数:21
相关论文
共 52 条
  • [1] [Anonymous], 2011, MULTILEVEL STAT MODE
  • [2] [Anonymous], 1993, STAT DISTRIBUTIONS
  • [3] [Anonymous], 2002, HIERARCHICAL LINEAR
  • [4] Austin PC., 2017, INT STAT REV
  • [5] The median hazard ratio: a useful measure of variance and general contextual effects in multilevel survival analysis
    Austin, Peter C.
    Wagner, Philippe
    Merlo, Juan
    [J]. STATISTICS IN MEDICINE, 2017, 36 (06) : 928 - 938
  • [6] Hospital volume and surgical mortality in the United States.
    Birkmeyer, JD
    Siewers, AE
    Finlayson, EVA
    Stukel, TA
    Lucas, FL
    Batista, I
    Welch, HG
    Wennberg, DE
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2002, 346 (15) : 1128 - 1137
  • [7] Variance partitioning in multilevel logistic models that exhibit overdispersion
    Browne, WJ
    Subramanian, SV
    Jones, K
    Goldstein, H
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2005, 168 : 599 - 613
  • [8] A Monte Carlo EM algorithm for generalized linear mixed models with flexible random effects distribution
    Chen, JL
    Zhang, DW
    Davidian, M
    [J]. BIOSTATISTICS, 2002, 3 (03) : 347 - 360
  • [9] DIEHR P, 1990, HEALTH SERV RES, V24, P741
  • [10] Donner A., 2010, Design and analysis of cluster randomization trials in health research