l2-Penalized temporal logit-mixed models for the estimation of regional obesity prevalence over time

被引:3
作者
Burgard, Jan P. [1 ]
Krause, Joscha [1 ]
Muennich, Ralf [1 ]
Morales, Domingo [2 ]
机构
[1] Trier Univ, Dept Econ & Social Stat, Trier, Germany
[2] Univ Miguel Hernandez de Elche, Ctr Operat Res, Elche, Spain
关键词
Approximate likelihood; empirical best prediction; generalized linear-mixed model; ridge regression; small area estimation; SMALL-AREA ESTIMATION; MEAN SQUARED ERROR; VARIABLE SELECTION; RIDGE-REGRESSION; POVERTY; REGULARIZATION; INDICATORS; PARAMETERS; ADULTS;
D O I
10.1177/09622802211017583
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Obesity is considered to be one of the primary health risks in modern industrialized societies. Estimating the evolution of its prevalence over time is an essential element of public health reporting. This requires the application of suitable statistical methods on epidemiologic data with substantial local detail. Generalized linear-mixed models with medical treatment records as covariates mark a powerful combination for this purpose. However, the task is methodologically challenging. Disease frequencies are subject to both regional and temporal heterogeneity. Medical treatment records often show strong internal correlation due to diagnosis-related grouping. This frequently causes excessive variance in model parameter estimation due to rank-deficiency problems. Further, generalized linear-mixed models are often estimated via approximate inference methods as their likelihood functions do not have closed forms. These problems combined lead to unacceptable uncertainty in prevalence estimates over time. We propose an l(2)-penalized temporal logit-mixed model to solve these issues. We derive empirical best predictors and present a parametric bootstrap to estimate their mean-squared errors. A novel penalized maximum approximate likelihood algorithm for model parameter estimation is stated. With this new methodology, the regional obesity prevalence in Germany from 2009 to 2012 is estimated. We find that the national prevalence ranges between 15 and 16%, with significant regional clustering in eastern Germany.
引用
收藏
页码:1744 / 1768
页数:25
相关论文
共 62 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]   Mechanism linking diabetes mellitus and obesity [J].
Al-Goblan, Abdullah S. ;
Al-Alfi, Mohammed A. ;
Khan, Muhammad Z. .
DIABETES METABOLIC SYNDROME AND OBESITY-TARGETS AND THERAPY, 2014, 7 :587-591
[3]   Poisson mixed models for studying the poverty in small areas [J].
Boubeta, Miguel ;
Jose Lombardia, Maria ;
Morales, Domingo .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 107 :32-47
[4]   Empirical best prediction under area-level Poisson mixed models [J].
Boubeta, Miguel ;
Jose Lombardia, Maria ;
Morales, Domingo .
TEST, 2016, 25 (03) :548-569
[5]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[6]  
BURGARD J, STAT PAP
[7]   An elastic net penalized small area model combining unit- and area-level data for regional hypertension prevalence estimation [J].
Burgard, J. P. ;
Krause, J. ;
Muennich, R. .
JOURNAL OF APPLIED STATISTICS, 2021, 48 (09) :1659-1674
[8]   Semiparametric small area estimation for binary outcomes with application to unemployment estimation for local authorities in the UK [J].
Chambers, Ray ;
Salvati, Nicola ;
Tzavidis, Nikos .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2016, 179 (02) :453-479
[9]   Disease mapping via negative binomial regression M-quantiles [J].
Chambers, Ray ;
Dreassi, Emanuela ;
Salvati, Nicola .
STATISTICS IN MEDICINE, 2014, 33 (27) :4805-4824
[10]  
Chen Shijie, 2012, J Indian Soc Agric Stat, V66, P121