Statistical models for longitudinal zero-inflated count data with applications to the substance abuse field

被引:61
作者
Buu, Anne [1 ]
Li, Runze [2 ,3 ]
Tan, Xianming [3 ]
Zucker, Robert A. [1 ]
机构
[1] Univ Michigan, Dept Psychiat, Ann Arbor, MI 48109 USA
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[3] Penn State Univ, Methodol Ctr, University Pk, PA 16802 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
hurdle model; zero-inflated Poisson model; random effect; regression spline; HURDLE MODELS; REGRESSION; TRAJECTORIES; DEPENDENCE; OUTCOMES; POISSON;
D O I
10.1002/sim.5510
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study fills in the current knowledge gaps in statistical analysis of longitudinal zero-inflated count data by providing a comprehensive review and comparison of the hurdle and zero-inflated Poisson models in terms of the conceptual framework, computational advantage, and performance under different real data situations. The design of simulations represents the special features of a well-known longitudinal study of alcoholism so that the results can be generalizable to the substance abuse field. When the hurdle model is more natural under the conceptual framework of the data, the zero-inflated Poisson model tends to produce inaccurate estimates. Model performance improves with larger sample sizes, lower proportions of missing data, and lower correlations between covariates. The simulation also shows that the computational strength of the hurdle model disappears when random effects are included. Copyright (C) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:4074 / 4086
页数:13
相关论文
共 20 条
[1]   Two-part regression models for longitudinal zero-inflated count data [J].
Alfo, Marco ;
Maruotti, Antonello .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (02) :197-216
[2]  
[Anonymous], SAS 9.2 Highlights and Resource Center
[3]  
[Anonymous], 1994, DIAGN STAT MAN
[4]   Mixture model framework facilitates understanding of zero-inflated and hurdle models for count data [J].
Baughman, A. L. .
JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2007, 17 (05) :943-946
[5]   The design of simulation studies in medical statistics [J].
Burton, Andrea ;
Altman, Douglas G. ;
Royston, Patrick ;
Holder, Roger L. .
STATISTICS IN MEDICINE, 2006, 25 (24) :4279-4292
[6]   New variable selection methods for zero-inflated count data with applications to the substance abuse field [J].
Buu, Anne ;
Johnson, Norman J. ;
Li, Runze ;
Tan, Xianming .
STATISTICS IN MEDICINE, 2011, 30 (18) :2326-2340
[7]  
Fan J., 1996, Local Polynomial Modeling and Its Applications
[8]  
Friedman J., 2001, ELEMENTS STAT LEARNI, V1, DOI [DOI 10.1007/978-0-387-21606-514, 10.1007/b94608]
[9]   Childhood and adolescent predictors of alcohol abuse and dependence in young adulthood [J].
Guo, J ;
Hawkins, JD ;
Hill, KG ;
Abbott, RD .
JOURNAL OF STUDIES ON ALCOHOL, 2001, 62 (06) :754-762
[10]   Zero-inflated Poisson and binomial regression with random effects: A case study [J].
Hall, DB .
BIOMETRICS, 2000, 56 (04) :1030-1039