Functional linear models for zero-inflated count data with application to modeling hospitalizations in patients on dialysis

被引:4
作者
Sentuerk, Damla [1 ]
Dalrymple, Lorien S. [2 ]
Nguyen, Danh V. [3 ,4 ]
机构
[1] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[2] Univ Calif Davis, Dept Med, Div Nephrol, Davis, CA 95616 USA
[3] Univ Calif Irvine, Dept Med, Irvine, CA 92717 USA
[4] Univ Calif Irvine, Inst Clin & Translat Sci, Irvine, CA USA
关键词
functional data analysis; end-stage renal disease; hurdle model; sparse longitudinal design; United States Renal Data System; zero-inflated Poisson model; LONGITUDINAL DATA; REGRESSION; POISSON;
D O I
10.1002/sim.6241
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose functional linear models for zero-inflated count data with a focus on the functional hurdle and functional zero-inflated Poisson (ZIP) models. Although the hurdle model assumes the counts come from a mixture of a degenerate distribution at zero and a zero-truncated Poisson distribution, the ZIP model considers a mixture of a degenerate distribution at zero and a standard Poisson distribution. We extend the generalized functional linear model framework with a functional predictor and multiple cross-sectional predictors to model counts generated by a mixture distribution. We propose an estimation procedure for functional hurdle and ZIP models, called penalized reconstruction, geared towards error-prone and sparsely observed longitudinal functional predictors. The approach relies on dimension reduction and pooling of information across subjects involving basis expansions and penalized maximum likelihood techniques. The developed functional hurdle model is applied to modeling hospitalizations within the first 2years from initiation of dialysis, with a high percentage of zeros, in the Comprehensive Dialysis Study participants. Hospitalization counts are modeled as a function of sparse longitudinal measurements of serum albumin concentrations, patient demographics, and comorbidities. Simulation studies are used to study finite sample properties of the proposed method and include comparisons with an adaptation of standard principal components regression. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:4825 / 4840
页数:16
相关论文
共 39 条
  • [1] [Anonymous], 2012, USRDS 2012 Annual Data Report: Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States
  • [2] Buus A, 2010, STAT MED, V30, P2326
  • [3] Estimation in generalized linear models for functional data via penalized likelihood
    Cardot, H
    Sarda, P
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2005, 92 (01) : 24 - 41
  • [4] Semiparametric zero-inflated Poisson models with application to animal abundance studies
    Chiogna, Monica
    Gaetan, Carlo
    [J]. ENVIRONMETRICS, 2007, 18 (03) : 303 - 314
  • [5] Generalized Multilevel Functional Regression
    Crainiceanu, Ciprian M.
    Staicu, Ana-Maria
    Di, Chong-Zhi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (488) : 1550 - 1561
  • [6] Estess JP, 2014, BIOMETRICS IN PRESS
  • [7] Corrected Confidence Bands for Functional Data Using Principal Components
    Goldsmith, J.
    Greven, S.
    Crainiceanu, C.
    [J]. BIOMETRICS, 2013, 69 (01) : 41 - 51
  • [8] Penalized Functional Regression
    Goldsmith, Jeff
    Bobb, Jennifer
    Crainiceanu, Ciprian M.
    Caffo, Brian
    Reich, Daniel
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (04) : 830 - 851
  • [9] Zero-inflated Poisson and binomial regression with random effects: A case study
    Hall, DB
    [J]. BIOMETRICS, 2000, 56 (04) : 1030 - 1039
  • [10] ZERO-ALTERED AND OTHER REGRESSION-MODELS FOR COUNT DATA WITH ADDED ZEROS
    HEILBRON, DC
    [J]. BIOMETRICAL JOURNAL, 1994, 36 (05) : 531 - 547