Functional linear models for zero-inflated count data with application to modeling hospitalizations in patients on dialysis
被引:4
作者:
Sentuerk, Damla
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
Sentuerk, Damla
[1
]
Dalrymple, Lorien S.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Davis, Dept Med, Div Nephrol, Davis, CA 95616 USAUniv Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
Dalrymple, Lorien S.
[2
]
Nguyen, Danh V.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Irvine, Dept Med, Irvine, CA 92717 USA
Univ Calif Irvine, Inst Clin & Translat Sci, Irvine, CA USAUniv Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
Nguyen, Danh V.
[3
,4
]
机构:
[1] Univ Calif Los Angeles, Dept Biostat, Los Angeles, CA 90095 USA
[2] Univ Calif Davis, Dept Med, Div Nephrol, Davis, CA 95616 USA
[3] Univ Calif Irvine, Dept Med, Irvine, CA 92717 USA
[4] Univ Calif Irvine, Inst Clin & Translat Sci, Irvine, CA USA
functional data analysis;
end-stage renal disease;
hurdle model;
sparse longitudinal design;
United States Renal Data System;
zero-inflated Poisson model;
LONGITUDINAL DATA;
REGRESSION;
POISSON;
D O I:
10.1002/sim.6241
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
We propose functional linear models for zero-inflated count data with a focus on the functional hurdle and functional zero-inflated Poisson (ZIP) models. Although the hurdle model assumes the counts come from a mixture of a degenerate distribution at zero and a zero-truncated Poisson distribution, the ZIP model considers a mixture of a degenerate distribution at zero and a standard Poisson distribution. We extend the generalized functional linear model framework with a functional predictor and multiple cross-sectional predictors to model counts generated by a mixture distribution. We propose an estimation procedure for functional hurdle and ZIP models, called penalized reconstruction, geared towards error-prone and sparsely observed longitudinal functional predictors. The approach relies on dimension reduction and pooling of information across subjects involving basis expansions and penalized maximum likelihood techniques. The developed functional hurdle model is applied to modeling hospitalizations within the first 2years from initiation of dialysis, with a high percentage of zeros, in the Comprehensive Dialysis Study participants. Hospitalization counts are modeled as a function of sparse longitudinal measurements of serum albumin concentrations, patient demographics, and comorbidities. Simulation studies are used to study finite sample properties of the proposed method and include comparisons with an adaptation of standard principal components regression. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:4825 / 4840
页数:16
相关论文
共 39 条
[1]
[Anonymous], 2012, USRDS 2012 Annual Data Report: Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States