Markov regression models for count time series with excess zeros: A partial likelihood approach

被引:19
作者
Yang, Ming [1 ]
Zamba, Gideon K. D. [2 ]
Cavanaugh, Joseph E. [2 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Ctr Biostat AIDS Res, Cambridge, MA 02138 USA
[2] Univ Iowa, Dept Biostat, Coll Publ Hlth, Iowa City, IA 52242 USA
关键词
Autoregressive; EM algorithm; Poisson distribution; Zero-inflation; INFLATED POISSON REGRESSION; SCORE TEST;
D O I
10.1016/j.stamet.2013.02.001
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Count data with excess zeros are common in many biomedical and public health applications. The zero-inflated Poisson (ZIP) regression model has been widely used in practice to analyze such data. In this paper, we extend the classical ZIP regression framework to model count time series with excess zeros. A Markov regression model is presented and developed, and the partial likelihood is employed for statistical inference. Partial likelihood inference has been successfully applied in modeling time series where the conditional distribution of the response lies within the exponential family. Extending this approach to ZIP time series poses methodological and theoretical challenges, since the ZIP distribution is a mixture and therefore lies outside the exponential family. In the partial likelihood framework, we develop an EM algorithm to compute the maximum partial likelihood estimator (MPLE). We establish the asymptotic theory of the MPLE under mild regularity conditions and investigate its finite sample behavior in a simulation study. The performances of different partial-likelihood based model selection criteria are compared in the presence of model misspecification. Finally, we present an epidemiological application to illustrate the proposed methodology. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:26 / 38
页数:13
相关论文
共 34 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], 2012, Overdispersion Models in SAS
[3]   Unifying the derivations for the Akaike and corrected Akaike information criteria [J].
Cavanaugh, JE .
STATISTICS & PROBABILITY LETTERS, 1997, 33 (02) :201-208
[4]   MONTE-CARLO EM ESTIMATION FOR TIME-SERIES MODELS INVOLVING COUNTS [J].
CHAN, KS ;
LEDOLTER, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (429) :242-252
[5]  
COX DR, 1981, SCAND J STAT, V8, P93
[6]   PARTIAL LIKELIHOOD [J].
COX, DR .
BIOMETRIKA, 1975, 62 (02) :269-276
[7]   Observation-driven models for Poisson counts [J].
Davis, RA ;
Dunsmuir, WTM ;
Streett, SB .
BIOMETRIKA, 2003, 90 (04) :777-790
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]  
Diggle P., 2002, ANAL LONGITUDINAL DA
[10]   Time series analysis of non-Gaussian observations based on state space models from both classical and Bayesian perspectives [J].
Durbin, J ;
Koopman, SJ .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2000, 62 :3-29