Two-part regression models for longitudinal zero-inflated count data

被引:47
作者
Alfo, Marco [1 ]
Maruotti, Antonello [2 ]
机构
[1] Univ Roma La Sapienza, Dipartimento Stat Probabilita & Stat Applicate, I-00199 Rome, Italy
[2] Univ Roma Tre, Dipartimento Ist Pubbl Econ & Soc, I-00145 Rome, Italy
来源
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE | 2010年 / 38卷 / 02期
关键词
Hidden Markov model; Hurdle model; longitudinal data; random effects model; zero inflation; MAXIMUM-LIKELIHOOD-ESTIMATION; POISSON REGRESSION; HEALTH-CARE; MIXTURE LIKELIHOODS; EFFICIENCY; GEOMETRY; NUMBER;
D O I
10.1002/cjs.10056
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-part models are quite well established in the economic literature, since they resemble accurately a principal-agent type model, where homogeneous, observable, counted outcomes are subject to a (prior, exogenous) selection choice. The first decision can be represented by a binary choice model, modeled using a probit or a logit link; the second can be analyzed through a truncated discrete distribution such as a truncated Poisson, negative binomial, and so on. Only recently, a particular attention has been devoted to the extension of two-part models to handle longitudinal data. The authors discuss a semi-parametric estimation method for dynamic two-part models and propose a comparison with other, well-established alternatives. Heterogeneity sources that influence the first level decision process, that is, the decision to use a certain service, are assumed to influence also the (truncated) distribution of the positive outcomes. Estimation is carried out through an EM algorithm without parametric assumptions on the random effects distribution. Furthermore, the authors investigate the extension of the finite mixture representation to allow for unobservable transition between components in each of these parts. The proposed models are discussed using empirical as well as simulated data. The Canadian Journal of Statistics 38: 197-216; 2010 (C) 2010 Statistical Society of Canada
引用
收藏
页码:197 / 216
页数:20
相关论文
共 39 条
[1]  
[Anonymous], 2000, Sankhya Ser. A, DOI DOI 10.2307/25051289
[2]  
[Anonymous], 1997, Hidden Markov Models and Other Models for Discrete-Valued Time Series
[3]   The EM algorithm with gradient function update for discrete mixtures with known (fixed) number of components [J].
Böhning, D .
STATISTICS AND COMPUTING, 2003, 13 (03) :257-265
[4]   Nonparametric maximum likelihood estimation of population size based on the counting distribution [J].
Böhning, D ;
Schön, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :721-737
[5]   Equivalence of truncated count mixture distributions and mixtures of truncated count distributions [J].
Bohning, Dankmar ;
Kuhnert, Ronny .
BIOMETRICS, 2006, 62 (04) :1207-1215
[6]  
CAMERON C, 2005, MICROECONOMETICS
[7]  
CAPPE O, 2005, SPR S STAT, P1
[8]   Latent class models for utilisation of health care [J].
d'Uva, TB .
HEALTH ECONOMICS, 2006, 15 (04) :329-343
[9]  
Deb P, 2000, HEALTH ECON, V9, P475, DOI 10.1002/1099-1050(200009)9:6<475::AID-HEC544>3.0.CO
[10]  
2-H