共 42 条
Developing a Random Parameters Negative Binomial-Lindley Model to analyze highly over-dispersed crash count data
被引:54
作者:
Shaon, Mohammad Razaur Rahman
[1
]
Qin, Xiao
[1
]
Shirazi, Mohammadali
[2
]
Lord, Dominique
[2
]
Geedipally, Srinivas Reddy
[3
]
机构:
[1] Univ Wisconsin Milwaukee, Dept Civil & Environm Engn, POB 784, Milwaukee, WI 53201 USA
[2] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX 77843 USA
[3] Texas A&M Transportat Inst, 110 N Davis Dr, Arlington, TX 76013 USA
关键词:
Excess zero observations;
Over-dispersion;
Unobserved heterogeneity;
Mixed model;
Random parameters model;
Negative binomial-Lindley;
GENERALIZED LINEAR-MODEL;
MOTOR-VEHICLE CRASHES;
INJURY SEVERITIES;
UNOBSERVED HETEROGENEITY;
STATISTICAL-ANALYSIS;
MIXED MODELS;
POISSON;
FREQUENCY;
VARIANCES;
SAFETY;
D O I:
10.1016/j.amar.2018.04.002
中图分类号:
R1 [预防医学、卫生学];
学科分类号:
1004 ;
120402 ;
摘要:
The existence of preponderant zero crash sites and/or sites with large crash counts can present challenges during the statistical analysis of crash count data. Additionally, unobserved heterogeneity in crash data due to the absence of important variables could negatively impact the estimated model parameters. The traditional negative binomial (NB) model with fixed parameters might not adequately handle highly over-dispersed data or unobserved heterogeneity. Many research efforts that have involved the negative binomial-Lindley (NB-L) model or the random parameters negative binomial (RPNB) model, for example, have attempted to improve the inference of estimated coefficients by explicitly accounting for extra variation in crash data. The NB-L is a mixed modeling approach which provides flexibility to account for additional dispersion in data. The RP modeling approach accommodates the effect of unobserved variables by allowing the model parameters to vary from one observation to another. The following study proposes a combination of these models - the random parameters NB-L (RPNB-L) generalized linear model (GLM) - to account for underlying heterogeneity and address excess over-dispersion. The results show that the RPNB-L model not only provides a superior goodness-of-fit (GOF) with the sample data, but also offers a better understanding about the effects of potential contributing factors. The paper uses the Bayesian framework to provide a strategy for eliminating the potential for poor mixing in the Markov Chain Monte Carlo (MCMC) chains during the estimation of the RPNB-L model. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:33 / 44
页数:12
相关论文