Do Larger Sample Sizes Increase the Reliability of Traffic Incident Duration Models? A Case Study of East Tennessee Incidents

被引:14
作者
Zhang, Zihe [1 ]
Liu, Jun [1 ]
Li, Xiaobing [2 ]
Khattak, Asad J. [3 ]
机构
[1] Univ Alabama, Dept Civil Construct & Environm Engn, Tuscaloosa, AL 35487 USA
[2] Univ Alabama, Alabama Transportat Inst, Tuscaloosa, AL USA
[3] Univ Tennessee, Dept Civil & Environm Engn, Knoxville, TN USA
关键词
REGRESSION; NETWORK;
D O I
10.1177/0361198121992063
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Incident duration models are often developed to assist incident management and traveler information dissemination. With recent advances in data collection and management, enormous achieved incident data are now available for incident model development. However, a large volume of data may present challenges to practitioners, such as data processing and computation. Besides, data that span multiple years may have inconsistency issues because of the data collection environments and procedures. A practical question may arise in the incident modeling community-Is that much data really necessary ("all-in") to build models? If not, then how many data are necessary? To answer these questions, this study aims to investigate the relationship between the data sample sizes and the reliability of incident duration analysis models. This study proposed and demonstrated a sample size determination framework through a case study using data of over 47,000 incidents. This study estimated handfuls of hazard-based duration models with varying sample sizes. The relationships between sample size and model performance, along with estimate outcomes (i.e., coefficients and significance levels), were examined and visualized. The results showed that the variation of estimated coefficients decreases as the sample size increases, and becomes stabilized when the sample size reaches a critical threshold value. This critical threshold value may be the recommended sample size. The case study suggested a sample size of 6,500 to be enough for a reliable incident duration model. The critical value may vary significantly with different data and model specifications. More implications are discussed in the paper.
引用
收藏
页码:265 / 280
页数:16
相关论文
共 38 条
[1]   Impact of Abandoned and Disabled Vehicles on Freeway Incident Duration [J].
Chimba, Deo ;
Kutela, Boniphace ;
Ogletree, Gary ;
Horne, Frank ;
Tugwell, Mike .
JOURNAL OF TRANSPORTATION ENGINEERING, 2014, 140 (03)
[2]   Sample size considerations for the external validation of a multivariable prognostic model: a resampling study [J].
Collins, Gary S. ;
Ogundimu, Emmanuel O. ;
Altman, Douglas G. .
STATISTICS IN MEDICINE, 2016, 35 (02) :214-226
[3]   Sample size determination for logistic regression revisited [J].
Demidenko, Eugene .
STATISTICS IN MEDICINE, 2007, 26 (18) :3385-3397
[4]  
Dwivedi A.N., 2009, HDB RES INFORM TECHN
[5]  
Farradyne P.B., 2000, TRAFFIC INCIDENT MAN
[6]  
Figueiredo Filho Dalson Britto, 2013, Bras. political sci. rev., V7, P31
[7]   Estimating magnitude and duration of incident delays [J].
Garib, A ;
Radwan, AE ;
AlDeek, H .
JOURNAL OF TRANSPORTATION ENGINEERING-ASCE, 1997, 123 (06) :459-466
[9]   Modelling total duration of traffic incidents including incident detection and recovery time [J].
Hojati, Ahmad Tavassoli ;
Ferreira, Luis ;
Washington, Simon ;
Charles, Phil ;
Shobeirinejad, Ameneh .
ACCIDENT ANALYSIS AND PREVENTION, 2014, 71 :296-305
[10]   Hazard based models for freeway traffic incident duration [J].
Hojati, Ahmad Tavassoli ;
Ferreira, Luis ;
Washington, Simon ;
Charles, Phil .
ACCIDENT ANALYSIS AND PREVENTION, 2013, 52 :171-181