Modeling Road Accident Severity with Comparisons of Logistic Regression, Decision Tree and Random Forest

被引:55
作者
Chen, Mu-Ming [1 ]
Chen, Mu-Chen [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Transportat & Logist Management, Hsinchu 30010, Taiwan
关键词
transportation; road accident severity; logistic regression; decision tree; random forest; CLASSIFICATION TREES; INJURY SEVERITY; VEHICLE; CRASHES; SPECIFICITY; SENSITIVITY; PREDICTION; TESTS;
D O I
10.3390/info11050270
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To reduce the damage caused by road accidents, researchers have applied different techniques to explore correlated factors and develop efficient prediction models. The main purpose of this study is to use one statistical and two nonparametric data mining techniques, namely, logistic regression (LR), classification and regression tree (CART), and random forest (RF), to compare their prediction capability, identify the significant variables (identified by LR) and important variables (identified by CART or RF) that are strongly correlated with road accident severity, and distinguish the variables that have significant positive influence on prediction performance. In this study, three prediction performance evaluation measures, accuracy, sensitivity and specificity, are used to find the best integrated method which consists of the most effective prediction model and the input variables that have higher positive influence on accuracy, sensitivity and specificity.
引用
收藏
页数:23
相关论文
共 38 条
[1]   Analysis of types of crashes at signalized intersections by using complete crash data and tree-based regression [J].
Abdel-Aty, M ;
Keller, J ;
Brady, PA .
STATISTICAL METHODS; HIGHWAY SAFETY DATA, ANALYSIS, AND EVALUATION; OCCUPANT PROTECTION; SYSTEMATIC REVIEWS AND META-ANALYSIS, 2005, (1908) :37-45
[2]   Assessing Safety on Dutch Freeways with Data from Infrastructure-Based Intelligent Transportation Systems [J].
Abdel-Aty, Mohamed ;
Pande, Anurag ;
Das, Abhishek ;
Knibbe, Willem Jan .
TRANSPORTATION RESEARCH RECORD, 2008, (2083) :153-161
[4]   Using logistic regression to estimate the influence of accident factors on accident severity [J].
Al-Ghamdi, AS .
ACCIDENT ANALYSIS AND PREVENTION, 2002, 34 (06) :729-741
[5]   On the prediction of geoeffectiveness of CMEs during the ascending phase of SC24 using a logistic regression method [J].
Besliu-Ionescu, D. ;
Talpeanu, D-C ;
Mierla, M. ;
Muntean, G. Maris .
JOURNAL OF ATMOSPHERIC AND SOLAR-TERRESTRIAL PHYSICS, 2019, 193
[6]  
Breiman L., 2001, RANDOM FORESTS, V45, P5, DOI DOI 10.1023/A:1010933404324
[7]  
Breiman L, 1984, CLASSIFICATION REGRE
[8]   Analysis of traffic injury severity: An application of non-parametric classification tree techniques [J].
Chang, Li-Yen ;
Wang, Hsiu-Wen .
ACCIDENT ANALYSIS AND PREVENTION, 2006, 38 (05) :1019-1027
[9]   Analysis of driver injury severity in truck-involved accidents using a non-parametric classification tree model [J].
Chang, Li-Yen ;
Chien, Jui-Tseng .
SAFETY SCIENCE, 2013, 51 (01) :17-22
[10]   Data mining of tree-based models to analyze freeway accident frequency [J].
Chang, LY ;
Chen, WC .
JOURNAL OF SAFETY RESEARCH, 2005, 36 (04) :365-375