Crash Severity Analysis of Highways Based on Multinomial Logistic Regression Model, Decision Tree Techniques, and Artificial Neural Network: A Modeling Comparison

Cited by: 26
Authors
Shiran, Gholamreza [1 ]
Imaninasab, Reza [2 ]
Khayamim, Razieh [3 ]
Affiliations
[1] Univ Isfahan, Fac Civil Engn & Transportat, Esfahan 8174673441, Iran
[2] Purdue Univ, Lyles Sch Civil Engn, W Lafayette, IN 47907 USA
[3] Isfahan Univ Technol, Dept Transportat Engn, Esfahan 8415683111, Iran
Keywords
crash severity; multinomial logistic regression model; decision tree techniques; artificial neural network; MACHINE LEARNING-METHODS; DRIVER INJURY SEVERITY; CLASSIFICATION TREE; ACCIDENT; PREDICTION;
DOI
10.3390/su13105670
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Classifying vehicular crashes by severity is crucial because crashes differ in their financial and injury costs. In addition, accurate prediction modeling makes it possible to avoid crashes by identifying the factors that influence them. Crash severity analysis therefore requires accurate and time-saving prediction models. Moreover, statistical models are incapable of identifying the potential severity of crashes with respect to the influencing factors incorporated in the models. Previous research efforts applied data mining models to a limited set of crash severity classes, namely property damage only (PDO), fatality, and injury. In contrast, the present study sought to predict crash frequency according to five severity levels: PDO, fatality, severe injury, other visible injuries, and complaint of pain. The multinomial logistic regression (MLR) model and data mining approaches, including an artificial neural network-multilayer perceptron (ANN-MLP) and two decision tree techniques, i.e., the Chi-square automatic interaction detector (CHAID) and C5.0, were applied to traffic crash records for State Highways in California, USA. A comparison of the relative importance of the ten qualitative and ten quantitative independent variables incorporated in CHAID and C5.0 indicated that the cause of the crash (X1) and the number of vehicles (X5) were the most influential variables. However, the ANN-MLP model identified the cause of the crash (X1) and weather (X2) as the most contributing variables. In addition, the MLR model showed that the driver's age (X11) accounts for a larger proportion of traffic crash severity. The sensitivity analysis demonstrated that C5.0 had the best performance for predicting road crash severity. Not only did C5.0 take a shorter time (0.05 s) than CHAID, MLP, and MLR, it also achieved the highest accuracy rate for the training set. Its overall prediction accuracy on the training data was approximately 88.09%, compared to 77.21% and 70.21% for the CHAID and MLP models, respectively. In general, the findings of this study revealed that C5.0 can be a promising tool for predicting road crash severity.
Pages: 23
Related papers
50 in total
  • [1] Crash severity along rural mountainous highways in Malaysia: An application of a combined decision tree and logistic regression model
    Rusli, Rusdi
    Haque, Md. Mazharul
    Saifuzzaman, Mohammad
    King, Mark
    TRAFFIC INJURY PREVENTION, 2018, 19 (07) : 741 - 748
  • [2] Analysis and prediction of β-turn types using multinomial logistic regression and artificial neural network
    Mehdi, Poursheikhali Asgary
    Parviz, Abdolmaleki
    Anoshirvan, Kazemnejad
    Samad, Jahandideh
    BIOINFORMATICS, 2019, 35 (12) : E8 - E15
  • [3] CLASSIFICATION OF PSYCHIATRIC DISORDERS USING MULTINOMIAL LOGISTIC REGRESSION VERSUS ARTIFICIAL NEURAL NETWORK
    Martin Perez, Elena
    Caldero Alonso, Amaya
    Martin Martin, Quintin
    JP JOURNAL OF BIOSTATISTICS, 2021, 18 (03) : 395 - 408
  • [4] SHAP-based convolutional neural network modeling for intersection crash severity on Thailand's highways
    Sunkpho, Jirapon
    Se, Chamroeun
    Wipulanusat, Warit
    Ratanavaraha, Vatanavongs
    IATSS RESEARCH, 2025, 49 (01) : 27 - 41
  • [5] Small business credit scoring: A comparison of logistic regression, neural network, and decision tree models
    Zekic-Susac, M
    Sarlija, N
    Bensic, M
    ITI 2004: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2004, : 265 - 270
  • [6] Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network
    Amini, Payam
    Ahmadinia, Hasan
    Poorolajal, Jalal
    Moqaddasi Amiri, Mohammad
    IRANIAN JOURNAL OF PUBLIC HEALTH, 2016, 45 (09) : 1179 - 1187
  • [7] Vehicle collisions analysis on highways based on multi-user driving simulator and multinomial logistic regression model on US highways in Michigan
    Almadi, Abdulla I. M.
    Al Mamlook, Rabia Emhamed
    Ullah, Irfan
    Alshboul, Odey
    Bandara, Nishantha
    Shehadeh, Ali
    INTERNATIONAL JOURNAL OF CRASHWORTHINESS, 2023, 28 (06) : 770 - 785
  • [8] Comparison of Prediction Model for Cardiovascular Autonomic Dysfunction Using Artificial Neural Network and Logistic Regression Analysis
    Tang, Zi-Hui
    Liu, Juanmei
    Zeng, Fangfang
    Li, Zhongtao
    Yu, Xiaoling
    Zhou, Linuo
    PLOS ONE, 2013, 8 (08):
  • [9] RETRACTED: Analysis and identification of β-turn types using multinomial logistic regression and artificial neural network (Retracted Article)
    Asgary, Mehdi Poursheikhali
    Jahandideh, Samad
    Abdolmaleki, Parviz
    Kazemnejad, Anoshirvan
    BIOINFORMATICS, 2007, 23 (23) : 3125 - 3130
  • [10] Comparison of artificial neural network and logistic regression model for factors affecting birth weight
    Murat Kirişci
    SN Applied Sciences, 2019, 1