Identification of the best machine learning model for the prediction of driver injury severity

被引:1
作者
Sorum, Neero Gumsar [1 ]
Pal, Dibyendu [1 ]
机构
[1] North Eastern Reg Inst Sci & Technol, Dept Civil Engn, Nirjuli 791109, Arunachal Prade, India
关键词
Road traffic accidents; driver injury severity; machine learning; light GBM; DIS; Dataiku; SUPPORT VECTOR MACHINE; LOGISTIC-REGRESSION; TRAFFIC CRASHES; RISK-FACTORS;
D O I
10.1080/17457300.2024.2335478
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Predicting the injury severities sustained by drivers engaged in road traffic accidents is a key topic of research in road traffic safety. The current study analyzed the driver injury severity (DIS) using twelve machine learning (ML) algorithms. These models were implemented using 0.70, 0.80, and 0.90 train ratios and 5-, 10- and 15-fold cross-validation. Ten years of accident data (from 2011 to 2020) was obtained from police department of Shillong, India. A total of 693 accidents were documented, with 68% being nonfatal and 32% being fatal. Precision, recall, accuracy, F1 score and area under the curve measures were used to compare the performance of all twelve ML models. Overall, the light gradient-boosting machine model was shown to be the best ML model for predicting the injury severities of drivers engaged in road traffic incidents. Finally, variable importance analysis results showed that cause of accident, collision type and types of vehicles were the most influencing factors in nonfatal and fatal driver accidents. The results also revealed that age and gender were slightly associated with DIS. The findings of the current research could be helpful to road safety agencies for the implementation of suitable countermeasures to increase driver safety in road accidents.
引用
收藏
页码:360 / 375
页数:16
相关论文
共 82 条
  • [31] Dataiku, 2014, PRONOUNCING DATAIKU
  • [32] Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks
    Delen, D
    Sharda, R
    Bessonov, M
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2006, 38 (03) : 434 - 444
  • [33] Driver crash risk factors and prevalence evaluation using naturalistic driving data
    Dingus, Thomas A.
    Guo, Feng
    Lee, Suzie
    Antin, Jonathan F.
    Perez, Miguel
    Buchanan-King, Mindy
    Hankey, Jonathan
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (10) : 2636 - 2641
  • [34] An introduction to ROC analysis
    Fawcett, Tom
    [J]. PATTERN RECOGNITION LETTERS, 2006, 27 (08) : 861 - 874
  • [35] Gan XF, 2020, CICTP 2020: ADVANCED TRANSPORTATION TECHNOLOGIES AND DEVELOPMENT-ENHANCING CONNECTIONS, P4195
  • [36] Gopinath V., 2017, Int. J. Emerg. Trends Technol. Comput. Sci, V6, P15
  • [37] Gururaj H. L., 2022, INT J TRANSPORT DEV, V6, P363, DOI [https://doi.org/10.2495/TDI-V6-N4-363-377, DOI 10.2495/TDI-V6-N4-363-377]
  • [38] Machine Learning: An Approach in Identifying Risk Factors for Coercion Compared to Binary Logistic Regression
    Hotzy, Florian
    Theodoridou, Anastasia
    Hoff, Paul
    Schneeberger, Andres R.
    Seifritz, Erich
    Olbrich, Sebastian
    Jaeger, Matthias
    [J]. FRONTIERS IN PSYCHIATRY, 2018, 9
  • [39] A comparative study of machine learning classifiers for injury severity prediction of crashes involving three-wheeled motorized rickshaw
    Ijaz, Muhammad
    Lan, Liu
    Zahid, Muhammad
    Jamal, Arshad
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2021, 154
  • [40] Comparison of four statistical and machine learning methods for crash severity prediction
    Iranitalab, Amirfarrokh
    Khattak, Aemal
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2017, 108 : 27 - 36