Predicting child occupant crash injury severity in the United Arab Emirates using machine learning models for imbalanced dataset

Cited by: 3
Authors
Abdulazeez, Muhammad Uba [1, 2, 3]
Khan, Wasif [4, 5]
Abdullah, Kassim Abdulrahman [1, 2, 6]
Affiliations
[1] United Arab Emirates Univ, Coll Engn, Dept Mech & Aerosp Engn, POB 15551, Al Ain, Abu Dhabi, U Arab Emirates
[2] United Arab Emirates Univ, Emirates Ctr Mobil Res, POB 15551, Al Ain, Abu Dhabi, U Arab Emirates
[3] Abubakar Tafawa Balewa Univ, Fac Engn & Engn Technol, Dept Automot Engn, PMB 0248, Bauchi, Nigeria
[4] United Arab Emirates Univ, Coll Informat Technol, Dept Comp Sci & Software Engn, POB 15551, Al Ain, Abu Dhabi, U Arab Emirates
[5] United Arab Emirates Univ, Big Data Analyt Ctr, POB 15551, Al Ain, Abu Dhabi, U Arab Emirates
[6] United Arab Emirates Univ, Sheikh Khalifa Bin Zayed St, Al Ain 15551, Abu Dhabi, U Arab Emirates
Keywords
Crash injury severity; Child occupant; Machine learning; Data balancing; Feature selection; Injury severity prediction; SINGLE-VEHICLE; LOGISTIC-REGRESSION; MULTINOMIAL LOGIT; RISK-FACTORS; CLASSIFICATION; IMPACT; NETWORKS; SAFETY; SMOTE;
DOI
10.1016/j.iatssr.2023.05.003
Chinese Library Classification
U [Transportation]
Discipline classification code
08; 0823
Abstract
Road traffic crashes have increased over the years, leading to greater injury severity among children, who are mostly vehicle occupants in high-income countries. This adversely affects the healthy development of children and might lead to death. However, studies in the literature have focused on predicting crash injuries among adults, although children have different crash injury risks and crash kinematics compared to adults. To address this gap, this paper presents a new dataset for child occupant crash injury severity prediction collected over 8 years (2012 to 2019) in the United Arab Emirates (UAE). The performance of state-of-the-art machine learning algorithms was then evaluated using the proposed dataset. In addition, feature selection techniques and a logistic regression model were employed to extract the most significant features for crash injury severity prediction among child occupants. Furthermore, the impact of data balancing approaches on the prediction performance was analyzed, as the dataset is highly imbalanced. The experimental results showed that the AdaBoost, Bagging REP, ZeroR, OneR, and Decision Table algorithms predict child occupant injury severity with the highest accuracy. Child occupant seating position, emirate, crash location, crash type, and crash cause were identified as significant predictors of injury severity by both the feature selection and logistic regression models. © 2023 International Association of Traffic and Safety Sciences. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
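The abstract lists ZeroR among the most accurate algorithms and notes that the dataset is highly imbalanced. The sketch below is an illustration only, not the authors' pipeline: it uses hypothetical toy labels (95 "minor", 5 "severe") to show how a majority-class baseline such as ZeroR can score high accuracy while never predicting the minority class, and how a simple balancing step changes the class counts. Random oversampling is used here as a stand-in for the balancing approaches the paper evaluates (the keywords mention SMOTE, which synthesizes new points rather than duplicating rows). All function names and data are assumptions made for this example.

```python
import random
from collections import Counter


def zero_r(train_labels):
    """ZeroR baseline: always predict the most frequent training label."""
    majority = Counter(train_labels).most_common(1)[0][0]
    return lambda _features: majority


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall_for(y_true, y_pred, cls):
    """Fraction of true `cls` cases that were predicted as `cls`."""
    total = sum(1 for t in y_true if t == cls)
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
    return hits / total


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority rows until every class
    has as many rows as the largest class (not SMOTE)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, l in zip(rows, labels) if l == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Hypothetical toy data: one dummy feature per row, 95/5 class split.
labels = ["minor"] * 95 + ["severe"] * 5
rows = [[i] for i in range(100)]

model = zero_r(labels)
preds = [model(r) for r in rows]

ZEROR_ACCURACY = accuracy(labels, preds)
ZEROR_SEVERE_RECALL = recall_for(labels, preds, "severe")

balanced_rows, balanced_labels = random_oversample(rows, labels)
BALANCED_COUNTS = Counter(balanced_labels)

print(ZEROR_ACCURACY, ZEROR_SEVERE_RECALL, dict(BALANCED_COUNTS))
```

On this toy split the baseline reaches 0.95 accuracy with zero recall on the "severe" class, which is why accuracy alone can be a weak guide on imbalanced data and why class-wise metrics are worth reporting alongside it.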
Pages: 134-159
Page count: 26
Related papers
112 in total
  • [41] Pediatric and Youth Traffic-Collision Injuries in Al Ain, United Arab Emirates: A Prospective Study
    Grivna, Michal
    Eid, Hani O.
    Abu-Zidan, Fikri M.
    [J]. PLOS ONE, 2013, 8 (07)
  • [42] Child and Youth Traffic-Related Injuries: Use of a Trauma Registry to Identify Priorities for Prevention in the United Arab Emirates
    Grivna, Michal
    Barss, Peter
    Stanculescu, Cristina
    Eid, Hani O.
    Abu-Zidan, Fikri M.
    [J]. TRAFFIC INJURY PREVENTION, 2013, 14 (03) : 274 - 282
  • [43] Gupta B., 2017, International Journal of Computer Applications, V163, P15, DOI 10.5120/ijca2017913660
  • [44] John GH, 2013, arXiv, arXiv:1302.4964
  • [45] Hall M., 2009, SIGKDD EXPLORATIONS, V11, P10, DOI 10.1145/1656274.1656278
  • [46] Hall M. A., 1999, Proceedings of the Twelfth International Florida AI Research Society Conference, P235
  • [47] Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning
    Han, H
    Wang, WY
    Mao, BH
    [J]. ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 878 - 887
  • [48] ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning
    He, Haibo
    Bai, Yang
    Garcia, Edwardo A.
    Li, Shutao
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008 : 1322 - 1328
  • [49] Support vector machines
    Hearst, MA
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04) : 18 - 21
  • [50] VERY SIMPLE CLASSIFICATION RULES PERFORM WELL ON MOST COMMONLY USED DATASETS
    HOLTE, RC
    [J]. MACHINE LEARNING, 1993, 11 (01) : 63 - 91