Feature Selection in Machine Learning Models for Road Accident Severity

被引:0
|
作者
Al-Turaiki, Isra [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
来源
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2020年 / 20卷 / 03期
关键词
Machine learning; Road; Traffic; Accidents; Severity; Classification Models; Ensemble; CLASSIFICATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic accidents are a major cause of serious injuries and deaths around the world. Building predictive models from traffic data can give insights that help authorities improve road safety. Feature selection is an important step in building effective machine learning models. Feature selection methods are used to determine features that are relevant to classification task. The chosen feature selection method can affect the performance of machine learning models. In this paper, a real dataset of traffic accidents in Saudi Arabia is used to model accident severity. Classification models are built using single and ensemble classification algorithms. In addition, we evaluate the performance of developed models to which feature selection is applied. Two feature selection methods are used in this study: information gain, which is a filter-based feature selection method, and a genetic algorithm, which is a wrapper-based method. Experimental results show that better classification performance is obtained with genetic algorithm feature selection. In particular, ID3 and naive Bayes classifiers have improved results with genetic algorithm feature selection.
引用
收藏
页码:77 / 82
页数:6
相关论文
共 50 条
  • [21] The Impact of Feature Selection on Different Machine Learning Models for Breast Cancer Classification
    Algherairy, Atheer
    Almattar, Wadha
    Bakri, Eman
    Albelali, Salma
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 91 - 96
  • [22] Severity Invariant Feature Selection for Machine Health Monitoring
    Yaqub, M. F.
    Gondal, I.
    Kamruzzaman, J.
    INTERNATIONAL REVIEW OF ELECTRICAL ENGINEERING-IREE, 2011, 6 (01): : 238 - 248
  • [23] Impact of COVID-19 Pandemic on Road Traffic Accident Severity in Thailand: An Application of K-Nearest Neighbor Algorithm with Feature Selection Techniques
    Simmachan, Teerawat
    Wongsai, Sangdao
    Lerdsuwansri, Rattana
    Boonkrong, Pichit
    THAILAND STATISTICIAN, 2025, 23 (01): : 129 - 143
  • [24] Analysis on road crash severity of drivers using machine learning techniques
    Mittal, Mohit
    Gupta, Swadha
    Chauhan, Shaifali
    Saraswat, Lalit Kumar
    INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2022, 13 (02) : 154 - 163
  • [25] Using Feature Selection with Machine Learning for Generation of Insurance Insights
    Taha, Ayman
    Cosgrave, Bernard
    Mckeever, Susan
    APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [26] Feature Selection Investigation in Machine Learning Docking Scoring Functions
    Balboni, Mauricio Dorneles Caldeira
    Arrua, Oscar Emilio
    Werhli, Adriano V.
    Machado, Karina dos Santos
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2023, 2023, 13954 : 58 - 69
  • [27] FeatureSelect: a software for feature selection based on machine learning approaches
    Masoudi-Sobhanzadeh, Yosef
    Motieghader, Habib
    Masoudi-Nejad, Ali
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [28] Machine learning and feature selection for the analysis of Alzheimer Metabolomics Data
    Belacel, Nabil
    Cuperlovic-Culf, Miroslava
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 222 - 226
  • [29] Integrated Long-Term Stock Selection Models Based on Feature Selection and Machine Learning Algorithms for China Stock Market
    Yuan, Xianghui
    Yuan, Jin
    Jiang, Tianzhao
    Ain, Qurat Ul
    IEEE ACCESS, 2020, 8 : 22672 - 22685
  • [30] A comprehensive survey on feature selection in the various fields of machine learning
    Dhal, Pradip
    Azad, Chandrashekhar
    APPLIED INTELLIGENCE, 2022, 52 (04) : 4543 - 4581