Investigating Machine Learning Applications in the Prediction of Occupational Injuries in South African National Parks

被引:5
作者
Chadyiwa, Martha [1 ]
Kagura, Juliana [2 ]
Stewart, Aimee [3 ]
机构
[1] Univ Johannesburg, Dept Environm Hlth, Doornfontein Campus, ZA-2094 Johannesburg, South Africa
[2] Univ Witwatersrand, Div Epidemiol & Biostat, ZA-2000 Johannesburg, South Africa
[3] Univ Witwatersrand, Physiotherapy Dept, Sch Therapeut Sci, ZA-2000 Johannesburg, South Africa
来源
MACHINE LEARNING AND KNOWLEDGE EXTRACTION | 2022年 / 4卷 / 03期
关键词
machine learning; prediction; occupational injuries; national parks;
D O I
10.3390/make4030037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is a need to predict occupational injuries in South African National Parks for the purpose of implementing targeted interventions or preventive measures. Machine-learning models have the capability of predicting injuries such that the employees that are at risk of experiencing occupational injuries can be identified. Support Vector Machines (SVMs), k Nearest Neighbours (k-NN), XGB classifier and Deep Neural Networks were applied and overall performance was compared to the accuracy of baseline models that always predict low extremity injuries. Data extracted from the Department of Employment and Labour's Compensation Fund was used for training the models. SVMs had the best performance in predicting between low extremity injuries and injuries in the torso and hands regions. However, the overall accuracy was 56%, which was slightly above the baseline and below findings from similar previous research that reported a minimum of 62%. Gender was the only feature with an importance score significantly greater than zero. There is a need to use more features related to work conditions and which acknowledge the importance of environment in order to improve the accuracy of the predictions of the models. Furthermore, more types of injuries, and employees that have not experienced any injuries, should be included in future studies.
引用
收藏
页码:768 / 778
页数:11
相关论文
共 24 条
  • [1] A novel machine learning model to predict respiratory failure and invasive mechanical ventilation in critically ill patients suffering from COVID-19
    Bendavid, Itai
    Statlender, Liran
    Shvartser, Leonid
    Teppler, Shmuel
    Azullay, Roy
    Sapir, Rotem
    Singer, Pierre
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [2] Gender Differences in Occupational Injury Incidence
    Berecki-Gisolf, Janneke
    Smith, Peter M.
    Collie, Alex
    McClure, Roderick J.
    [J]. AMERICAN JOURNAL OF INDUSTRIAL MEDICINE, 2015, 58 (03) : 299 - 307
  • [3] Brownlee J, 2018, XG BOOST PYTHON GRAD
  • [4] Severity Prediction Models of Falling Risk for Workers at Height
    Chen, Hainan
    Luo, Xiaowei
    [J]. 5TH CREATIVE CONSTRUCTION CONFERENCE (CCC 2016), 2016, 164 : 439 - 445
  • [5] Machine-learning algorithms based on personalized pathways for a novel predictive model for the diagnosis of hepatocellular carcinoma
    Cheng, Binglin
    Zhou, Peitao
    Chen, Yuhan
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [6] Applying data mining techniques to explore factors contributing to occupational injuries in Taiwan's construction industry
    Cheng, Ching-Wu
    Leu, Sou-Sen
    Cheng, Ying-Mei
    Wu, Tsung-Chih
    Lin, Chen-Chung
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2012, 48 : 214 - 222
  • [7] Fuzzy inference model for assessing occupational risks in construction sites
    Debnath, Joy
    Biswas, Animesh
    Sivan, Preshobh
    Sen, Krishna Nirmalya
    Sahu, Subhashis
    [J]. INTERNATIONAL JOURNAL OF INDUSTRIAL ERGONOMICS, 2016, 55 : 114 - 128
  • [8] Fernandez A., 2018, Learning from imbalanced data sets, V10
  • [9] Hastie T., 2008, The Elements of StatisticalLearning: Data Mining, Inference, and Prediction
  • [10] He H, 2013, IMBALANCED LEARNING: FOUNDATIONS, ALGORITHMS, AND APPLICATIONS, P1, DOI 10.1002/9781118646106