The Analysis of Construction Workplace Incident Fatalities Using Text Mining and Classification Techniques

被引:0
|
作者
Assaf, Sena [1 ]
Liu, Kexin [1 ]
Mao, Zeyu [1 ]
Saleh, Amira [1 ]
Abellanosa, Abbey [1 ]
Mohamed, Yasser [1 ]
机构
[1] Univ Alberta, Edmonton, AB T6G 2R3, Canada
来源
PROCEEDINGS OF THE CANADIAN SOCIETY FOR CIVIL ENGINEERING ANNUAL CONFERENCE 2023, VOL 5, CSCE 2023 | 2024年 / 499卷
关键词
Construction safety; Construction fatalities; Text mining; Classification models; INJURY; SAFETY;
D O I
10.1007/978-3-031-61503-0_12
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The construction industry in Canada has been witnessing an increased number of workplace fatalities. In fact, a total of 52 fatalities have been observed in Alberta's construction industry in 2020 according to Alberta Occupational Health and Safety (OHS). Alberta OHS works on investigating such incidents and prepares the corresponding investigation reports. However, these are presented in a textual format making it challenging to extract further insights. Additionally, these reports present knowledge on the causes of such fatalities which can be potentially leveraged in identifying the necessary preventive measures for other similar projects. As such, the objective of this research work is two-fold. First, it aims to obtain insights related to workplace incident fatalities in construction projects by analyzing 116 reports between 2004 and 2022. This will be supported by the use of text mining analysis as a preliminary step to transform the data into a structured format. Secondly, this work aims to develop and compare different machine learning-based classifiers to identify the probability of occurrence of a specified type of a certain fatality incident at the construction site given a potential-incident's information. Text mining results of 100 incident investigation reports showed that fall types of fatalities are the most common. Additionally, the performance of Decision Trees, Random Forest, Gradient Boosted Tree, and the Zero Classifier were assessed in classifying the type of fatality. The Random Forest classifier was selected for classifying Crush and Fall types of fatality incidents as it gave the highest F1 scores of 90.5% and 88.3%, respectively. Safety personnel can leverage the developed classifier to integrate a data-driven approach in identifying the necessary preventive measures to mitigate fatal incidents on construction sites.
引用
收藏
页码:157 / 171
页数:15
相关论文
共 50 条
  • [1] Construction accident narrative classification: An evaluation of text mining techniques
    Goh, Yang Miang
    Ubeynarayana, C. U.
    ACCIDENT ANALYSIS AND PREVENTION, 2017, 108 : 122 - 130
  • [2] Document Classification of Filipino Online Scam Incident Text using Data Mining Techniques
    Palad, Eddie Bouy B.
    Tangkeko, Marivic S.
    Magpantay, Lissa Andrea K.
    Sipin, Glenn L.
    ISCIT 2019: PROCEEDINGS OF 2019 19TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2019, : 232 - 237
  • [3] Automatic classification of academic documents using text mining techniques
    Nunez, Haydemar
    Ramos, Esmeralda
    2012 XXXVIII CONFERENCIA LATINOAMERICANA EN INFORMATICA (CLEI), 2012,
  • [4] Construction site accident analysis using text mining and natural language processing techniques
    Zhang, Fan
    Fleyeh, Hasan
    Wang, Xinru
    Lu, Minghui
    AUTOMATION IN CONSTRUCTION, 2019, 99 : 238 - 248
  • [5] Sentiment Analysis in Online Reviews Classification using Text Mining Techniques
    Agueda, M.
    Rita, P.
    Guerreiro, P.
    2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2019,
  • [6] Text Classification Using Ensemble Features Selection and Data Mining Techniques
    Shravankumar, B.
    Ravi, Vadlamani
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, SEMCCO 2014, 2015, 8947 : 176 - 186
  • [7] Automated Operations Classification using Text Mining Techniques
    Esmael, Bilal
    Arnaout, Mohammad Arghad
    Fruhwirth, Rudolf K.
    Thonhauser, Gerhard
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL V, 2010, : 235 - 238
  • [8] Analysis of child development facts and myths using text mining techniques and classification models
    Tajrian, Mehedi
    Rahman, Azizur
    Kabir, Muhammad Ashad
    Islam, Md Rafiqul
    HELIYON, 2024, 10 (17)
  • [9] Using Some Web Content Mining Techniques for Arabic Text Classification
    Zubi, Zakaria Suliman
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON DATA NETWORKS, COMMUNICATIONS, COMPUTERS (DNCOCO '09), 2009, : 73 - 84
  • [10] Safety Engagement in the Workplace: Text Mining Analysis
    Seo, Hyun Jeong
    Hong, Ah Jeong
    SAFETY, 2022, 8 (02)