The Analysis of Construction Workplace Incident Fatalities Using Text Mining and Classification Techniques

被引:0
作者
Assaf, Sena [1 ]
Liu, Kexin [1 ]
Mao, Zeyu [1 ]
Saleh, Amira [1 ]
Abellanosa, Abbey [1 ]
Mohamed, Yasser [1 ]
机构
[1] Univ Alberta, Edmonton, AB T6G 2R3, Canada
来源
PROCEEDINGS OF THE CANADIAN SOCIETY FOR CIVIL ENGINEERING ANNUAL CONFERENCE 2023, VOL 5, CSCE 2023 | 2024年 / 499卷
关键词
Construction safety; Construction fatalities; Text mining; Classification models; INJURY; SAFETY;
D O I
10.1007/978-3-031-61503-0_12
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The construction industry in Canada has been witnessing an increased number of workplace fatalities. In fact, a total of 52 fatalities have been observed in Alberta's construction industry in 2020 according to Alberta Occupational Health and Safety (OHS). Alberta OHS works on investigating such incidents and prepares the corresponding investigation reports. However, these are presented in a textual format making it challenging to extract further insights. Additionally, these reports present knowledge on the causes of such fatalities which can be potentially leveraged in identifying the necessary preventive measures for other similar projects. As such, the objective of this research work is two-fold. First, it aims to obtain insights related to workplace incident fatalities in construction projects by analyzing 116 reports between 2004 and 2022. This will be supported by the use of text mining analysis as a preliminary step to transform the data into a structured format. Secondly, this work aims to develop and compare different machine learning-based classifiers to identify the probability of occurrence of a specified type of a certain fatality incident at the construction site given a potential-incident's information. Text mining results of 100 incident investigation reports showed that fall types of fatalities are the most common. Additionally, the performance of Decision Trees, Random Forest, Gradient Boosted Tree, and the Zero Classifier were assessed in classifying the type of fatality. The Random Forest classifier was selected for classifying Crush and Fall types of fatality incidents as it gave the highest F1 scores of 90.5% and 88.3%, respectively. Safety personnel can leverage the developed classifier to integrate a data-driven approach in identifying the necessary preventive measures to mitigate fatal incidents on construction sites.
引用
收藏
页码:157 / 171
页数:15
相关论文
共 50 条
  • [31] Analysis of Nature of Science Included in Recent Popular Writing Using Text Mining Techniques
    Feng Jiang
    William F. McComas
    Science & Education, 2014, 23 : 1785 - 1809
  • [32] Identification and analysis of misclassified work-zone crashes using text mining techniques
    Abu Sayed, Md
    Qin, Xiao
    Kate, Rohit J.
    Anisuzzaman, D. M.
    Yu, Zeyun
    ACCIDENT ANALYSIS AND PREVENTION, 2021, 159
  • [33] Identifying risks areas related to medication administrations-text mining analysis using free-text descriptions of incident reports
    Harkanen, Marja
    Paananen, Jussi
    Murrells, Trevor
    Rafferty, Anne Marie
    Franklin, Bryony Dean
    BMC HEALTH SERVICES RESEARCH, 2019, 19 (01)
  • [34] Classification of broadband network devices using text mining technique
    Ketcham, Mahasak
    Ganokratanaa, Thittaporn
    Sridoung, Nattapat
    METHODSX, 2023, 11
  • [35] Automatic Lyrics Classification System Using Text Mining Technique
    Jareanpon, Chatklaw
    Kiatjindarat, Waranyoo
    Polhome, Thanawat
    Khongkraphan, Kittiya
    2018 INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT), 2018,
  • [36] Document classification using a deep neural network in text mining
    Lee, Bo-Hui
    Lee, Su-Jin
    Choi, Yong-Seok
    KOREAN JOURNAL OF APPLIED STATISTICS, 2020, 33 (05) : 615 - 625
  • [37] Identification of Moonlighting Proteins in Genomes Using Text Mining Techniques
    Jain, Aashish
    Gali, Hareesh
    Kihara, Daisuke
    PROTEOMICS, 2018, 18 (21-22)
  • [38] Text classification algorithms for mining unstructured data: a SWOT analysis
    Kumar A.
    Dabas V.
    Hooda P.
    International Journal of Information Technology, 2020, 12 (4) : 1159 - 1169
  • [39] Hierarchical classification in text mining for sentiment analysis of online news
    Jinyan Li
    Simon Fong
    Yan Zhuang
    Richard Khoury
    Soft Computing, 2016, 20 : 3411 - 3420
  • [40] Hierarchical classification in text mining for sentiment analysis of online news
    Li, Jinyan
    Fong, Simon
    Zhuang, Yan
    Khoury, Richard
    SOFT COMPUTING, 2016, 20 (09) : 3411 - 3420