Transformer-based deep learning model and video dataset for unsafe action identification in construction projects

Cited by: 31
Authors
Yang, Meng [1 ,2 ]
Wu, Chengke [1 ]
Guo, Yuanjun [1 ,2 ]
Jiang, Rui [1 ]
Zhou, Feixiang [3 ]
Zhang, Jianlin [4 ]
Yang, Zhile [1 ,2 ,5 ]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
[4] China Construct Sci & Technol Grp Cooperat, Room 703,G3 Bldg,TCL Int Ecity Nanshan, Shenzhen, Peoples R China
[5] Guangdong Inst Carbon Neutral, Bldg 41,Huangshaping Innovat Pk,Phase1, Shaoguan, Peoples R China
Keywords
Action recognition; Construction safety; Transformer; Deep learning; Vision; Capture; Falls
DOI
10.1016/j.autcon.2022.104703
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
A large proportion of construction accidents are caused by unintentional, unsafe actions and behaviors. Monitoring unsafe behaviors through conventional manual supervision is difficult and ineffective because working conditions on construction sites are complex and dynamic. Recently, surveillance videos and computer vision (CV) techniques have been increasingly adopted to identify risky behaviors automatically. However, current CV models still cannot effectively capture and fuse the spatial and temporal features in video clips. To address this challenge, this paper describes a deep learning model named Spatial Temporal Relation Transformer (STR-Transformer), in which spatial and temporal features of work behaviors are extracted simultaneously in parallel video streams and then fused by a specially designed module. To verify the effectiveness of the STR-Transformer, a customized dataset is developed, comprising 1595 video clips across seven categories of construction worker behaviors. In numerical experiments and case studies, the STR-Transformer achieves an average precision of 88.7%, 4.0% higher than the baseline model. The STR-Transformer enables more accurate and reliable automatic safety surveillance on construction projects and is expected to reduce accident rates and management costs. Moreover, the performance of the STR-Transformer relies on efficient feature integration, which may inspire future studies to identify, extract, and fuse richer features when applying CV-based deep learning models in construction management.
Pages: 14