Transformer-based deep learning model and video dataset for unsafe action identification in construction projects

被引:31
|
作者
Yang, Meng [1 ,2 ]
Wu, Chengke [1 ]
Guo, Yuanjun [1 ,2 ]
Jiang, Rui [1 ]
Zhou, Feixiang [3 ]
Zhang, Jianlin [4 ]
Yang, Zhile [1 ,2 ,5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
[4] China Construct Sci & Technol Grp Cooperat, Room 703,G3 Bldg,TCL Int Ecity Nanshan, Shenzhen, Peoples R China
[5] Guangdong Inst Carbon Neutral, Bldg 41,Huangshaping Innovat Pk,Phase1, Shaoguan, Peoples R China
关键词
Action recognition; Construction safety; Transformer; Deep learning; ACTION RECOGNITION; VISION; CAPTURE; FALLS;
D O I
10.1016/j.autcon.2022.104703
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A large proportion of construction accidents are caused by unintentional and unsafe actions and behaviors. It is of significant difficulties and ineffectiveness to monitor unsafe behaviors using conventional manual supervision due to the complex and dynamic working conditions on construction sites. Recently, surveillance videos and computer vision (CV) techniques have been increasingly adopted to automatically identify risky behaviors. However, the challenge remains that spatial and temporal features in video clips cannot be effectively captured and fused by current CV models. To address this challenge, this paper describes a deep learning model named Spatial Temporal Relation Transformer (STR-Transformer), where spatial and temporal features of work behaviors are simultaneously extracted in paralleling video streams and then fused by a specially designed module. To verify the effectiveness of the STR-Transformer, a customized dataset is developed, including seven categories of construction worker behaviors and 1595 video clips. In numerical experiments and case studies, the STR-Transformer achieves an average precision of 88.7%, 4.0% higher than the baseline model. The STR-Transformer enables more accurate and reliable automatic safety surveillance on construction projects, and is expected to reduce accident rates and management costs. Moreover, the performance of STR-Transformer relies on efficient feature integration, which may inspire future studies to identify, extract, and fuse richer features when applying CV-based deep learning models in construction management.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Automatic identification of suicide notes with a transformer-based deep learning model
    Zhang, Tianlin
    Schoene, Annika M.
    Ananiadou, Sophia
    INTERNET INTERVENTIONS-THE APPLICATION OF INFORMATION TECHNOLOGY IN MENTAL AND BEHAVIOURAL HEALTH, 2021, 25
  • [2] A teacher-student deep learning strategy for extreme low resolution unsafe action recognition in construction projects
    Yang, Meng
    Wu, Chengke
    Guo, Yuanjun
    He, Yong
    Jiang, Rui
    Jiang, Junjie
    Yang, Zhile
    ADVANCED ENGINEERING INFORMATICS, 2024, 59
  • [3] TemproNet: A transformer-based deep learning model for seawater temperature prediction
    Chen, Qiaochuan
    Cai, Candong
    Chen, Yaoran
    Zhou, Xi
    Zhang, Dan
    Peng, Yan
    OCEAN ENGINEERING, 2024, 293
  • [4] Deep-ProBind: binding protein prediction with transformer-based deep learning model
    Khan, Salman
    Noor, Sumaiya
    Awan, Hamid Hussain
    Iqbal, Shehryar
    Alqahtani, Salman A.
    Dilshad, Naqqash
    Ahmad, Nijad
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [5] A transformer-based deep learning model for recognizing communication-oriented entities from patents of ICT in construction
    Wu, Hengqin
    Shen, Geoffrey Qiping
    Lin, Xue
    Li, Minglei
    Li, Clyde Zhengdao
    AUTOMATION IN CONSTRUCTION, 2021, 125
  • [6] Transformer-based deep learning model for forced oscillation localization
    Matar, Mustafa
    Estevez, Pablo Gill
    Marchi, Pablo
    Messina, Francisco
    Elmoudi, Ramadan
    Wshah, Safwan
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 146
  • [7] Characterization of groundwater contamination: A transformer-based deep learning model
    Bai, Tao
    Tahmasebi, Pejman
    ADVANCES IN WATER RESOURCES, 2022, 164
  • [8] GIT: A Transformer-Based Deep Learning Model for Geoacoustic Inversion
    Feng, Sheng
    Zhu, Xiaoqian
    Ma, Shuqing
    Lan, Qiang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (06)
  • [9] A transformer-based deep learning model for Persian moral sentiment analysis
    Karami, Behnam
    Bakouie, Fatemeh
    Gharibzadeh, Shahriar
    JOURNAL OF INFORMATION SCIENCE, 2023,
  • [10] Construction of Transformer Fault Diagnosis and Prediction Model Based on Deep Learning
    Li X.
    Journal of Computing and Information Technology, 2022, 30 (04) : 223 - 238