Transformer-based deep learning model and video dataset for unsafe action identification in construction projects

被引:31
|
作者
Yang, Meng [1 ,2 ]
Wu, Chengke [1 ]
Guo, Yuanjun [1 ,2 ]
Jiang, Rui [1 ]
Zhou, Feixiang [3 ]
Zhang, Jianlin [4 ]
Yang, Zhile [1 ,2 ,5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
[4] China Construct Sci & Technol Grp Cooperat, Room 703,G3 Bldg,TCL Int Ecity Nanshan, Shenzhen, Peoples R China
[5] Guangdong Inst Carbon Neutral, Bldg 41,Huangshaping Innovat Pk,Phase1, Shaoguan, Peoples R China
关键词
Action recognition; Construction safety; Transformer; Deep learning; ACTION RECOGNITION; VISION; CAPTURE; FALLS;
D O I
10.1016/j.autcon.2022.104703
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A large proportion of construction accidents are caused by unintentional and unsafe actions and behaviors. It is of significant difficulties and ineffectiveness to monitor unsafe behaviors using conventional manual supervision due to the complex and dynamic working conditions on construction sites. Recently, surveillance videos and computer vision (CV) techniques have been increasingly adopted to automatically identify risky behaviors. However, the challenge remains that spatial and temporal features in video clips cannot be effectively captured and fused by current CV models. To address this challenge, this paper describes a deep learning model named Spatial Temporal Relation Transformer (STR-Transformer), where spatial and temporal features of work behaviors are simultaneously extracted in paralleling video streams and then fused by a specially designed module. To verify the effectiveness of the STR-Transformer, a customized dataset is developed, including seven categories of construction worker behaviors and 1595 video clips. In numerical experiments and case studies, the STR-Transformer achieves an average precision of 88.7%, 4.0% higher than the baseline model. The STR-Transformer enables more accurate and reliable automatic safety surveillance on construction projects, and is expected to reduce accident rates and management costs. Moreover, the performance of STR-Transformer relies on efficient feature integration, which may inspire future studies to identify, extract, and fuse richer features when applying CV-based deep learning models in construction management.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] A transformer-based multi-task deep learning model for simultaneous T-stage identification and segmentation of nasopharyngeal carcinoma
    Yang, Kaifan
    Dong, Xiuyu
    Tang, Fan
    Ye, Feng
    Chen, Bei
    Liang, Shujun
    Zhang, Yu
    Xu, Yikai
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [32] TRFM-LS: Transformer-Based Deep Learning Method for Vessel Trajectory Prediction
    Jiang, Dapeng
    Shi, Guoyou
    Li, Na
    Ma, Lin
    Li, Weifeng
    Shi, Jiahui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [33] SpineHRformer: A Transformer-Based Deep Learning Model for Automatic Spine Deformity Assessment with Prospective Validation
    Zhao, Moxin
    Meng, Nan
    Cheung, Jason Pui Yin
    Yu, Chenxi
    Lu, Pengyu
    Zhang, Teng
    BIOENGINEERING-BASEL, 2023, 10 (11):
  • [34] Enhancing Microseismic Signal Classification in Metal Mines Using Transformer-Based Deep Learning
    Peng, Pingan
    Lei, Ru
    Wang, Jinmiao
    SUSTAINABILITY, 2023, 15 (20)
  • [35] GPTransformer: A Transformer-Based Deep Learning Method for Predicting Fusarium Related Traits in Barley
    Jubair, Sheikh
    Tucker, James R.
    Henderson, Nathan
    Hiebert, Colin W.
    Badea, Ana
    Domaratzki, Michael
    Fernando, W. G. Dilantha
    FRONTIERS IN PLANT SCIENCE, 2021, 12
  • [36] YOLO-Sp: A Novel Transformer-Based Deep Learning Model for Achnatherum splendens Detection
    Zhang, Yuzhuo
    Wang, Tianyi
    You, Yong
    Wang, Decheng
    Zhang, Dongyan
    Lv, Yuchan
    Lu, Mengyuan
    Zhang, Xingshan
    AGRICULTURE-BASEL, 2023, 13 (06):
  • [37] Pilot Stress Detection Through Physiological Signals Using a Transformer-Based Deep Learning Model
    Li, Yuhan
    Li, Ke
    Chen, Jiaao
    Wang, Shaofan
    Lu, Haochang
    Wen, Dongsheng
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 11774 - 11784
  • [38] A Transformer-Based Deep Learning Model for Sleep Apnea Detection and Application on RingConn Smart Ring
    Wu, Zetong
    Wu, Hao
    Fang, Kaiqun
    Sze, Keith Siu-Fung
    Feng, Qianjin
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [39] Timed-image based deep learning for action recognition in video sequences
    Atto, Abdourrahmane Mahamane
    Benoit, Alexandre
    Lambert, Patrick
    PATTERN RECOGNITION, 2020, 104
  • [40] Sewer defect detection from 3D point clouds using a transformer-based deep learning model
    Zhou, Yunxiang
    Ji, Ankang
    Zhang, Limao
    AUTOMATION IN CONSTRUCTION, 2022, 136