Transformer-based deep learning model and video dataset for unsafe action identification in construction projects

Cited by: 31
Authors
Yang, Meng [1 ,2 ]
Wu, Chengke [1 ]
Guo, Yuanjun [1 ,2 ]
Jiang, Rui [1 ]
Zhou, Feixiang [3 ]
Zhang, Jianlin [4 ]
Yang, Zhile [1 ,2 ,5 ]
Affiliations
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
[4] China Construct Sci & Technol Grp Cooperat, Room 703,G3 Bldg,TCL Int Ecity Nanshan, Shenzhen, Peoples R China
[5] Guangdong Inst Carbon Neutral, Bldg 41,Huangshaping Innovat Pk,Phase1, Shaoguan, Peoples R China
Keywords
Action recognition; Construction safety; Transformer; Deep learning; ACTION RECOGNITION; VISION; CAPTURE; FALLS;
DOI
10.1016/j.autcon.2022.104703
Chinese Library Classification (CLC) number
TU [Building Science];
Discipline classification code
0813;
Abstract
A large proportion of construction accidents are caused by unintentional and unsafe actions and behaviors. Monitoring such behaviors through conventional manual supervision is difficult and ineffective because working conditions on construction sites are complex and dynamic. Recently, surveillance videos and computer vision (CV) techniques have been increasingly adopted to identify risky behaviors automatically. However, current CV models still cannot effectively capture and fuse the spatial and temporal features in video clips. To address this challenge, this paper describes a deep learning model named Spatial Temporal Relation Transformer (STR-Transformer), in which spatial and temporal features of work behaviors are extracted simultaneously in parallel video streams and then fused by a specially designed module. To verify the effectiveness of the STR-Transformer, a customized dataset is developed, comprising 1595 video clips across seven categories of construction worker behaviors. In numerical experiments and case studies, the STR-Transformer achieves an average precision of 88.7%, 4.0% higher than the baseline model. The STR-Transformer enables more accurate and reliable automatic safety surveillance on construction projects and is expected to reduce accident rates and management costs. Moreover, because the performance of the STR-Transformer relies on efficient feature integration, it may inspire future studies to identify, extract, and fuse richer features when applying CV-based deep learning models in construction management.
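The two-stream idea in the abstract (separate spatial and temporal features, then fusion) can be illustrated with a toy sketch. This is not the STR-Transformer: it has no attention layers and no learned weights, and every name here (`spatial_stream`, `temporal_stream`, `fuse`, `clip_features`) is hypothetical. It assumes each frame has already been reduced to a flat feature vector, uses per-feature averaging as a stand-in for the spatial stream, mean absolute frame-to-frame change as a stand-in for the temporal stream, and plain concatenation as a stand-in for the fusion module.

```python
from typing import List

# Assumption: one frame = one flattened feature vector (hypothetical representation).
Frame = List[float]

def spatial_stream(clip: List[Frame]) -> List[float]:
    """Appearance summary: average each feature over all frames."""
    n = len(clip)
    return [sum(frame[i] for frame in clip) / n for i in range(len(clip[0]))]

def temporal_stream(clip: List[Frame]) -> List[float]:
    """Motion summary: mean absolute change between consecutive frames."""
    if len(clip) < 2:
        return [0.0] * len(clip[0])
    steps = len(clip) - 1
    return [
        sum(abs(clip[t + 1][i] - clip[t][i]) for t in range(steps)) / steps
        for i in range(len(clip[0]))
    ]

def fuse(spatial: List[float], temporal: List[float]) -> List[float]:
    """Stand-in fusion: concatenate the two stream outputs."""
    return spatial + temporal

def clip_features(clip: List[Frame]) -> List[float]:
    """Run both streams on the same clip and fuse their outputs."""
    return fuse(spatial_stream(clip), temporal_stream(clip))

# Three frames, two features each: the first feature changes, the second is static.
clip = [[0.0, 1.0], [2.0, 1.0], [4.0, 1.0]]
print(clip_features(clip))  # [2.0, 1.0, 2.0, 0.0]
```

In this toy clip the static second feature yields a zero temporal response while still contributing to the spatial summary, which is the kind of complementary information a fusion step is meant to combine.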
Pages: 14
Related papers
50 items in total
  • [41] Patent image retrieval using transformer-based deep metric learning
    Higuchi, Kotaro
    Yanai, Keiji
    WORLD PATENT INFORMATION, 2023, 74
  • [42] A Transformer-Based Framework for Parameter Learning of a Land Surface Hydrological Process Model
    Li, Klin
    Lu, Yutong
    REMOTE SENSING, 2023, 15 (14)
  • [43] Estimating finger joint angles by surface EMG signal using feature extraction and transformer-based deep learning model
    Putro, Nur Achmad Sulistyo
    Avian, Cries
    Prakosa, Setya Widyawan
    Mahali, Muhammad Izzuddin
    Leu, Jenq-Shiou
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [44] Transformer-based deep learning for predicting protein properties in the life sciences
    Chandra, Abel
    Tunnermann, Laura
    Lofstedt, Tommy
    Gratz, Regina
    ELIFE, 2023, 12
  • [45] Broadband Solar Metamaterial Absorbers Empowered by Transformer-Based Deep Learning
    Chen, Wei
    Gao, Yuan
    Li, Yuyang
    Yan, Yiming
    Ou, Jun-Yu
    Ma, Wenzhuang
    Zhu, Jinfeng
    ADVANCED SCIENCE, 2023, 10 (13)
  • [46] A Transformer-Based Deep Learning Network for Underwater Acoustic Target Recognition
    Feng, Sheng
    Zhu, Xiaoqian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [47] Multidomain transformer-based deep learning for early detection of network intrusion
    Liu, Jinxin
    Simsek, Murat
    Nogueira, Michele
    Kantarci, Burak
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6056 - 6061
  • [48] Transformer-based Reinforcement Learning Model for Optimized Quantitative Trading
    Kumar, Aniket
    Rizk, Rodrigue
    Santosh, K. C.
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1454 - 1455
  • [49] CWPR: An optimized transformer-based model for construction worker pose estimation on construction robots
    Zhou, Jiakai
    Zhou, Wanlin
    Wang, Yang
    ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [50] Population-Specific Glucose Prediction in Diabetes Care With Transformer-Based Deep Learning on the Edge
    Zhu, Taiyu
    Kuang, Lei
    Piao, Chengzhe
    Zeng, Junming
    Li, Kezhi
    Georgiou, Pantelis
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2024, 18 (02) : 236 - 246