Transformer-based deep learning model and video dataset for unsafe action identification in construction projects

被引：31

作者：

Yang, Meng ^{[1
,2
]}

Wu, Chengke ^{[1
]}

Guo, Yuanjun ^{[1
,2
]}

Jiang, Rui ^{[1
]}

Zhou, Feixiang ^{[3
]}

Zhang, Jianlin ^{[4
]}

Yang, Zhile ^{[1
,2
,5
]}

机构：

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England

[4] China Construct Sci & Technol Grp Cooperat, Room 703,G3 Bldg,TCL Int Ecity Nanshan, Shenzhen, Peoples R China

[5] Guangdong Inst Carbon Neutral, Bldg 41,Huangshaping Innovat Pk,Phase1, Shaoguan, Peoples R China

来源：

AUTOMATION IN CONSTRUCTION | 2023年 / 146卷

关键词：

Action recognition; Construction safety; Transformer; Deep learning; ACTION RECOGNITION; VISION; CAPTURE; FALLS;

D O I：

10.1016/j.autcon.2022.104703

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

A large proportion of construction accidents are caused by unintentional and unsafe actions and behaviors. It is of significant difficulties and ineffectiveness to monitor unsafe behaviors using conventional manual supervision due to the complex and dynamic working conditions on construction sites. Recently, surveillance videos and computer vision (CV) techniques have been increasingly adopted to automatically identify risky behaviors. However, the challenge remains that spatial and temporal features in video clips cannot be effectively captured and fused by current CV models. To address this challenge, this paper describes a deep learning model named Spatial Temporal Relation Transformer (STR-Transformer), where spatial and temporal features of work behaviors are simultaneously extracted in paralleling video streams and then fused by a specially designed module. To verify the effectiveness of the STR-Transformer, a customized dataset is developed, including seven categories of construction worker behaviors and 1595 video clips. In numerical experiments and case studies, the STR-Transformer achieves an average precision of 88.7%, 4.0% higher than the baseline model. The STR-Transformer enables more accurate and reliable automatic safety surveillance on construction projects, and is expected to reduce accident rates and management costs. Moreover, the performance of STR-Transformer relies on efficient feature integration, which may inspire future studies to identify, extract, and fuse richer features when applying CV-based deep learning models in construction management.

引用

页数：14

共 50 条

[41] Patent image retrieval using transformer-based deep metric learning
Higuchi, Kotaro
Yanai, Keiji
WORLD PATENT INFORMATION, 2023, 74
[42] A Transformer-Based Framework for Parameter Learning of a Land Surface Hydrological Process Model
Li, Klin
Lu, Yutong
REMOTE SENSING, 2023, 15 (14)
[43] Estimating finger joint angles by surface EMG signal using feature extraction and transformer-based deep learning model
Putro, Nur Achmad Sulistyo
Avian, Cries
Prakosa, Setya Widyawan
Mahali, Muhammad Izzuddin
Leu, Jenq-Shiou
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
[44] Transformer-based deep learning for predicting protein properties in the life sciences
Chandra, Abel
Tunnermann, Laura
Lofstedt, Tommy
Gratz, Regina
ELIFE, 2023, 12
[45] Broadband Solar Metamaterial Absorbers Empowered by Transformer-Based Deep Learning
Chen, Wei
Gao, Yuan
Li, Yuyang
Yan, Yiming
Ou, Jun-Yu
Ma, Wenzhuang
Zhu, Jinfeng
ADVANCED SCIENCE, 2023, 10 (13)
[46] A Transformer-Based Deep Learning Network for Underwater Acoustic Target Recognition
Feng, Sheng
Zhu, Xiaoqian
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[47] Multidomain transformer-based deep learning for early detection of network intrusion
Liu, Jinxin
Simsek, Murat
Nogueira, Michele
Kantarci, Burak
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6056 - 6061
[48] Transformer-based Reinforcement Learning Model for Optimized Quantitative Trading
Kumar, Aniket
Rizk, Rodrigue
Santosh, K. C.
2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1454 - 1455
[49] CWPR: An optimized transformer-based model for construction worker pose estimation on construction robots
Zhou, Jiakai
Zhou, Wanlin
Wang, Yang
ADVANCED ENGINEERING INFORMATICS, 2024, 62
[50] Population-Specific Glucose Prediction in Diabetes Care With Transformer-Based Deep Learning on the Edge
Zhu, Taiyu
Kuang, Lei
Piao, Chengzhe
Zeng, Junming
Li, Kezhi
Georgiou, Pantelis
IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2024, 18 (02) : 236 - 246

← 1 2 3 4 5 →