3D convolutional neural network-based one-stage model for real-time action detection in video of construction equipment

被引:33
|
作者
Jung, Seunghoon [1 ]
Jeoung, Jaewon [1 ]
Kang, Hyuna [1 ]
Hong, Taehoon [1 ]
机构
[1] Yonsei Univ, Dept Architecture & Architectural Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
ACTION RECOGNITION; PRODUCTIVITY; FEATURES; WORKERS;
D O I
10.1111/mice.12695
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study aims to propose a three-dimensional convolutional neural network (3D CNN)-based one-stage model for real-time action detection in video of construction equipment (ADVICE). The 3D CNN-based single-stream feature extraction network and detection network are designed with the implementation of the 3D attention module and feature pyramid network developed in this study to improve performance. For model evaluation, 130 videos were collected from YouTube including videos of four types of construction equipment at various construction sites. Trained on 520 clips and tested on 260 clips, ADVICE achieved precision and recall of 82.1% and 83.1%, respectively, with an inference speed of 36.6 frames per second. The evaluation results indicate that the proposed method can implement the 3D CNN-based one-stage model for real-time action detection of construction equipment in videos of diverse, variable, and complex construction sites. The proposed method paved the way to improving safety, productivity, and environmental management of construction projects.
引用
收藏
页码:126 / 142
页数:17
相关论文
共 50 条
  • [1] A 3D Convolutional Neural Network Towards Real-time Amodal 3D Object Detection
    Sun, Hao
    Meng, Zehui
    Du, Xinxin
    Ang, Marcelo H., Jr.
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 8331 - 8338
  • [2] Real-Time Video Saliency Prediction Via 3D Residual Convolutional Neural Network
    Sun, Zhenhao
    Wang, Xu
    Zhang, Qiudan
    Jiang, Jianmin
    IEEE ACCESS, 2019, 7 : 147743 - 147754
  • [3] A Convolutional Neural Network-Based Method for 3D Object Detection
    Li Y.
    Shi L.
    Wan W.
    Zhao Q.
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2018, 52 (01): : 7 - 12
  • [4] One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
    Li, Minle
    Hu, Yihua
    Zhao, Nanxiang
    Qian, Qishu
    SENSORS, 2019, 19 (06)
  • [5] Design of action detection system in wrestling match video based on 3D convolutional neural network
    Liu Y.
    Mei Q.
    Gan X.
    Zhu Y.
    Wang Y.
    International Journal of Wireless and Mobile Computing, 2022, 22 (01) : 29 - 37
  • [6] VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition
    Maturana, Daniel
    Scherer, Sebastian
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 922 - 928
  • [7] Smoke Video Detection Algorithm Based on 3D Convolutional Neural Network
    Shi, Zhen
    Sun, Rui
    Huo, Mingge
    Proceedings of the 34th Chinese Control and Decision Conference, CCDC 2022, 2022, : 692 - 697
  • [8] Smoke Video Detection Algorithm Based On 3D Convolutional Neural Network
    Shi, Zhen
    Sun, Rui
    Huo, Mingge
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 692 - 697
  • [9] Lightweight convolutional neural network for real-time 3D object detection in road and railway environments
    Mauri, A.
    Khemmar, R.
    Decoux, B.
    Haddad, M.
    Boutteau, R.
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (03) : 499 - 516
  • [10] Lightweight convolutional neural network for real-time 3D object detection in road and railway environments
    A. Mauri
    R. Khemmar
    B. Decoux
    M. Haddad
    R. Boutteau
    Journal of Real-Time Image Processing, 2022, 19 : 499 - 516