Vision-Based Body Pose Estimation of Excavator Using a Transformer-Based Deep-Learning Model

被引:1
作者
Ji, Ankang [1 ,2 ]
Fan, Hongqin [1 ]
Xue, Xiaolong [3 ]
机构
[1] Hong Kong Polytech Univ, Dept Bldg & Real Estate, Hong Kong 999077, Peoples R China
[2] Hong Kong Polytech Univ, Shenzhen Res Inst, Shenzhen 518057, Guangdong, Peoples R China
[3] Guangzhou Univ, Sch Management, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer vision; Deep learning; Body pose estimation; Excavator; Transformer;
D O I
10.1061/JCCEE5.CPENG-6079
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Devoted to safety, efficiency, and productivity management on construction sites, a deep-learning method termed transformer-based mechanical equipment pose network (TransMPNet) is proposed in this research to work on images for the body pose estimation of excavators in effective and efficient ways. TransMPNet contains data processing, an ensemble model coupled with DenseNet201, an improved transformer module, a loss function, and evaluation metrics to perform feature processing and learning for accurate results. To verify the effectiveness and efficiency of the method, a publicly available image database of excavator body poses is adopted for experimental testing and validation. The results indicate that TransMPNet provides excellent performance with a mean-square error (MSE) of 218.626, a root-MSE (RMSE) of 14.786, an average normalized error (NE) of 26.289x10-3, and an average area under the curve (AUC) of 74.487x10-3, and it significantly outperforms other state-of-the-art methods such as the cascaded pyramid network (CPN) and the stacked hourglass network (SHG) in terms of evaluation metrics. Accordingly, TransMPNet contributes to excavator body pose estimation, thereby providing more effective and accurate results with great potential for practical application in on-site construction management.
引用
收藏
页数:20
相关论文
共 55 条
  • [21] Application of dynamic time warping to the recognition of mixed equipment activities in cycle time measurement
    Kim, Hyunsoo
    Ahn, Changbum R.
    Engelhaupt, David
    Lee, SangHyun
    [J]. AUTOMATION IN CONSTRUCTION, 2018, 87 : 225 - 234
  • [22] YOLO with adaptive frame control for real-time object detection applications
    Lee, Jeonghun
    Hwang, Kwang-il
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 36375 - 36396
  • [23] Synthetic Image Dataset Development for Vision-Based Construction Equipment Detection
    Lee, Jin Gang
    Hwang, Jeongbin
    Chi, Seokho
    Seo, JoonOh
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2022, 36 (05)
  • [24] Novel Recursive BiFPN Combining with Swin Transformer for Wildland Fire Smoke Detection
    Li, Ao
    Zhao, Yaqin
    Zheng, Zhaoxiang
    [J]. FORESTS, 2022, 13 (12):
  • [25] A vision-based marker-less pose estimation system for articulated construction robots
    Liang, Ci-Jyun
    Lundeen, Kurt M.
    McGee, Wes
    Menassa, Carol C.
    Lee, SangHyun
    Kamat, Vineet R.
    [J]. AUTOMATION IN CONSTRUCTION, 2019, 104 : 80 - 94
  • [26] Multisensory and BIM-Integrated Digital Twin to Improve Urban Excavation Safety
    Liu, Donghai
    Sun, Chenyang
    Chen, Junjie
    Liu, Lei
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2023, 37 (05)
  • [27] A New Measurement Method of Real-time Pose Estimation for an Automatic Hydraulic Excavator
    Liu, Guangxu
    Wang, Qingfeng
    Wang, Tao
    [J]. 2022 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2022, : 308 - 313
  • [28] Optical marker-based end effector pose estimation for articulated excavators
    Lundeen, Kurt M.
    Dong, Suyang
    Fredricks, Nicholas
    Akula, Manu
    Seo, Jongwon
    Kamat, Vineet R.
    [J]. AUTOMATION IN CONSTRUCTION, 2016, 65 : 51 - 64
  • [29] Construction machine pose prediction considering historical motions and activity attributes using gated recurrent unit (GRU)
    Luo, Han
    Wang, Mingzhu
    Wong, Peter Kok-Yiu
    Tang, Jingyuan
    Cheng, Jack C. P.
    [J]. AUTOMATION IN CONSTRUCTION, 2021, 121
  • [30] Full body pose estimation of construction equipment using computer vision and deep learning techniques
    Luo, Han
    Wang, Mingzhu
    Wong, Peter Kok-Yiu
    Cheng, Jack C. P.
    [J]. AUTOMATION IN CONSTRUCTION, 2020, 110