Vision-Based Body Pose Estimation of Excavator Using a Transformer-Based Deep-Learning Model

被引:1
作者
Ji, Ankang [1 ,2 ]
Fan, Hongqin [1 ]
Xue, Xiaolong [3 ]
机构
[1] Hong Kong Polytech Univ, Dept Bldg & Real Estate, Hong Kong 999077, Peoples R China
[2] Hong Kong Polytech Univ, Shenzhen Res Inst, Shenzhen 518057, Guangdong, Peoples R China
[3] Guangzhou Univ, Sch Management, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer vision; Deep learning; Body pose estimation; Excavator; Transformer;
D O I
10.1061/JCCEE5.CPENG-6079
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Devoted to safety, efficiency, and productivity management on construction sites, a deep-learning method termed transformer-based mechanical equipment pose network (TransMPNet) is proposed in this research to work on images for the body pose estimation of excavators in effective and efficient ways. TransMPNet contains data processing, an ensemble model coupled with DenseNet201, an improved transformer module, a loss function, and evaluation metrics to perform feature processing and learning for accurate results. To verify the effectiveness and efficiency of the method, a publicly available image database of excavator body poses is adopted for experimental testing and validation. The results indicate that TransMPNet provides excellent performance with a mean-square error (MSE) of 218.626, a root-MSE (RMSE) of 14.786, an average normalized error (NE) of 26.289x10-3, and an average area under the curve (AUC) of 74.487x10-3, and it significantly outperforms other state-of-the-art methods such as the cascaded pyramid network (CPN) and the stacked hourglass network (SHG) in terms of evaluation metrics. Accordingly, TransMPNet contributes to excavator body pose estimation, thereby providing more effective and accurate results with great potential for practical application in on-site construction management.
引用
收藏
页数:20
相关论文
共 55 条
  • [1] Alexey D, 2020, arXiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [2] Computer vision for anatomical analysis of equipment in civil infrastructure projects: Theorizing the development of regression-based deep neural networks
    Arashpour, Mehrdad
    Kamat, Vineet
    Heidarpour, Amin
    Hosseini, M. Reza
    Gill, Peter
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 137
  • [3] Excavator 3D pose estimation using deep learning and hybrid datasets
    Assadzadeh, Amin
    Arashpour, Mehrdad
    Li, Heng
    Hosseini, Reza
    Elghaish, Faris
    Baduge, Shanaka
    [J]. ADVANCED ENGINEERING INFORMATICS, 2023, 55
  • [4] Vision-based excavator pose estimation using synthetically generated datasets with domain randomization
    Assadzadeh, Amin
    Arashpour, Mehrdad
    Brilakis, Ioannis
    Ngo, Tuan
    Konstantinou, Eirini
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 134
  • [5] Synthesizing Pose Sequences from 3D Assets for Vision-Based Activity Analysis
    Calderon, Wilfredo Torres
    Roberts, Dominic
    Golparvar-Fard, Mani
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2021, 35 (01)
  • [6] Automatic vision-based calculation of excavator earthmoving productivity using zero-shot learning activity recognition
    Chen, Chen
    Xiao, Bo
    Zhang, Yuxuan
    Zhu, Zhenhua
    [J]. AUTOMATION IN CONSTRUCTION, 2023, 146
  • [7] Automatic Identification of Idling Reasons in Excavation Operations Based on Excavator-Truck Relationships
    Chen, Chen
    Zhu, Zhenhua
    Hammad, Amin
    Akbarzadeh, Mohammad
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2021, 35 (05)
  • [8] GasHis-Transformer: A multi-scale visual transformer approach for gastric histopathological image detection
    Chen, Haoyuan
    Li, Chen
    Wang, Ge
    Li, Xiaoyan
    Rahaman, Md Mamunur
    Sun, Hongzan
    Hu, Weiming
    Li, Yixin
    Liu, Wanli
    Sun, Changhao
    Ai, Shiliang
    Grzegorzek, Marcin
    [J]. PATTERN RECOGNITION, 2022, 130
  • [9] Performance evaluation of ultra wideband technology for construction resource location tracking in harsh environments
    Cheng, T.
    Venugopal, M.
    Teizer, J.
    Vela, P. A.
    [J]. AUTOMATION IN CONSTRUCTION, 2011, 20 (08) : 1173 - 1184
  • [10] Sensing, perception, decision, planning and action of autonomous excavators
    Eraliev, Oybek Maripjon Ugli
    Lee, Kwang-Hee
    Shin, Dae-Young
    Lee, Chul-Hee
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 141