YH-Pose: Human pose estimation in complex coal mine scenarios

被引:7
作者
Dong, Xiangqing [1 ,2 ]
Wang, Xichao [1 ,2 ,3 ]
Li, Baojiang [1 ,2 ]
Wang, Haiyan [1 ,2 ]
Chen, Guochu [1 ,2 ]
Cai, Meng [1 ,2 ]
机构
[1] Shanghai DianJi Univ, Sch Elect Engn, Shanghai, Peoples R China
[2] Shanghai DianJi Univ, Intelligent Decis & Control Technol Inst, Shanghai, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Lab Space Photoelect Detect & Percept, Minist Ind & Informat Technol, Nanjing, Jiangsu, Peoples R China
关键词
Human pose estimation; Spatio-temporal fusion; Heatmap; Chinese coal mine; Video frame;
D O I
10.1016/j.engappai.2023.107338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human pose estimation in continuous video frames captured from complex coal mine scenes is challenging. The video frames in this scene may suffer from problems such as uneven brightness, blurred image details, and excessive noise. Mainstream pose estimation methods have good detection performance for high-quality static images, but their accuracy and prediction rate can be significantly reduced in coal mine scenes. In this work, a yielding human pose estimation framework is proposed, termed YH-Pose. The framework strives to incorporate additional visual evidences from neighboring frames to facilitate the pose estimation of the current frame. Firstly, a human detector is introduced to locate the person's position in the video frame and extract global features. This can provide a good initialization for the latter keypoint detection, making the training process converge quickly. Secondly, heatmaps are used to encode the joint locations as Gaussian peaks, and a temporal road module (TRM) is designed, which encodes video frames at intervals. The module efficiently fuses spatio-temporal information through frame-rate groups in a hierarchical manner. Lastly, the spatial road module (SRM) learns from fused keypoint context features and predicts the location of the keypoints in the next frame. In addition, a dataset called Colliery-1 is proposed, which derives from underground surveillance video from chinese coal mines and consists of 3600 video clips. The experimental results on the Colliery-1 dataset indicate that the framework achieved an average accuracy of 82% and 80% on the training and test sets, respectively. Moreover, the framework achieved a 94.2% prediction rate for pose estimation. To further evaluate the effectiveness of the proposed method, some comparisons have been made between it and various mainstream methods using different metrics.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A Novel Technique to Detect and Track Multiple Objects in Dynamic Video Surveillance Systems
    Adimoolam, M.
    Mohan, Senthilkumar
    John, A.
    Srivastava, Gautam
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (04): : 112 - 120
  • [2] Ahn B, 2018, ASIAPAC SIGN INFO PR, P516, DOI 10.23919/APSIPA.2018.8659548
  • [3] Akama S., 2018, ITE Tech. Rep., V42, P53
  • [4] PoseTrack: A Benchmark for Human Pose Estimation and Tracking
    Andriluka, Mykhaylo
    Iqbal, Umar
    Insafutdinov, Eldar
    Pishchulin, Leonid
    Milan, Anton
    Gall, Juergen
    Schiele, Bernt
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5167 - 5176
  • [5] Doering A., 2018, ARXIV PREPRINT ARXIV
  • [6] [杜京义 Du Jingyi], 2023, [工矿自动化, Industry and Mine Automation], V49, P90
  • [7] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [8] Autonomous rectification behavior of coal mine safety hazards under a gambling mind: From an evolutionary game perspective
    He, Yinnan
    Qin, Ruxiang
    [J]. PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 169 : 840 - 849
  • [9] Prediction of bedload transport rate using a block combined network structure
    Hosseini, Seyed Abbas
    Shahri, Abbas Abbaszadeh
    Asheghi, Reza
    [J]. HYDROLOGICAL SCIENCES JOURNAL, 2022, 67 (01) : 117 - 128
  • [10] PoseTrack: Joint Multi-Person Pose Estimation and Tracking
    Iqbal, Umar
    Milan, Anton
    Gall, Juergen
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4654 - 4663