YH-Pose: Human pose estimation in complex coal mine scenarios

被引：7

作者：

Dong, Xiangqing ^{[1
,2
]}

Wang, Xichao ^{[1
,2
,3
]}

Li, Baojiang ^{[1
,2
]}

Wang, Haiyan ^{[1
,2
]}

Chen, Guochu ^{[1
,2
]}

Cai, Meng ^{[1
,2
]}

机构：

[1] Shanghai DianJi Univ, Sch Elect Engn, Shanghai, Peoples R China

[2] Shanghai DianJi Univ, Intelligent Decis & Control Technol Inst, Shanghai, Peoples R China

[3] Nanjing Univ Aeronaut & Astronaut, Lab Space Photoelect Detect & Percept, Minist Ind & Informat Technol, Nanjing, Jiangsu, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 127卷

关键词：

Human pose estimation; Spatio-temporal fusion; Heatmap; Chinese coal mine; Video frame;

D O I：

10.1016/j.engappai.2023.107338

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Human pose estimation in continuous video frames captured from complex coal mine scenes is challenging. The video frames in this scene may suffer from problems such as uneven brightness, blurred image details, and excessive noise. Mainstream pose estimation methods have good detection performance for high-quality static images, but their accuracy and prediction rate can be significantly reduced in coal mine scenes. In this work, a yielding human pose estimation framework is proposed, termed YH-Pose. The framework strives to incorporate additional visual evidences from neighboring frames to facilitate the pose estimation of the current frame. Firstly, a human detector is introduced to locate the person's position in the video frame and extract global features. This can provide a good initialization for the latter keypoint detection, making the training process converge quickly. Secondly, heatmaps are used to encode the joint locations as Gaussian peaks, and a temporal road module (TRM) is designed, which encodes video frames at intervals. The module efficiently fuses spatio-temporal information through frame-rate groups in a hierarchical manner. Lastly, the spatial road module (SRM) learns from fused keypoint context features and predicts the location of the keypoints in the next frame. In addition, a dataset called Colliery-1 is proposed, which derives from underground surveillance video from chinese coal mines and consists of 3600 video clips. The experimental results on the Colliery-1 dataset indicate that the framework achieved an average accuracy of 82% and 80% on the training and test sets, respectively. Moreover, the framework achieved a 94.2% prediction rate for pose estimation. To further evaluate the effectiveness of the proposed method, some comparisons have been made between it and various mainstream methods using different metrics.

引用

页数：13

共 50 条

[1] A Novel Technique to Detect and Track Multiple Objects in Dynamic Video Surveillance Systems
Adimoolam, M.
Mohan, Senthilkumar
John, A.
Srivastava, Gautam
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (04): : 112 - 120
[2] Ahn B, 2018, ASIAPAC SIGN INFO PR, P516, DOI 10.23919/APSIPA.2018.8659548
[3] Akama S., 2018, ITE Tech. Rep., V42, P53
[4] PoseTrack: A Benchmark for Human Pose Estimation and Tracking
Andriluka, Mykhaylo
Iqbal, Umar
Insafutdinov, Eldar
Pishchulin, Leonid
Milan, Anton
Gall, Juergen
Schiele, Bernt
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5167 - 5176
[5] Doering A., 2018, ARXIV PREPRINT ARXIV
[6] [杜京义 Du Jingyi], 2023, [工矿自动化, Industry and Mine Automation], V49, P90
[7] RMPE: Regional Multi-Person Pose Estimation
Fang, Hao-Shu
Xie, Shuqin
Tai, Yu-Wing
Lu, Cewu
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
[8] Autonomous rectification behavior of coal mine safety hazards under a gambling mind: From an evolutionary game perspective
He, Yinnan
Qin, Ruxiang
[J]. PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 169 : 840 - 849
[9] Prediction of bedload transport rate using a block combined network structure
Hosseini, Seyed Abbas
Shahri, Abbas Abbaszadeh
Asheghi, Reza
[J]. HYDROLOGICAL SCIENCES JOURNAL, 2022, 67 (01) : 117 - 128
[10] PoseTrack: Joint Multi-Person Pose Estimation and Tracking
Iqbal, Umar
Milan, Anton
Gall, Juergen
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4654 - 4663

← 1 2 3 4 5 →