Human Pose Estimation Based on Lightweight Multi-Scale Coordinate Attention

被引：5

作者：

Li, Xin ^{[1
]}

Guo, Yuxin ^{[1
]}

Pan, Weiguo ^{[1
]}

Liu, Hongzhe ^{[1
]}

Xu, Bingxin ^{[1
]}

机构：

[1] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 06期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

human pose estimation; attention mechanism; multi-scale feature extraction; NETWORK;

D O I：

10.3390/app13063614

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Heatmap-based traditional approaches for estimating human pose usually suffer from drawbacks such as high network complexity or suboptimal accuracy. Focusing on the issue of multi-person pose estimation without heatmaps, this paper proposes an end-to-end, lightweight human pose estimation network using a multi-scale coordinate attention mechanism based on the Yolo-Pose network to improve the overall network performance while ensuring the network is lightweight. Specifically, the lightweight network GhostNet was first integrated into the backbone to alleviate the problem of model redundancy and produce a significant number of effective feature maps. Then, by combining the coordinate attention mechanism, the sensitivity of our proposed network to direction and location perception was enhanced. Finally, the BiFPN module was fused to balance the feature information of different scales and further improve the expression ability of convolutional features. Experiments on the COCO 2017 dataset showed that, compared with the baseline method YOLO-Pose, the average accuracy of the proposed network on the COCO 2017 validation dataset was improved by 4.8% while minimizing the amount of network parameters and calculations. The experimental results demonstrated that our proposed method can improve the detection accuracy of human pose estimation while ensuring that the model is lightweight.

引用

页数：18

共 50 条

[1] An improved DenseNet model to classify the damage caused by cotton aphid [J].

Bao, Wenxia ;

Cheng, Tao ;

Zhou, Xin-Gen ;

Guo, Wei ;

Wang, Yuanyuan ;

Zhang, Xuan ;

Qiao, Hongbo ;

Zhang, Dongyan .

COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 203

[2] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[3] Cascaded Pyramid Network for Multi-Person Pose Estimation [J].

Chen, Yilun ;

Wang, Zhicheng ;

Peng, Yuxiang ;

Zhang, Zhiqiang ;

Yu, Gang ;

Sun, Jian .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7103-7112

[4] Monocular human pose estimation: A survey of deep learning-based methods [J].

Chen, Yucheng ;

Tian, Yingli ;

He, Mingyi .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 192

[5] HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation [J].

Cheng, Bowen ;

Xiao, Bin ;

Wang, Jingdong ;

Shi, Honghui ;

Huang, Thomas S. ;

Zhang, Lei .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5385-5394

[6] RMPE: Regional Multi-Person Pose Estimation [J].

Fang, Hao-Shu ;

Xie, Shuqin ;

Tai, Yu-Wing ;

Lu, Cewu .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2353-2362

[7]

Gadhiya R., 2021, PROC INT C CIRCUITS, P1

[8] Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression [J].

Geng, Zigang ;

Sun, Ke ;

Xiao, Bin ;

Zhang, Zhaoxiang ;

Wang, Jingdong .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14671-14681

[9] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].

Ghiasi, Golnaz ;

Lin, Tsung-Yi ;

Le, Quoc V. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038

[10] Human Pose Estimation from Monocular Images: A Comprehensive Survey [J].

Gong, Wenjuan ;

Zhang, Xuena ;

Gonzalez, Jordi ;

Sobral, Andrews ;

Bouwmans, Thierry ;

Tu, Changhe ;

Zahzah, El-hadi .

SENSORS, 2016, 16 (12)

← 1 2 3 4 5 →