MEMe: A Mutually Enhanced Modeling Method for Efficient and Effective Human Pose Estimation

被引：7

作者：

Li, Jie ^{[1
,2
,3
]}

Wang, Zhixing ^{[1
,2
,3
,4
]}

Qi, Bo ^{[1
,2
,3
]}

Zhang, Jianlin ^{[1
,2
,3
]}

Yang, Hu ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Key Lab Opt Engn, Chengdu 610209, Peoples R China

[2] Chinese Acad Sci, Inst Opt & Elect, Chengdu 610209, Peoples R China

[3] Univ Chinese Acad Sci, Beijing 100039, Peoples R China

[4] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 610209, Peoples R China

来源：

SENSORS | 2022年 / 22卷 / 02期

关键词：

human pose estimation; deep learning; mutually enhanced; efficient and effective; modeling method; extended convolutions; feature fusion; attention mechanisms; CNN; NETWORK;

D O I：

10.3390/s22020632

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

In this paper, a mutually enhanced modeling method (MEMe) is presented for human pose estimation, which focuses on enhancing lightweight model performance, but with low complexity. To obtain higher accuracy, a traditional model scale is largely expanded with heavy deployment difficulties. However, for a more lightweight model, there is a large performance gap compared to the former; thus, an urgent need for a way to fill it. Therefore, we propose a MEMe to reconstruct a lightweight baseline model, EffBase transferred intuitively from EfficientDet, into the efficient and effective pose (EEffPose) net, which contains three mutually enhanced modules: the Enhanced EffNet (EEffNet) backbone, the total fusion neck (TFNeck), and the final attention head (FAHead). Extensive experiments on COCO and MPII benchmarks show that our MEMe-based models reach state-of-the-art performances, with limited parameters. Specifically, in the same conditions, our EEffPose-P0 with 256 x 192 can use only 8.98 M parameters to achieve 75.4 AP on the COCO val set, which outperforms HRNet-W48, but with only 14% of its parameters.

引用

页数：15

共 40 条

[1] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
Andriluka, Mykhaylo
Pishchulin, Leonid
Gehler, Peter
Schiele, Bernt
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
[2] [Anonymous], 2014, Advances in neural information processing systems
[3] A Systematic Review of the Application of Camera-Based Human Pose Estimation in the Field of Sport and Physical Exercise
Badiola-Bengoa, Aritz
Mendez-Zorrilla, Amaia
[J]. SENSORS, 2021, 21 (18)
[4] Toward fast and accurate human pose estimation via soft-gated skip connections
Bulat, Adrian
Kossaifi, Jean
Tzimiropoulos, Georgios
Pantic, Maja
[J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 8 - 15
[5] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Cao, Zhe
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
[6] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[7] Cascaded Pyramid Network for Multi-Person Pose Estimation
Chen, Yilun
Wang, Zhicheng
Peng, Yuxiang
Zhang, Zhiqiang
Yu, Gang
Sun, Jian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
[8] Monocular human pose estimation: A survey of deep learning-based methods
Chen, Yucheng
Tian, Yingli
He, Mingyi
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 192
[9] Debnath B, 2018, 2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P331
[10] RMPE: Regional Multi-Person Pose Estimation
Fang, Hao-Shu
Xie, Shuqin
Tai, Yu-Wing
Lu, Cewu
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362

← 1 2 3 4 →