MEMe: A Mutually Enhanced Modeling Method for Efficient and Effective Human Pose Estimation

被引:7
作者
Li, Jie [1 ,2 ,3 ]
Wang, Zhixing [1 ,2 ,3 ,4 ]
Qi, Bo [1 ,2 ,3 ]
Zhang, Jianlin [1 ,2 ,3 ]
Yang, Hu [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Opt Engn, Chengdu 610209, Peoples R China
[2] Chinese Acad Sci, Inst Opt & Elect, Chengdu 610209, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100039, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 610209, Peoples R China
关键词
human pose estimation; deep learning; mutually enhanced; efficient and effective; modeling method; extended convolutions; feature fusion; attention mechanisms; CNN; NETWORK;
D O I
10.3390/s22020632
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, a mutually enhanced modeling method (MEMe) is presented for human pose estimation, which focuses on enhancing lightweight model performance, but with low complexity. To obtain higher accuracy, a traditional model scale is largely expanded with heavy deployment difficulties. However, for a more lightweight model, there is a large performance gap compared to the former; thus, an urgent need for a way to fill it. Therefore, we propose a MEMe to reconstruct a lightweight baseline model, EffBase transferred intuitively from EfficientDet, into the efficient and effective pose (EEffPose) net, which contains three mutually enhanced modules: the Enhanced EffNet (EEffNet) backbone, the total fusion neck (TFNeck), and the final attention head (FAHead). Extensive experiments on COCO and MPII benchmarks show that our MEMe-based models reach state-of-the-art performances, with limited parameters. Specifically, in the same conditions, our EEffPose-P0 with 256 x 192 can use only 8.98 M parameters to achieve 75.4 AP on the COCO val set, which outperforms HRNet-W48, but with only 14% of its parameters.
引用
收藏
页数:15
相关论文
共 40 条
  • [1] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
    Andriluka, Mykhaylo
    Pishchulin, Leonid
    Gehler, Peter
    Schiele, Bernt
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
  • [2] [Anonymous], 2014, Advances in neural information processing systems
  • [3] A Systematic Review of the Application of Camera-Based Human Pose Estimation in the Field of Sport and Physical Exercise
    Badiola-Bengoa, Aritz
    Mendez-Zorrilla, Amaia
    [J]. SENSORS, 2021, 21 (18)
  • [4] Toward fast and accurate human pose estimation via soft-gated skip connections
    Bulat, Adrian
    Kossaifi, Jean
    Tzimiropoulos, Georgios
    Pantic, Maja
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 8 - 15
  • [5] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [6] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [7] Cascaded Pyramid Network for Multi-Person Pose Estimation
    Chen, Yilun
    Wang, Zhicheng
    Peng, Yuxiang
    Zhang, Zhiqiang
    Yu, Gang
    Sun, Jian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
  • [8] Monocular human pose estimation: A survey of deep learning-based methods
    Chen, Yucheng
    Tian, Yingli
    He, Mingyi
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 192
  • [9] Debnath B, 2018, 2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P331
  • [10] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362