Unbiased feature position alignment for human pose estimation

被引:1
作者
Wang, Chen [1 ]
Zhou, Yanghong [1 ]
Zhang, Feng [2 ]
Mok, P. Y. [1 ]
机构
[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Nanjing 210003, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -scale fusion; Position misalignment; Unbiased feature position alignment; Unbiased human pose model; Human pose estimation; NETWORK;
D O I
10.1016/j.neucom.2023.03.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-scale feature fusion is a commonly-used module in existing deep-learning models, and feature misalignment occurs in the process of feature fusion. The spatial misalignment hinders the learning of semantic representation with multi-scale levels, but which has not received much attention. This misalignment problem is caused by the feature position shift after using the convolution and interpolation operation in feature fusion. To solve the misalignment problem, this paper formulates the shift error mathematically and proposes a plug-and-play unbiased feature position alignment strategy to align convolution with interpolation. As a model-agnostic approach, unbiased feature position alignment can boost the performance of different models without introducing extra parameters. Furthermore, the unbiased feature position alignment is applied to build an unbiased human pose estimation method. Experimental results have demonstrated the effectiveness of the proposed unbiased pose model in comparison to the state-of-the-arts, especially in the low-resolution field. The codes are shared at https:// github.com/WangChen100/Unbiased-Feature-Position-Alignment-for-Human-Pose-Estimation.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:152 / 163
页数:12
相关论文
共 41 条
  • [21] Indices Matter: Learning to Index for Deep Image Matting
    Lu, Hao
    Dai, Yutong
    Shen, Chunhua
    Xu, Songcen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3265 - 3274
  • [22] Mazzini Davide, 2018, British Machine Vision Conference 2018, BMVC 2018, Newcastle, UK, DOI DOI 10.1109/ICCE-BERLIN.2018.8576193
  • [23] Stacked Hourglass Networks for Human Pose Estimation
    Newell, Alejandro
    Yang, Kaiyu
    Deng, Jia
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 483 - 499
  • [24] Rafi U., 2016, BMVC, DOI [10.5244/C.30.109, DOI 10.5244/C.30.109]
  • [25] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [26] Densely connected attentional pyramid residual network for human pose estimation
    Tian, Yan
    Hu, Wei
    Jiang, Hangsen
    Wu, Jiachen
    [J]. NEUROCOMPUTING, 2019, 347 : 13 - 23
  • [27] Tompson J, 2015, PROC CVPR IEEE, P648, DOI 10.1109/CVPR.2015.7298664
  • [28] DeepPose: Human Pose Estimation via Deep Neural Networks
    Toshev, Alexander
    Szegedy, Christian
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1653 - 1660
  • [29] Wang C., 2022, PATTERN RECOGN, V126
  • [30] Deep High-Resolution Representation Learning for Visual Recognition
    Wang, Jingdong
    Sun, Ke
    Cheng, Tianheng
    Jiang, Borui
    Deng, Chaorui
    Zhao, Yang
    Liu, Dong
    Mu, Yadong
    Tan, Mingkui
    Wang, Xinggang
    Liu, Wenyu
    Xiao, Bin
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3349 - 3364