Unbiased feature position alignment for human pose estimation

Times cited: 1
Authors
Wang, Chen [1]
Zhou, Yanghong [1]
Zhang, Feng [2]
Mok, P. Y. [1]
Affiliations
[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Nanjing 210003, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Multi-scale fusion; Position misalignment; Unbiased feature position alignment; Unbiased human pose model; Human pose estimation; NETWORK;
DOI
10.1016/j.neucom.2023.03.063
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multi-scale feature fusion is a commonly used module in existing deep-learning models, and feature misalignment occurs during the fusion process. This spatial misalignment hinders the learning of semantic representations across multiple scales, yet it has received little attention. The misalignment is caused by the shift in feature position that arises when convolution and interpolation operations are used in feature fusion. To solve the misalignment problem, this paper formulates the shift error mathematically and proposes a plug-and-play unbiased feature position alignment strategy that aligns convolution with interpolation. As a model-agnostic approach, unbiased feature position alignment can boost the performance of different models without introducing extra parameters. Furthermore, unbiased feature position alignment is applied to build an unbiased human pose estimation method. Experimental results demonstrate the effectiveness of the proposed unbiased pose model in comparison with state-of-the-art methods, especially in low-resolution settings. The code is shared at https://github.com/WangChen100/Unbiased-Feature-Position-Alignment-for-Human-Pose-Estimation. (c) 2023 Elsevier B.V. All rights reserved.
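The abstract attributes the misalignment to a position shift between convolution and interpolation but does not give the formula. The following is a minimal one-dimensional sketch of how such a shift can arise, under assumptions that are not taken from the paper: downsampling is modelled as keeping every second sample (as a stride-2 kernel centred on even pixels would), and upsampling is linear interpolation under either a half-pixel convention or a grid-aligned convention. The function names `downsample_stride2` and `upsample_linear` are hypothetical, and this is not the authors' alignment strategy.

```python
def downsample_stride2(signal):
    """Keep every second sample, so low-res index i sits at input position 2*i."""
    return signal[::2]


def upsample_linear(low, out_len, half_pixel):
    """Linear 2x upsampling under one of two (assumed) coordinate conventions."""
    out = []
    for j in range(out_len):
        # Source coordinate of output pixel j, expressed in the low-res grid.
        src = (j + 0.5) / 2 - 0.5 if half_pixel else j / 2
        src = min(max(src, 0.0), len(low) - 1.0)  # clamp at the borders
        i0 = int(src)
        i1 = min(i0 + 1, len(low) - 1)
        w = src - i0
        out.append((1 - w) * low[i0] + w * low[i1])
    return out


# A ramp f(x) = x: each value equals its own position, so any difference
# between input and reconstruction reads directly as a positional shift.
ramp = [float(x) for x in range(16)]
low = downsample_stride2(ramp)

biased = upsample_linear(low, len(ramp), half_pixel=True)
aligned = upsample_linear(low, len(ramp), half_pixel=False)

# Compare interior pixels only, away from border clamping.
interior = range(2, 13)
shift_biased = [ramp[j] - biased[j] for j in interior]
shift_aligned = [ramp[j] - aligned[j] for j in interior]

print("shift with mismatched conventions:", shift_biased[0])
print("shift with matched conventions:   ", shift_aligned[0])
```

In this toy setting the mismatched pair reconstructs the ramp displaced by a constant half pixel, while the matched pair reproduces it in the interior. Whether this is the specific shift error the paper formulates cannot be confirmed from the abstract alone.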
Pages: 152-163
Number of pages: 12