Unbiased feature position alignment for human pose estimation

Cited by: 1

Authors:

Wang, Chen [1]

Zhou, Yanghong [1]

Zhang, Feng [2]

Mok, P. Y. [1]

Affiliations:

[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Nanjing 210003, Peoples R China

Source:

NEUROCOMPUTING | 2023 / Vol. 537

Funding:

National Natural Science Foundation of China;

Keywords:

Multi-scale fusion; Position misalignment; Unbiased feature position alignment; Unbiased human pose model; Human pose estimation; NETWORK;

DOI:

10.1016/j.neucom.2023.03.063

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405;

Abstract:

Multi-scale feature fusion is a commonly used module in existing deep-learning models, and feature misalignment occurs during feature fusion. This spatial misalignment hinders the learning of multi-scale semantic representations, yet it has received little attention. The misalignment is caused by the feature position shift introduced by the convolution and interpolation operations used in feature fusion. To solve the misalignment problem, this paper formulates the shift error mathematically and proposes a plug-and-play unbiased feature position alignment strategy that aligns convolution with interpolation. As a model-agnostic approach, unbiased feature position alignment can boost the performance of different models without introducing extra parameters. Furthermore, unbiased feature position alignment is applied to build an unbiased human pose estimation method. Experimental results demonstrate the effectiveness of the proposed unbiased pose model in comparison with state-of-the-art methods, especially at low input resolutions. The code is shared at https://github.com/WangChen100/Unbiased-Feature-Position-Alignment-for-Human-Pose-Estimation. (c) 2023 Elsevier B.V. All rights reserved.
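The kind of position shift the abstract describes can be illustrated with a small 1D sketch. This is not the paper's formulation or code. It assumes one particular pairing of conventions: stride-2 subsampling that keeps even indices (standing in for a strided convolution) and linear upsampling with half-pixel centers. The function names `downsample_stride2` and `upsample_half_pixel` are hypothetical helpers introduced only for this illustration.

```python
import numpy as np

def downsample_stride2(x):
    # Assumption: stand-in for a stride-2 convolution with a 1-tap kernel,
    # which keeps the samples at even indices 0, 2, 4, ...
    return x[::2]

def upsample_half_pixel(d, scale=2):
    # Linear upsampling under the half-pixel-center convention:
    # output index i maps to source coordinate (i + 0.5) / scale - 0.5.
    n_out = len(d) * scale
    src = (np.arange(n_out) + 0.5) / scale - 0.5
    src = np.clip(src, 0, len(d) - 1)  # clamp at the borders
    return np.interp(src, np.arange(len(d)), d)

# A ramp whose value equals its position, so any value error is a position error.
x = np.arange(16, dtype=float)
y = upsample_half_pixel(downsample_stride2(x))

# Away from the borders, the round trip is offset by a constant half pixel.
interior_shift = (y - x)[1:-1]
print(interior_shift)
```

Under these assumed conventions, every interior sample comes back half a pixel off, even though the input is a perfectly linear signal. An aligned pairing of the two operations would leave the ramp unchanged in the interior.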

Pages: 152-163

Page count: 12

