Although human pose estimation has achieved great success, the ambiguity of joint prediction has not been well resolved, especially in complex situations (crowded scenes, occlusions, and abnormal poses). We attribute this to the noise introduced when multi-level features are fused by simply adding features at each position. To alleviate this problem, we propose a gated multi-scale feature fusion (GMSFF) module, which selectively imports high-level features to compensate for the semantic information missing from high-resolution feature maps. Inspired by the prior knowledge that the positions of joints can inform one another, we further propose a new refinement strategy for pose estimation, the spatial mutual information complementary module (SMICM). By capturing the information contained in the other joints, it helps the model adjust each joint's position at only a small additional computational cost. We evaluated the proposed method on four datasets: the MPII Human Pose Dataset (MPII), the COCO Keypoint Detection Dataset (COCO), the Occluded Human Dataset (OCHuman), and the CrowdPose Dataset. The experimental results show that the improvement grows as the occlusion and crowding levels of the datasets increase; in particular, we obtained a gain of 2.2 AP on the OCHuman dataset. In addition, both modules are plug-and-play.
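As a minimal illustration of the gating idea (not the authors' actual GMSFF implementation, whose details the abstract does not specify), the following PyTorch sketch replaces plain element-wise addition with a learned per-position gate that decides how much upsampled high-level semantic information to import into a high-resolution feature map. The class name, the 1x1-convolution gate, and all shapes are assumptions for illustration only.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GatedFusion(nn.Module):
    """Hypothetical sketch of gated multi-scale feature fusion.

    Instead of adding the upsampled high-level feature map to the
    high-resolution map directly, a learned per-position gate decides
    how much high-level semantic information to import at each location.
    """

    def __init__(self, channels: int):
        super().__init__()
        # Gate computed from both feature maps (assumed design, not the paper's).
        self.gate = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, low: torch.Tensor, high: torch.Tensor) -> torch.Tensor:
        # Upsample the semantically rich, low-resolution map to match.
        high = F.interpolate(high, size=low.shape[-2:], mode="bilinear",
                             align_corners=False)
        g = self.gate(torch.cat([low, high], dim=1))  # per-position gate in [0, 1]
        # Plain fusion would be `low + high`; the gate suppresses noisy channels.
        return low + g * high


# Usage: fuse a 64x48 high-resolution map with a 32x24 high-level map.
fuse = GatedFusion(channels=32)
low = torch.randn(1, 32, 64, 48)
high = torch.randn(1, 32, 32, 24)
print(fuse(low, high).shape)  # torch.Size([1, 32, 64, 48])
```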
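The abstract only states that SMICM lets joints exchange spatial information at low cost; one plausible reading, sketched below purely for illustration, is an attention step in which each predicted joint attends to all others and emits a small coordinate correction. Every name and design choice here (the per-joint identity embedding, the attention head count, the offset head) is an assumption, not the authors' module.

```python
import torch
import torch.nn as nn


class JointInteraction(nn.Module):
    """Illustrative sketch: refine each joint using cues from the others."""

    def __init__(self, num_joints: int, dim: int):
        super().__init__()
        self.embed = nn.Linear(2, dim)                 # embed (x, y) coordinates
        self.joint_id = nn.Parameter(torch.zeros(num_joints, dim))  # joint identity
        self.attn = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)
        self.offset = nn.Linear(dim, 2)                # predict a coordinate correction

    def forward(self, coords: torch.Tensor) -> torch.Tensor:
        # coords: (batch, num_joints, 2) initial joint predictions
        x = self.embed(coords) + self.joint_id
        ctx, _ = self.attn(x, x, x)                    # each joint gathers cues from all joints
        return coords + self.offset(ctx)               # refined positions


refine = JointInteraction(num_joints=17, dim=64)
coords = torch.rand(1, 17, 2)                          # e.g. normalized COCO keypoints
print(refine(coords).shape)                            # torch.Size([1, 17, 2])
```

The attention matrix here is only `num_joints x num_joints` (17x17 for COCO), which is consistent with the abstract's claim that cross-joint reasoning adds little computational cost.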