Human pose estimation with gated multi-scale feature fusion and spatial mutual information

被引：0

作者：

Xiaoming Zhao

Chenchen Guo

Qiang Zou

机构：

[1] Tianjin University,School of Microelectronics

[2] Tianjin International Joint Research Center for Internet of Things,Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology

[3] Tianjin University,undefined

来源：

The Visual Computer | 2023年 / 39卷

关键词：

Human pose estimation; Gate; Multi-scale feature fusion; Noisy information; Spatial mutual information;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Although human pose estimation has achieved great success, the ambiguity of joint prediction has not been well resolved, especially in complex situations (crowded scenes, occlusions, and unnormal poses). We think that is caused by the noisy information introduced by combining multi-level features by simply adding features at each position. To alleviate this problem, we propose a new structure of gated multi-scale feature fusion (GMSFF). This module aims to selectively import high-level features to make up for the missing semantic information of low-resolution feature maps. Inspired by the prior knowledge that the position information of joints can refer to each other, we propose a new fine-tuning strategy for pose estimation—spatial mutual information complementary module (SMICM). It can assist the model in better adjusting the current joint’s position by capturing the information contained in other joints and only adds a little computational cost. We evaluated our proposed method on four datasets: MPII Human Pose Dataset (MPII), COCO keypoint detection Dataset (COCO), Occluded Human Dataset (OCHuman), and CrowdPose Dataset. The experimental results show that with the deepening of the occlusion and crowding level of the datasets, the improvement becomes more and more obvious. In particular, a performance improvement of 2.2 AP was obtained on the OCHuman dataset. In addition, our modules are plug-and-play.

引用

页码：119 / 137

页数：18

共 50 条

[41] Feature subset selection for multi-scale neighborhood decision information system via mutual information
Zhang, Lujing
Lin, Guoping
Wei, Ling
Kou, Yi
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (01)
[42] Multi-scale Attention Aided Multi-Resolution Network for Human Pose Estimation
Selvam, Srinika
Mishra, Deepak
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 461 - 472
[43] LEARNED IMAGE COMPRESSION WITH MULTI-SCALE SPATIAL AND CONTEXTUAL INFORMATION FUSION
Liu, Ziyi
Wang, Hanli
Su, Taiyi
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 706 - 710
[44] Multi-scale parallel gated local feature transformer
Qu, Hangzhou
Hu, Zhuhua
Wu, Jiaqi
SCIENTIFIC REPORTS, 2025, 15 (01):
[45] A multi-scale feature fusion spatial–channel attention model for background subtraction
Yizhong Yang
Tingting Xia
Dajin Li
Zhang Zhang
Guangjun Xie
Multimedia Systems, 2023, 29 : 3609 - 3623
[46] Multi-scale and Self-mutual Feature Distillation
Qiao, Nianzu
Sun, Jia
Dong, Lu
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 438 - 448
[47] A multi-scale feature extraction fusion model for human activity recognition
Zhang, Chuanlin
Cao, Kai
Lu, Limeng
Deng, Tao
SCIENTIFIC REPORTS, 2022, 12 (01)
[48] Multi-scale Dynamic Human Fatigue Detection with Feature Level Fusion
Fan, Xiao
Sun, Yanfeng
Yin, Baocai
2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 134 - 139
[49] MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
Zhou, Liming
Zhao, Shuai
Wan, Ziye
Liu, Yang
Wang, Yadi
Zuo, Xianyu
DRONES, 2024, 8 (05)
[50] Robust human gesture recognition by leveraging multi-scale feature fusion
Deng, Minwei
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 83

← 1 2 3 4 5 →