Human pose estimation with gated multi-scale feature fusion and spatial mutual information

被引:0
|
作者
Xiaoming Zhao
Chenchen Guo
Qiang Zou
机构
[1] Tianjin University,School of Microelectronics
[2] Tianjin International Joint Research Center for Internet of Things,Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology
[3] Tianjin University,undefined
来源
The Visual Computer | 2023年 / 39卷
关键词
Human pose estimation; Gate; Multi-scale feature fusion; Noisy information; Spatial mutual information;
D O I
暂无
中图分类号
学科分类号
摘要
Although human pose estimation has achieved great success, the ambiguity of joint prediction has not been well resolved, especially in complex situations (crowded scenes, occlusions, and unnormal poses). We think that is caused by the noisy information introduced by combining multi-level features by simply adding features at each position. To alleviate this problem, we propose a new structure of gated multi-scale feature fusion (GMSFF). This module aims to selectively import high-level features to make up for the missing semantic information of low-resolution feature maps. Inspired by the prior knowledge that the position information of joints can refer to each other, we propose a new fine-tuning strategy for pose estimation—spatial mutual information complementary module (SMICM). It can assist the model in better adjusting the current joint’s position by capturing the information contained in other joints and only adds a little computational cost. We evaluated our proposed method on four datasets: MPII Human Pose Dataset (MPII), COCO keypoint detection Dataset (COCO), Occluded Human Dataset (OCHuman), and CrowdPose Dataset. The experimental results show that with the deepening of the occlusion and crowding level of the datasets, the improvement becomes more and more obvious. In particular, a performance improvement of 2.2 AP was obtained on the OCHuman dataset. In addition, our modules are plug-and-play.
引用
收藏
页码:119 / 137
页数:18
相关论文
共 50 条
  • [41] Feature subset selection for multi-scale neighborhood decision information system via mutual information
    Zhang, Lujing
    Lin, Guoping
    Wei, Ling
    Kou, Yi
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (01)
  • [42] Multi-scale Attention Aided Multi-Resolution Network for Human Pose Estimation
    Selvam, Srinika
    Mishra, Deepak
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 461 - 472
  • [43] LEARNED IMAGE COMPRESSION WITH MULTI-SCALE SPATIAL AND CONTEXTUAL INFORMATION FUSION
    Liu, Ziyi
    Wang, Hanli
    Su, Taiyi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 706 - 710
  • [44] Multi-scale parallel gated local feature transformer
    Qu, Hangzhou
    Hu, Zhuhua
    Wu, Jiaqi
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [45] A multi-scale feature fusion spatial–channel attention model for background subtraction
    Yizhong Yang
    Tingting Xia
    Dajin Li
    Zhang Zhang
    Guangjun Xie
    Multimedia Systems, 2023, 29 : 3609 - 3623
  • [46] Multi-scale and Self-mutual Feature Distillation
    Qiao, Nianzu
    Sun, Jia
    Dong, Lu
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 438 - 448
  • [47] A multi-scale feature extraction fusion model for human activity recognition
    Zhang, Chuanlin
    Cao, Kai
    Lu, Limeng
    Deng, Tao
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [48] Multi-scale Dynamic Human Fatigue Detection with Feature Level Fusion
    Fan, Xiao
    Sun, Yanfeng
    Yin, Baocai
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 134 - 139
  • [49] MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
    Zhou, Liming
    Zhao, Shuai
    Wan, Ziye
    Liu, Yang
    Wang, Yadi
    Zuo, Xianyu
    DRONES, 2024, 8 (05)
  • [50] Robust human gesture recognition by leveraging multi-scale feature fusion
    Deng, Minwei
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 83