Human pose estimation with gated multi-scale feature fusion and spatial mutual information

Authors
Xiaoming Zhao
Chenchen Guo
Qiang Zou
Affiliations
[1] Tianjin University, School of Microelectronics
[2] Tianjin International Joint Research Center for Internet of Things, Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology
[3] Tianjin University
Source
The Visual Computer | 2023 / Vol. 39
Keywords
Human pose estimation; Gate; Multi-scale feature fusion; Noisy information; Spatial mutual information
DOI
Not available
Abstract
Although human pose estimation has achieved great success, the ambiguity of joint prediction has not been well resolved, especially in complex situations (crowded scenes, occlusions, and unusual poses). We attribute this to the noisy information introduced when multi-level features are combined by simply adding them at each position. To alleviate this problem, we propose a new structure, gated multi-scale feature fusion (GMSFF). This module selectively imports high-level features to make up for the missing semantic information of low-resolution feature maps. Inspired by the prior knowledge that the positions of joints can inform one another, we propose a new fine-tuning strategy for pose estimation, the spatial mutual information complementary module (SMICM). It helps the model adjust the current joint's position by capturing the information contained in other joints, at only a small additional computational cost. We evaluated our proposed method on four datasets: MPII Human Pose Dataset (MPII), COCO keypoint detection Dataset (COCO), Occluded Human Dataset (OCHuman), and CrowdPose Dataset. The experimental results show that the improvement grows as the level of occlusion and crowding in the datasets increases. In particular, a performance improvement of 2.2 AP was obtained on the OCHuman dataset. In addition, our modules are plug-and-play.
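The contrast the abstract draws between plain addition and gated fusion can be illustrated with a minimal sketch. The abstract does not specify the GMSFF gate, so everything below is an assumption for illustration only: the function names, the scalar parameters `w_base`, `w_high`, and `bias`, and the choice of a per-position sigmoid gate are hypothetical and are not the authors' design.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function, used here as a hypothetical gate activation."""
    return 1.0 / (1.0 + math.exp(-x))


def additive_fusion(base, high_level):
    """Baseline: add the two feature maps element-wise at each position."""
    return [[b + h for b, h in zip(b_row, h_row)]
            for b_row, h_row in zip(base, high_level)]


def gated_fusion(base, high_level, w_base=1.0, w_high=1.0, bias=0.0):
    """Illustrative gated fusion (an assumption, not the paper's GMSFF).

    At each position a gate g in (0, 1) is computed from both inputs,
    and the high-level feature is admitted in proportion to g:
        g = sigmoid(w_base * b + w_high * h + bias)
        fused = b + g * h
    """
    fused = []
    for b_row, h_row in zip(base, high_level):
        row = []
        for b, h in zip(b_row, h_row):
            g = sigmoid(w_base * b + w_high * h + bias)
            row.append(b + g * h)
        fused.append(row)
    return fused
```

With a strongly negative `bias` the gate closes and the output stays close to `base`; with a strongly positive `bias` the gate opens and the output approaches plain addition. In a trained network the gate parameters would be learned (typically as convolutions rather than scalars), which is how a gate can suppress positions where the imported features act as noise.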
Pages: 119-137
Page count: 18