MRSAPose: Multi-level routing sparse attention for multi-person pose estimation

被引:1
|
作者
Wu, Shang [1 ]
Wang, Bin [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
关键词
Multi-person pose estimation; Multi-level routing sparse attention; Transformer-CNN parallel interaction; Multi-level routing algorithm; High-order spatial interaction;
D O I
10.1016/j.eswa.2024.125100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person Pose Estimation (MPPE) aims to reconstruct human poses by locating and connecting keypoints of individuals in input images. The variability of human poses and the complexity of scenes make MPPE reliant on both local details and global structures, and the absence of either can lead to the generation of deformed poses. With the emergence of Transformer, the performance of MPPE has been significantly improved. However, due to self-attention computing attention scores between each pair of positions, the current Transformer-based MPPE exhibits high quadratic complexity. To address these issues, this paper proposes a novel pose estimation model, MRSAPose. MRSAPose utilizes Multi-level Routing Sparse Attention (MRSA) to dynamically select relevant regions for attention, reducing computational complexity and mitigating the impact of irrelevant regions. Furthermore, MRSAPose constructs a Transformer-CNN Parallel Interaction Block (T-CP block) through MRSA and Recursive Residual Gated Convolution (Res-gnConv), facilitating parallel learning of global and local information. By relying on multi-level routing algorithms and high-order spatial interactions conducted by recursive processing of adjacent features, T-CP block helps MRSAPose effectively alleviates the issues of occlusion and misalignment in pose estimation. On multiple challenging keypoint datasets, MRSAPose outperforms current state-of-the-art algorithms, particularly excelling in crowded and occluded scenes.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] MULTI-LEVEL NETWORK FOR HIGH-SPEED MULTI-PERSON POSE ESTIMATION
    Huang, Ying
    Zhuang, Jiankai
    Qin, Zengchang
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 589 - 593
  • [2] OCPAN: multi-person pose estimation with more attention on residual information
    Lin, Hsinchuan
    Zhu, Jiaying
    Hu, Haifeng
    ELECTRONICS LETTERS, 2020, 56 (12) : 602 - +
  • [3] Pose Knowledge Transfer for multi-person pose estimation
    Buwei Li
    Yi Ji
    Ying Li
    Yunlong Xu
    Chunping Liu
    Signal, Image and Video Processing, 2022, 16 : 321 - 328
  • [4] Pose Partition Networks for Multi-person Pose Estimation
    Nie, Xuecheng
    Feng, Jiashi
    Xing, Junliang
    Yan, Shuicheng
    COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 705 - 720
  • [5] Pose Knowledge Transfer for multi-person pose estimation
    Li, Buwei
    Ji, Yi
    Li, Ying
    Xu, Yunlong
    Liu, Chunping
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 321 - 328
  • [6] Monocular multi-person pose estimation: A survey
    dos Reis, Eduardo Souza
    Seewald, Lucas Adams
    Antunes, Rodolfo Stoffel
    Rodrigues, Vinicius Facco
    Righi, Rodrigo da Rosa
    da Costa, Cristiano Andre
    da Silveira Jr, Luiz Gonzaga
    Eskofier, Bjoern
    Maier, Andreas
    Horz, Tim
    Fahrig, Rebecca
    PATTERN RECOGNITION, 2021, 118
  • [7] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [8] Multi-Person Pose Estimation on Embedded Device
    Ma, Zhipeng
    Tian, Dawei
    Zhang, Ming
    He, Dingxin
    2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 57 - 61
  • [9] The Overview of Multi-person Pose Estimation Method
    Li, Bingyi
    Zou, Jiaqi
    Wang, Luyao
    Li, Xiangyuan
    Li, Yue
    Lei, Rongjia
    Sun, Songlin
    SIGNAL AND INFORMATION PROCESSING, NETWORKING AND COMPUTERS (ICSINC), 2019, 550 : 600 - 607
  • [10] Multi-Domain Pose Network for Multi-Person Pose Estimation and Tracking
    Guo, Hengkai
    Tang, Tang
    Luo, Guozhong
    Chen, Riwei
    Lu, Yongchen
    Wen, Linfu
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 209 - 216