SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation

被引:0
|
作者
An, Xiaoqi [1 ,2 ]
Zhao, Lin [1 ,2 ]
Gong, Chen [1 ]
Wang, Nannan [2 ]
Wang, Di [2 ]
Yang, Jian [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, PCA Lab,Key Lab Intelligent Percept & Syst High D, Nanjing, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-resolution representation is essential for achieving good performance in human pose estimation models. To obtain such features, existing works utilize high-resolution input images or fine-grained image tokens. However, this dense high-resolution representation brings a significant computational burden. In this paper, we address the following question: "Only sparse human keypoint locations are detected for human pose estimation, is it really necessary to describe the whole image in a dense, high-resolution manner?" Based on dynamic transformer models, we propose a framework that only uses Sparse High-resolution Representations for human Pose estimation (SHaRPose). In detail, SHaRPose consists of two stages. At the coarse stage, the relations between image regions and keypoints are dynamically mined while a coarse estimation is generated. Then, a quality predictor is applied to decide whether the coarse estimation results should be refined. At the fine stage, SHaRPose builds sparse high-resolution representations only on the regions related to the keypoints and provides refined high-precision human pose estimations. Extensive experiments demonstrate the outstanding performance of the proposed method. Specifically, compared to the state-of-the-art method ViTPose, our model SHaRPose-Base achieves 77.4 AP (+0.5 AP) on the COCO validation set and 76.7 AP (+0.5 AP) on the COCO test-dev set, and infers at a speed of 1.4x faster than ViTPose-Base. Code is available at https://github.com/AnxQ/sharpose.
引用
收藏
页码:691 / 699
页数:9
相关论文
共 50 条
  • [21] A NEW SPARSE REPRESENTATION ALGORITHM FOR 3D HUMAN POSE ESTIMATION
    Andalib, Azam
    Babamir, Seyed Morteza
    Faraji, Alireza
    COMPUTING AND INFORMATICS, 2016, 35 (06) : 1338 - 1355
  • [22] HRNeXt: High-Resolution Context Network for Crowd Pose Estimation
    Li, Qun
    Zhang, Ziyi
    Zhang, Feng
    Xiao, Fu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1521 - 1528
  • [23] Simple Multi-Resolution Representation Learning for Human Pose Estimation
    Tran, Trung Q.
    Nguyen, Giang, V
    Kim, Daeyoung
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 511 - 518
  • [24] A residual semantic graph convolutional network with high-resolution representation for 3D human pose estimation in a virtual fashion show
    Zhang P.
    Ding P.
    Li G.
    Zhang J.
    Multimedia Tools and Applications, 2024, 83 (29) : 73649 - 73669
  • [25] Learning high resolution reservation for human pose estimation
    Gao, Bingkun
    Ma, Ke
    Bi, Hongbo
    Wang, Ling
    Wu, Chenlei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29251 - 29265
  • [26] Learning high resolution reservation for human pose estimation
    Bingkun Gao
    Ke Ma
    Hongbo Bi
    Ling Wang
    Chenlei Wu
    Multimedia Tools and Applications, 2021, 80 : 29251 - 29265
  • [27] Facial Pose Estimation via Dense and Sparse Representation
    Yu, Hui
    Liu, Honghai
    2014 IEEE SYMPOSIUM ON ROBOTIC INTELLIGENCE IN INFORMATIONALLY STRUCTURED SPACE (RIISS), 2014, : 98 - 103
  • [28] MS-HRNet: multi-scale high-resolution network for human pose estimation
    Wang, Yanxia
    Wang, Renjie
    Shi, Hu
    Liu, Dan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17269 - 17291
  • [29] Research on Lightweight High-resolution Network Human Pose Estimation Based on Self-attention
    Liu, Guangyu
    Zhong, Xiaoling
    Ma, Lizhi
    2023 IEEE 8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, ICBDA, 2023, : 142 - 146
  • [30] Human Pose Estimation Based on Efficient and Lightweight High-Resolution Network (EL-HRNet)
    Li, Rui
    Yan, An
    Yang, Shiqiang
    He, Duo
    Zeng, Xin
    Liu, Hongyan
    SENSORS, 2024, 24 (02)