SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation

被引:0
|
作者
An, Xiaoqi [1 ,2 ]
Zhao, Lin [1 ,2 ]
Gong, Chen [1 ]
Wang, Nannan [2 ]
Wang, Di [2 ]
Yang, Jian [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, PCA Lab,Key Lab Intelligent Percept & Syst High D, Nanjing, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-resolution representation is essential for achieving good performance in human pose estimation models. To obtain such features, existing works utilize high-resolution input images or fine-grained image tokens. However, this dense high-resolution representation brings a significant computational burden. In this paper, we address the following question: "Only sparse human keypoint locations are detected for human pose estimation, is it really necessary to describe the whole image in a dense, high-resolution manner?" Based on dynamic transformer models, we propose a framework that only uses Sparse High-resolution Representations for human Pose estimation (SHaRPose). In detail, SHaRPose consists of two stages. At the coarse stage, the relations between image regions and keypoints are dynamically mined while a coarse estimation is generated. Then, a quality predictor is applied to decide whether the coarse estimation results should be refined. At the fine stage, SHaRPose builds sparse high-resolution representations only on the regions related to the keypoints and provides refined high-precision human pose estimations. Extensive experiments demonstrate the outstanding performance of the proposed method. Specifically, compared to the state-of-the-art method ViTPose, our model SHaRPose-Base achieves 77.4 AP (+0.5 AP) on the COCO validation set and 76.7 AP (+0.5 AP) on the COCO test-dev set, and infers at a speed of 1.4x faster than ViTPose-Base. Code is available at https://github.com/AnxQ/sharpose.
引用
收藏
页码:691 / 699
页数:9
相关论文
共 50 条
  • [31] EDite-HRNet: Enhanced Dynamic Lightweight High-Resolution Network for Human Pose Estimation
    Rui, Liyuheng
    Gao, Yanyan
    Ren, Haopan
    IEEE ACCESS, 2023, 11 : 95948 - 95957
  • [32] Sparse Representation and Convolutional Neural Networks for 3D Human Pose Estimation
    Alikarami, Hassan
    Yaghmaee, Farzin
    Fadaeieslam, Mohammad Javad
    2017 3RD IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2017, : 188 - 192
  • [33] High-Resolution Radar Imaging of Space Debris Based on Sparse Representation
    Zhu, Jiang
    Zhu, Shengqi
    Liao, Guisheng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (10) : 2090 - 2094
  • [34] High-resolution time delay estimation via sparse parameter estimation methods
    Park, Hyung-Rae
    Li, Jian
    IET SIGNAL PROCESSING, 2020, 14 (02) : 97 - 105
  • [35] High-Resolution Ocean Clutter Spectrum Estimation for Shipborne HFSWR Using Sparse-Representation-Based MUSIC
    Xie, Junhao
    Wang, Zhongbao
    Ji, Zhenyuan
    Quan, Taifan
    IEEE JOURNAL OF OCEANIC ENGINEERING, 2015, 40 (03) : 546 - 557
  • [36] An improved lightweight high-resolution network based on multi-dimensional weighting for human pose estimation
    Zhang, Lei
    Zheng, Jia-Chun
    Zhao, Shi-Jia
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [37] HR-xNet: A Novel High-Resolution Network for Human Pose Estimation with Low Resource Consumption
    Feng, Cun
    Zhang, Rong
    Guo, Lijun
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [38] An improved lightweight high-resolution network based on multi-dimensional weighting for human pose estimation
    Lei Zhang
    Jia-Chun Zheng
    Shi-Jia Zhao
    Scientific Reports, 13
  • [39] WideHRNet: An Efficient Model for Human Pose Estimation Using Wide Channels in Lightweight High-Resolution Network
    Samkari, Esraa
    Arif, Muhammad
    AlGhamdi, Manal
    Al Ghamdi, Mohammed A.
    IEEE ACCESS, 2024, 12 : 148990 - 149000
  • [40] Lightweight high-resolution network based on adaptive cross-dimensional weighting for human pose estimation
    Wang, Fengqin
    Chen, Hongyang
    Li, Zuhe
    Wang, Yanjun
    Tian, Erlin
    Ju, Fujiao
    Bu, Xiangzhou
    Chen, Hui
    Wang, Junmin
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)