SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation

Cited by: 0
Authors
An, Xiaoqi [1, 2]
Zhao, Lin [1, 2]
Gong, Chen [1 ]
Wang, Nannan [2 ]
Wang, Di [2 ]
Yang, Jian [1 ]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, PCA Lab,Key Lab Intelligent Percept & Syst High D, Nanjing, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High-resolution representation is essential for achieving good performance in human pose estimation models. To obtain such features, existing works utilize high-resolution input images or fine-grained image tokens. However, this dense high-resolution representation brings a significant computational burden. In this paper, we address the following question: "Only sparse human keypoint locations are detected for human pose estimation, is it really necessary to describe the whole image in a dense, high-resolution manner?" Based on dynamic transformer models, we propose a framework that only uses Sparse High-resolution Representations for human Pose estimation (SHaRPose). In detail, SHaRPose consists of two stages. At the coarse stage, the relations between image regions and keypoints are dynamically mined while a coarse estimation is generated. Then, a quality predictor is applied to decide whether the coarse estimation results should be refined. At the fine stage, SHaRPose builds sparse high-resolution representations only on the regions related to the keypoints and provides refined high-precision human pose estimations. Extensive experiments demonstrate the outstanding performance of the proposed method. Specifically, compared to the state-of-the-art method ViTPose, our model SHaRPose-Base achieves 77.4 AP (+0.5 AP) on the COCO validation set and 76.7 AP (+0.5 AP) on the COCO test-dev set, and runs inference 1.4x faster than ViTPose-Base. Code is available at https://github.com/AnxQ/sharpose.
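The two-stage control flow described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that); it is a toy under stated assumptions. The function names `select_keypoint_regions` and `coarse_to_fine`, the `keep_ratio` and `quality_threshold` parameters, the max-over-keypoints scoring rule, and the convention that a higher quality score means refinement can be skipped are all hypothetical choices made for illustration.

```python
import numpy as np


def select_keypoint_regions(relevance, keep_ratio=0.25):
    """Pick the coarse patches most related to any keypoint.

    relevance: array of shape (num_keypoints, num_patches) holding
    non-negative keypoint-to-patch scores (assumed to come from the
    coarse stage). Returns sorted patch indices to refine.
    """
    # Assumption: a patch matters if it is strongly related to at least one keypoint.
    patch_score = relevance.max(axis=0)
    k = max(1, int(round(keep_ratio * patch_score.size)))
    top = np.argpartition(-patch_score, k - 1)[:k]
    return np.sort(top)


def coarse_to_fine(coarse_keypoints, relevance, quality, refine_fn,
                   quality_threshold=0.5, keep_ratio=0.25):
    """Return (keypoints, refined_patch_indices or None).

    Assumption: `quality` is a scalar in [0, 1] from a quality predictor,
    and values at or above `quality_threshold` mean the coarse estimate
    is accepted without running the fine stage.
    """
    if quality >= quality_threshold:
        return coarse_keypoints, None
    kept = select_keypoint_regions(relevance, keep_ratio)
    # Only the selected patches would be re-encoded at high resolution.
    return refine_fn(coarse_keypoints, kept), kept
```

In this sketch the saving comes from the early exit and from passing only `kept` patches to `refine_fn`, so the fine stage never touches regions unrelated to the keypoints.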
Pages: 691 - 699
Page count: 9
Related papers
50 in total
  • [1] Deep High-Resolution Representation Learning for Human Pose Estimation
    Sun, Ke
    Xiao, Bin
    Liu, Dong
    Wang, Jingdong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019 : 5686 - 5696
  • [2] Efficient High-Resolution Human Pose Estimation
    Qin, Xiaofei
    Qiu, Lingfeng
    He, Changxiang
    Zhang, Xuedian
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 383 - 396
  • [3] Efficient High-Resolution High-Level-Semantic Representation Learning for Human Pose Estimation
    Liu, Hong
    Guan, Lisi
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021 : 7862 - 7867
  • [4] Lightweight and High-Resolution Human Pose Estimation Method
    Qu, Hanbing
    Jia, Zhentang
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [5] Human Pose Estimation Based Pre-training Model and Efficient High-Resolution Representation
    Wen, Jinchen
    Chi, Jianning
    Wu, Chengdong
    Yu, Xiaosheng
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021 : 8463 - 8468
  • [6] High-resolution DOA estimation using nonredundant cumulants sparse representation
    Liu, Qinghua
    Zhou, Xiuqing
    Ouyang, Shan
    Journal of Computational Information Systems, 2015, 11 (17) : 6319 - 6325
  • [7] Lightweight and Efficient High-Resolution Network for Human Pose Estimation
    Liu, Jiarui
    Gong, Xiugang
    Guo, Qun
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 232 - 240
  • [8] FastNet: Fast high-resolution network for human pose estimation
    Luo, Yanmin
    Ou, Zhilong
    Wan, Tianjun
    Guo, Jing-Ming
    IMAGE AND VISION COMPUTING, 2022, 119
  • [9] High-Resolution with Global Context Network for Human Pose Estimation
    Wang, Kehao
    Li, Chenglin
    Ren, Ruiqi
    2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA, 2022 : 621 - 626
  • [10] High-resolution Human Pose Estimation Method Based on Efficient Convolution
    Du, Hai-Xia
    Ma, Hong-Bin
    Fan, Zheng
    Journal of Network Intelligence, 2022, 7 (04) : 909 - 920