Lightweight Multiperson Pose Estimation With Staggered Alignment Self-Distillation

被引:0
|
作者
Fan, Zhenkun [1 ,2 ,3 ]
Huang, Zhuoxu [4 ,5 ,6 ]
Chen, Zhixiang [7 ]
Xu, Tao [2 ]
Han, Jungong [7 ]
Kittler, Josef [8 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Wales
[2] Design & Res Inst Co Ltd, Shanghai Invest, Shanghai 200434, Peoples R China
[3] AMATUS Technol Ltd, Guildford GU3 3AW, England
[4] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Wales
[5] Zhejiang Future Technol Inst, Hangzhou 310006, Peoples R China
[6] Taizhou Baite Technol Ltd, Taizhou 318000, Peoples R China
[7] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, England
[8] Surrey Univ, Dept Elect Engn, Guildford GU2 7XH, England
关键词
Pose estimation; Training; Image resolution; Computational modeling; Skeleton; Heating systems; Task analysis; 2D pose estimation; lightweight neural networks;
D O I
10.1109/TMM.2024.3387754
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate 2D human pose estimation from images is vital for understanding human actions. However, deploying the latest models, e.g., regression-based models, on resource-limited devices remains challenging due to their high computational requirements. In this paper, we address the resolution dilemma in regression-based multiperson pose estimation, where low-resolution inputs cause performance degradation, while high-resolution inputs drastically increase computational costs. To achieve a lightweight regression approach, it becomes crucial to enhance the model's capabilities in low-resolution scenarios. We propose the staggered alignment self-distillation (SASD) method and a corresponding network architecture. Our approach involves training two twin networks with shared weights: a high-resolution network and a low-resolution network. The high-resolution network serves as a teacher, guiding the learning process of the low-resolution network through feature map staggered alignment. The knowledge from the high-resolution network enhances the performance of the low-resolution network during low-resolution inference. Additionally, we employ a normalized skeleton loss to capture the loss of bone-related structure during training. Through extensive experiments on the MS-COCO and CrowdPose datasets, we demonstrate the superiority of our proposed method over state-of-the-art, lightweight multiperson pose estimation techniques, achieving much better performance with lower computational costs. Furthermore, our method achieves comparable performance to recent advanced regression-based pose estimation methods but with only 1/4 of the computational cost.
引用
收藏
页码:9228 / 9240
页数:13
相关论文
共 50 条
  • [1] A Self-distillation Lightweight Image Classification Network Scheme
    Ni S.
    Ma X.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (06): : 66 - 71
  • [2] Heterogeneous heatmap distillation framework based on unbiased alignment for lightweight human pose estimation
    Du, Congju
    Li, Zhenyu
    Zhao, Huijuan
    He, Shuangjiang
    Yu, Li
    IMAGE AND VISION COMPUTING, 2024, 146
  • [3] Reverse Self-Distillation Overcoming the Self-Distillation Barrier
    Ni, Shuiping
    Ma, Xinliang
    Zhu, Mingfu
    Li, Xingwang
    Zhang, Yu-Dong
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2023, 4 : 195 - 205
  • [4] Lightweight Human Pose Estimation Based on Densely Guided Self-Knowledge Distillation
    Wu, Mingyue
    Zhao, Zhong-Qiu
    Li, Jiajun
    Tian, Weidong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 421 - 433
  • [5] Human Pose Estimation via an Ultra-Lightweight Pose Distillation Network
    Zhang, Shihao
    Qiang, Baohua
    Yang, Xianyi
    Wei, Xuekai
    Chen, Ruidong
    Chen, Lirui
    ELECTRONICS, 2023, 12 (12)
  • [6] Monocular Depth Estimation via Self-Supervised Self-Distillation
    Hu, Haifeng
    Feng, Yuyang
    Li, Dapeng
    Zhang, Suofei
    Zhao, Haitao
    SENSORS, 2024, 24 (13)
  • [7] Self-distillation framework for indoor and outdoor monocular depth estimation
    Meng Pan
    Huanrong Zhang
    Jiahao Wu
    Zhi Jin
    Multimedia Tools and Applications, 2022, 81 : 35899 - 35913
  • [8] Self-distillation framework for indoor and outdoor monocular depth estimation
    Pan, Meng
    Zhang, Huanrong
    Wu, Jiahao
    Jin, Zhi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 35899 - 35913
  • [9] Efficient Pose Estimation via a Lightweight Single-Branch Pose Distillation Network
    Zhang, Shihao
    Qiang, Baohua
    Yang, Xianyi
    Zhou, Mingliang
    Chen, Ruidong
    Chen, Lirui
    IEEE SENSORS JOURNAL, 2023, 23 (22) : 27709 - 27719
  • [10] A Lightweight Graph Neural Network Algorithm for Action Recognition Based on Self-Distillation
    Feng, Miao
    Meunier, Jean
    ALGORITHMS, 2023, 16 (12)