Lightweight Multiperson Pose Estimation With Staggered Alignment Self-Distillation

Cited by: 0

Authors:

Fan, Zhenkun ^{[1,2,3]}

Huang, Zhuoxu ^{[4,5,6]}

Chen, Zhixiang ^{[7]}

Xu, Tao ^{[2]}

Han, Jungong ^{[7]}

Kittler, Josef ^{[8]}

Affiliations:

[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Wales

[2] Design & Res Inst Co Ltd, Shanghai Invest, Shanghai 200434, Peoples R China

[3] AMATUS Technol Ltd, Guildford GU3 3AW, England

[4] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Wales

[5] Zhejiang Future Technol Inst, Hangzhou 310006, Peoples R China

[6] Taizhou Baite Technol Ltd, Taizhou 318000, Peoples R China

[7] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, England

[8] Surrey Univ, Dept Elect Engn, Guildford GU2 7XH, England

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Keywords:

Pose estimation; Training; Image resolution; Computational modeling; Skeleton; Heating systems; Task analysis; 2D pose estimation; lightweight neural networks;

DOI:

10.1109/TMM.2024.3387754

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Accurate 2D human pose estimation from images is vital for understanding human actions. However, deploying the latest models, e.g., regression-based models, on resource-limited devices remains challenging due to their high computational requirements. In this paper, we address the resolution dilemma in regression-based multiperson pose estimation, where low-resolution inputs cause performance degradation, while high-resolution inputs drastically increase computational costs. To achieve a lightweight regression approach, it becomes crucial to enhance the model's capabilities in low-resolution scenarios. We propose the staggered alignment self-distillation (SASD) method and a corresponding network architecture. Our approach involves training two twin networks with shared weights: a high-resolution network and a low-resolution network. The high-resolution network serves as a teacher, guiding the learning process of the low-resolution network through feature map staggered alignment. The knowledge from the high-resolution network enhances the performance of the low-resolution network during low-resolution inference. Additionally, we employ a normalized skeleton loss to capture the loss of bone-related structure during training. Through extensive experiments on the MS-COCO and CrowdPose datasets, we demonstrate the superiority of our proposed method over state-of-the-art, lightweight multiperson pose estimation techniques, achieving much better performance with lower computational costs. Furthermore, our method achieves comparable performance to recent advanced regression-based pose estimation methods but with only 1/4 of the computational cost.
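The two training signals named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the toy `shared_backbone`, the pairing of high-resolution stage k+1 with low-resolution stage k (one reading of "staggered alignment", chosen because those feature maps have equal spatial size), and the mean-bone-length normaliser in the skeleton loss are all assumptions made for illustration, and every function name here is hypothetical.

```python
import numpy as np

def avg_pool2x(x):
    """Downsample a (C, H, W) array by a factor of 2 using average pooling."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).mean(axis=(2, 4))

def shared_backbone(image, num_stages=3):
    """Toy stand-in for a shared-weight network (assumption, not the paper's).
    Each stage halves the spatial resolution; returns one feature map per stage."""
    feats, x = [], image
    for _ in range(num_stages):
        x = np.tanh(avg_pool2x(x))
        feats.append(x)
    return feats

def staggered_alignment_loss(feats_hi, feats_lo):
    """Mean squared error between high-res stage k+1 and low-res stage k.
    In a real framework the high-res (teacher) features would typically be
    detached so that gradients only update the low-res path."""
    losses = []
    for f_hi, f_lo in zip(feats_hi[1:], feats_lo[:-1]):
        assert f_hi.shape == f_lo.shape  # staggered stages share spatial size
        losses.append(np.mean((f_hi - f_lo) ** 2))
    return float(np.mean(losses))

def normalized_skeleton_loss(pred, target, bones, eps=1e-8):
    """L1 difference between predicted and ground-truth bone vectors,
    divided by the mean ground-truth bone length (assumed normaliser).
    pred, target: (K, 2) keypoint arrays; bones: list of (parent, child) indices."""
    i, j = np.array(bones).T
    bone_pred = pred[j] - pred[i]
    bone_gt = target[j] - target[i]
    scale = np.linalg.norm(bone_gt, axis=1).mean() + eps
    return float(np.abs(bone_pred - bone_gt).sum(axis=1).mean() / scale)

# Usage: the same backbone processes a high-res image and its 2x downsampled copy.
rng = np.random.default_rng(0)
image_hi = rng.standard_normal((3, 64, 64))
image_lo = avg_pool2x(image_hi)
distill = staggered_alignment_loss(shared_backbone(image_hi), shared_backbone(image_lo))

keypoints_gt = rng.standard_normal((5, 2))
keypoints_pred = keypoints_gt + 0.05 * rng.standard_normal((5, 2))
chain = [(0, 1), (1, 2), (2, 3), (3, 4)]  # hypothetical 5-joint chain
skeleton = normalized_skeleton_loss(keypoints_pred, keypoints_gt, chain)
print(distill, skeleton)
```

Because the skeleton term in this sketch depends only on bone vectors divided by a length scale, it is unchanged by translating or uniformly rescaling the whole pose, which is the kind of property a normalised structural loss is usually meant to provide.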

Pages: 9228 - 9240

Number of pages: 13

50 items in total

[1] A Self-distillation Lightweight Image Classification Network Scheme
Ni S.
Ma X.
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (06) : 66 - 71
[2] Heterogeneous heatmap distillation framework based on unbiased alignment for lightweight human pose estimation
Du, Congju
Li, Zhenyu
Zhao, Huijuan
He, Shuangjiang
Yu, Li
IMAGE AND VISION COMPUTING, 2024, 146
[3] Reverse Self-Distillation Overcoming the Self-Distillation Barrier
Ni, Shuiping
Ma, Xinliang
Zhu, Mingfu
Li, Xingwang
Zhang, Yu-Dong
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2023, 4 : 195 - 205
[4] Lightweight Human Pose Estimation Based on Densely Guided Self-Knowledge Distillation
Wu, Mingyue
Zhao, Zhong-Qiu
Li, Jiajun
Tian, Weidong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 421 - 433
[5] Human Pose Estimation via an Ultra-Lightweight Pose Distillation Network
Zhang, Shihao
Qiang, Baohua
Yang, Xianyi
Wei, Xuekai
Chen, Ruidong
Chen, Lirui
ELECTRONICS, 2023, 12 (12)
[6] Monocular Depth Estimation via Self-Supervised Self-Distillation
Hu, Haifeng
Feng, Yuyang
Li, Dapeng
Zhang, Suofei
Zhao, Haitao
SENSORS, 2024, 24 (13)
[7] Self-distillation framework for indoor and outdoor monocular depth estimation
Meng Pan
Huanrong Zhang
Jiahao Wu
Zhi Jin
Multimedia Tools and Applications, 2022, 81 : 35899 - 35913
[8] Self-distillation framework for indoor and outdoor monocular depth estimation
Pan, Meng
Zhang, Huanrong
Wu, Jiahao
Jin, Zhi
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 35899 - 35913
[9] Efficient Pose Estimation via a Lightweight Single-Branch Pose Distillation Network
Zhang, Shihao
Qiang, Baohua
Yang, Xianyi
Zhou, Mingliang
Chen, Ruidong
Chen, Lirui
IEEE SENSORS JOURNAL, 2023, 23 (22) : 27709 - 27719
[10] A Lightweight Graph Neural Network Algorithm for Action Recognition Based on Self-Distillation
Feng, Miao
Meunier, Jean
ALGORITHMS, 2023, 16 (12)
