Lightweight human pose estimation algorithm based on polarized self-attention

被引：5

作者：

Liu, Shengjie ^{[1
]}

He, Ning ^{[2
]}

Wang, Cheng ^{[1
]}

Yu, Haigang ^{[1
]}

Han, Wenjing ^{[2
]}

机构：

[1] Beijing Union Univ, Coll Robot, Beijing Key Lab Informat Serv Engn, Beijing, Peoples R China

[2] Beijing Union Univ, Coll Smart City, Beijing, Peoples R China

来源：

MULTIMEDIA SYSTEMS | 2023年 / 29卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Human pose estimation; Polarized self-attention; Ghost module; Coordinate decoding; NETWORK;

D O I：

10.1007/s00530-022-00981-z

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, human pose estimation has been widely used in human-computer interaction, augmented reality, video surveillance, and many other fields, but the task of pose estimation still faces many challenges. To address the large number of parameters and complicated calculation in the current mainstream human pose estimation network, this paper proposes a lightweight pose estimation network (Lightweight Polarized Network, referred to as LPNet) based on a polarized self-attention mechanism. First, ghost convolution is used to reduce the number of parameters of the feature extraction network; second, by introducing the polarized self-attention module, the pixel-level regression task can be better solved, the lack of extracted features due to the decrease in the number of parameters can be reduced, and the accuracy of the regression of human keypoints can be improved; finally, a new coordinate decoding method is designed to reduce the error in the heatmap decoding process and improve the accuracy of keypoint regression. The method proposed in this paper was evaluated on the human keypoint detection datasets COCO and MPII, and compared with the current mainstream methods. The experimental results show that the proposed method greatly reduces the number of parameters of the model while ensuring a small loss in accuracy.

引用

页码：197 / 210

页数：14

共 31 条

[1]

[Anonymous], 2018, P EUR C COMP VIS ECC

[2] GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond [J].

Cao, Yue ;

Xu, Jiarui ;

Lin, Stephen ;

Wei, Fangyun ;

Hu, Han .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1971-1980

[3] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].

Cao, Zhe ;

Simon, Tomas ;

Wei, Shih-En ;

Sheikh, Yaser .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310

[4]

Chen Y., 2018, Adv. Neural Inf. Process. Syst, V31

[5] Cascaded Pyramid Network for Multi-Person Pose Estimation [J].

Chen, Yilun ;

Wang, Zhicheng ;

Peng, Yuxiang ;

Zhang, Zhiqiang ;

Yu, Gang ;

Sun, Jian .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7103-7112

[6] HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation [J].

Cheng, Bowen ;

Xiao, Bin ;

Wang, Jingdong ;

Shi, Honghui ;

Huang, Thomas S. ;

Zhang, Lei .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5385-5394

[7] RMPE: Regional Multi-Person Pose Estimation [J].

Fang, Hao-Shu ;

Xie, Shuqin ;

Tai, Yu-Wing ;

Lu, Cewu .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2353-2362

[8]

Howard AG, 2017, Arxiv, DOI arXiv:1704.04861

[9] Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression [J].

Geng, Zigang ;

Sun, Ke ;

Xiao, Bin ;

Zhang, Zhaoxiang ;

Wang, Jingdong .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14671-14681

[10] GhostNet: More Features from Cheap Operations [J].

Han, Kai ;

Wang, Yunhe ;

Tian, Qi ;

Guo, Jianyuan ;

Xu, Chunjing ;

Xu, Chang .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586

← 1 2 3 4 →