DIR-BHRNet: A Lightweight Network for Real-Time Vision-Based Multiperson Pose Estimation on Smartphones

被引：0

作者：

Lan, Gongjin ^{[1
]}

Wu, Yu ^{[1
]}

Hao, Qi ^{[1
,2
]}

机构：

[1] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China

[2] Southern Univ Sci & Technol, Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2024年 / 20卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Deep learning; human pose estimation (HPE); multiperson pose estimation (MPPE); real time; smartphones;

D O I：

10.1109/TII.2024.3421511

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Human pose estimation (HPE), particularly multiperson pose estimation (MPPE), has been applied in many domains, such as human-machine systems. However, the current MPPE methods generally run on powerful GPU systems and take a lot of computational costs. Real-time MPPE on mobile devices with low-performance computing is a challenging task. In this article, we propose a lightweight neural network, DIR-BHRNet, for real-time MPPE on smartphones. In DIR-BHRNet, we design a novel lightweight convolutional module, dense inverted residual (DIR), to improve accuracy by adding a depthwise convolution and a shortcut connection into the well-known inverted residual, and a novel efficient neural network structure, balanced HRNet (BHRNet), to reduce computational costs by reconfiguring the proper number of convolutional blocks on each branch. We evaluate DIR-BHRNet on the well-known COCO and CrowdPose datasets. The results show that DIR-BHRNet outperforms the state-of-the-art methods in terms of accuracy with a real-time computational cost. Finally, we implement the DIR-BHRNet on the current mainstream Android smartphones, which perform more than 10 FPS. The free-used executable file (Android 10), source code, and a video description of this work are publicly available on the page(1) to facilitate the development of real-time MPPE on smartphones.

引用

页码：12533 / 12541

页数：9

共 28 条

[1] Cai YX, 2021, AAAI CONF ARTIF INTE, V35, P955
[2] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
Cao, Zhe
Hidalgo, Gines
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
[3] Chen DY, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2463
[4] HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
Cheng, Bowen
Xiao, Bin
Wang, Jingdong
Shi, Honghui
Huang, Thomas S.
Zhang, Lei
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5385 - 5394
[5] AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time
Fang, Hao-Shu
Li, Jiefeng
Tang, Hongyang
Xu, Chao
Zhu, Haoyi
Xiu, Yuliang
Li, Yong-Lu
Lu, Cewu
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7157 - 7173
[6] Channel Pruning for Accelerating Very Deep Neural Networks
He, Yihui
Zhang, Xiangyu
Sun, Jian
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1398 - 1406
[7] DeepMon: Mobile GPU-based Deep Learning Framework for Continuous Vision Applications
Huynh, Loc N.
Lee, Youngki
Balan, Rajesh Krishna
[J]. MOBISYS'17: PROCEEDINGS OF THE 15TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS, AND SERVICES, 2017, : 82 - 95
[8] Jiang T., 2023, arXiv
[9] YOLO-Rlepose: Improved YOLO Based on Swin Transformer and Rle-Oks Loss for Multi-Person Pose Estimation
Jiang, Yi
Yang, Kexin
Zhu, Jinlin
Qin, Li
[J]. ELECTRONICS, 2024, 13 (03)
[10] VirtualComponent: A Mixed-Reality Tool for Designing and Tuning Breadboarded Circuits
Kim, Yoonji
Choi, Youngkyung
Lee, Hyein
Lee, Geehyuk
Bianchi, Andrea
[J]. CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,

← 1 2 3 →