HPAR: deep neural network-based approach for human pose-activity recognition

被引:0
作者
Bouzidi, Wijden [1 ,2 ]
Bouaafia, Soulef [1 ,3 ]
Hajjaji, Mohamed Ali [1 ,4 ]
Saidani, Taoufik [5 ]
机构
[1] Univ Monastir, Elect & Microelect Lab, LR99ES30, Fac Sci, Monastir, Tunisia
[2] Univ Monastir, Natl Engn Sch Monastir, Monastir, Tunisia
[3] Univ Kairouan, Higher Inst Appl Sci & Technol Kairouan, Kairouan, Tunisia
[4] Univ Sousse, Higher Inst Appl Sci & Technol, Sousse, Tunisia
[5] Northern Border Univ, Fac Comp & Informat Technol, Dept Comp Sci, Rafha, Saudi Arabia
关键词
deep learning; 3D human pose-activity recognition; CNN model; motion reconstruction; GPU-implementation;
D O I
10.1117/1.JEI.32.3.033017
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human pose activity recognition (HPAR) offers a wide range of applications due to the widespread use of collection devices such as smartphones and video cameras, as well as its capacity to gather human activity data. Electronic devices and applications continue to evolve, and breakthroughs in artificial intelligence (AI) have transformed the capacity to extract deeply buried information for accurate recognition and interpretation. We propose a systematic design for integrating conventional networks and constraints into the attention framework for learning long-range dependencies, thereby achieving end-to-end pose estimation with flexibility and scalability. The proposed method modifies the temporal receptive field using a multi-scale structure of dilated convolutions and can be adapted to a causal model for real-time performance. Our approach achieves state-of-the-art performance on the task of three-dimensional HPAR and outperforms previous methods while maintaining a lower complexity cost. (C) 2023 SPIE and IS&T
引用
收藏
页数:16
相关论文
共 63 条
[1]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]   Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].
Bogo, Federica ;
Kanazawa, Angjoo ;
Lassner, Christoph ;
Gehler, Peter ;
Romero, Javier ;
Black, Michael J. .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578
[3]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[4]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[5]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[6]   Automatic and Efficient Human Pose Estimation for Sign Language Videos [J].
Charles, James ;
Pfister, Tomas ;
Everingham, Mark ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 110 (01) :70-90
[7]   Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition [J].
Chen, Tianlang ;
Fang, Chen ;
Shen, Xiaohui ;
Zhu, Yiheng ;
Chen, Zhili ;
Luo, Jiebo .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) :198-209
[8]   Cascaded Pyramid Network for Multi-Person Pose Estimation [J].
Chen, Yilun ;
Wang, Zhicheng ;
Peng, Yuxiang ;
Zhang, Zhiqiang ;
Yu, Gang ;
Sun, Jian .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7103-7112
[9]   Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation [J].
Chen, Yu ;
Shen, Chunhua ;
Wei, Xiu-Shen ;
Liu, Lingqiao ;
Yang, Jian .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1221-1230
[10]   Occlusion-Aware Networks for 3D Human Pose Estimation in Video [J].
Cheng, Yu ;
Yang, Bo ;
Wang, Bo ;
Yan, Wending ;
Tan, Robby T. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :723-732