3D PostureNet: A unified framework for skeleton-based posture recognition

被引:24
作者
Liu, Jianbo [1 ,2 ]
Wang, Ying [1 ]
Liu, Yongcheng [1 ,2 ]
Xiang, Shiming [1 ]
Pan, Chunhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Human posture recognition; Static hand gesture recognition; Skeleton-based; 3D convolutional neural network; SYSTEM;
D O I
10.1016/j.patrec.2020.09.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image-based posture recognition is a very challenging problem as it is difficult to acquire rich 3D information from postures in 2D images. Existing methods founded on 3D skeleton cues could alleviate this issue, but they are not particularly efficient due to the application of handcrafted features and traditional classifiers. This paper presents a novel and unified framework for skeleton-based posture recognition, applying powerful 3D Convolutional Neural Network (CNN) to this issue. Technically, bounding-box-based normalization for the raw skeleton data is proposed to eliminate the coordinate differences caused by diverse recording environments and posture displacements. Moreover, Gaussian voxelization for the skeleton is employed to expressively represent the posture configuration. Thereby, an end-to-end framework based on 3D CNN, called 3D PostureNet, is developed for robust posture recognition. To verify its effectiveness, a large-scale writing posture dataset is created and released in this work, including 113,400 samples of 30 subjects with 15 postures. Extensive experiments on the public MSRA hand gesture dataset, body pose dataset and the proposed writing posture dataset demonstrate that 3D PostureNet achieves significantly superior performance on both skeleton-based human posture and hand posture recognition tasks. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:143 / 149
页数:7
相关论文
共 50 条
[41]   A 3D model recognition mechanism based on deep Boltzmann machines [J].
Leng, Biao ;
Zhang, Xiangyang ;
Yao, Ming ;
Xiong, Zhang .
NEUROCOMPUTING, 2015, 151 :593-602
[42]   3D curve weld seam path and posture planning based on line laser sensors [J].
Wang, Hui ;
Rong, Youmin ;
Xiang, Songming ;
Xu, Jiajun ;
Peng, Yifan ;
Huang, Yu .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2025, 94
[43]   Mini-TKAGCN: a lightweight Graph Convolutional Network via Temporal Kernel Attention for Skeleton-based Action Recognition [J].
Liu, Yanan ;
Dong, Shiqi ;
Zhang, Hao ;
Xu, Dan ;
Li, Haipeng .
THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
[44]   3D Printing of Skeleton Muscle Tissue Engineering Scaffolds [J].
Song, Ju Qing ;
Ye, Xin Liang ;
Chen, Wen Cong ;
Wang, Li ;
Lu, Bing Heng .
NANO LIFE, 2021, 11 (04)
[45]   Vision-Based Automated Recognition and 3D Localization Framework for Tower Cranes Using Far-Field Cameras [J].
Wang, Jiyao ;
Zhang, Qilin ;
Yang, Bin ;
Zhang, Binghan .
SENSORS, 2023, 23 (10)
[46]   FLEXIBLE 3D OBJECT RECOGNITION FRAMEWORK USING 2D VIEWS VIA A SIMILARITY-BASED ASPECT-GRAPH APPROACH [J].
Hu, Jwu-Sheng ;
Su, Tzung-Min .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2008, 22 (06) :1141-1169
[47]   Skeleton-based abnormal gait recognition with spatio-temporal attention enhanced gait-structural graph convolutional networks [J].
Tian, Haoyu ;
Ma, Xin ;
Wu, Hanbo ;
Li, Yibin .
NEUROCOMPUTING, 2022, 473 :116-126
[48]   Optimizing laser triangulation displacement sensor of 3D positioning and posture using COA Based BPNN [J].
Selami, Yassine ;
Lv, Na ;
Tao, Wei ;
Yang, Hongwei ;
Zhao, Hui .
SENSOR REVIEW, 2020, 40 (01) :112-120
[49]   Survey on 3D Hand Gesture Recognition [J].
Cheng, Hong ;
Yang, Lu ;
Liu, Zicheng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (09) :1659-1673
[50]   Face ShapeNets for 3D Face Recognition [J].
Jabberi, Marwa ;
Wali, Ali ;
Neji, Bilel ;
Beyrouthy, Taha ;
Alimi, Adel M. .
IEEE ACCESS, 2023, 11 :46240-46256