Overview of behavior recognition based on deep learning

被引:54
作者
Hu, Kai [1 ,2 ]
Jin, Junlan [1 ,2 ]
Zheng, Fei [2 ,3 ]
Weng, Liguo [1 ,2 ]
Ding, Yiwu [1 ,2 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Automat, 219 Ningliu Rd, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, 219 Ningliu Rd, Nanjing 210044, Jiangsu, Peoples R China
[3] Innovat Dept Ind Internet, China Telecom Ningbo Branch, 96 HeYi Rd, Ningbo 315000, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Behavior recognition; Deep learning; Skeleton data; NETWORK; LSTM;
D O I
10.1007/s10462-022-10210-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human behavior recognition has always been a hot spot for research in computer vision. With the wide application of behavior recognition in virtual reality and short video in recent years and the rapid development of deep learning algorithms, behavior recognition algorithms based on deep learning have emerged. Compared with traditional methods, behavior recognition algorithms based on deep learning have the advantages of strong robustness and high accuracy. This paper systemizes and introduces behavior recognition algorithms based on deep learning proposed in recent years, then focuses on a series of behavior recognition algorithms based on image and bone data; deeply analyzes their theories and performance, and finally, puts forward further prospects.
引用
收藏
页码:1833 / 1865
页数:33
相关论文
共 76 条
[1]   All about VLAD [J].
Arandjelovic, Relja ;
Zisserman, Andrew .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :1578-1585
[2]   MFANet: A Multi-Level Feature Aggregation Network for Semantic Segmentation of Land Cover [J].
Chen, Bingyu ;
Xia, Min ;
Huang, Junqing .
REMOTE SENSING, 2021, 13 (04) :1-20
[3]   Skeleton-Based Action Recognition with Shift Graph Convolutional Network [J].
Cheng, Ke ;
Zhang, Yifan ;
He, Xiangyu ;
Chen, Weihan ;
Cheng, Jian ;
Lu, Hanqing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :180-189
[4]   Spatio-Temporal Fusion Networks for Action Recognition [J].
Cho, Sangwoo ;
Foroosh, Hassan .
COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 :347-364
[5]   PoTion: Pose MoTion Representation for Action Recognition [J].
Choutas, Vasileios ;
Weinzaepfel, Philippe ;
Revaud, Jerome ;
Schmid, Cordelia .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7024-7033
[6]  
[邓三鸿 Deng Sanhong], 2017, [数据分析与知识发现, Data Analysis and Knowledge Discovery], V1, P52
[7]   Deep Temporal Linear Encoding Networks [J].
Diba, Ali ;
Sharma, Vivek ;
Van Gool, Luc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1541-1550
[8]  
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[9]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[10]   RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos [J].
Du, Wenbin ;
Wang, Yali ;
Qiao, Yu .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3745-3754