Toward human activity recognition: a survey

被引:78
作者
Saleem, Gulshan [1 ]
Bajwa, Usama Ijaz [1 ]
Raza, Rana Hammad [2 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus,1-5 KM Def Rd Off Raiwind Rd, Lahore, Pakistan
[2] Natl Univ Sci & Technol NUST, Pakistan Navy Engn Coll PNEC, Elect & Power Engn Dept, Habib Ibrahim Rehmatullah Rd, Karachi, Pakistan
关键词
Activity recognition; Action recognition; Video datasets; Deep learning; Handcrafted features; Video analysis; Computer vision; CONVOLUTIONAL NEURAL-NETWORKS; RECOGNIZING HUMAN ACTIONS; IMAGE CLASSIFICATION; MOTION; QUALITY; MODEL; SILHOUETTE; LSTM; SEGMENTATION; PROPAGATION;
D O I
10.1007/s00521-022-07937-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human activity recognition (HAR) is a complex and multifaceted problem. The research community has reported numerous approaches to perform HAR. Along with HAR approaches, various surveys have revealed HAR trends in various environments and applications. HAR is linked to a variety of technology-dependent daily life systems, such as human-computer interaction systems, security surveillance, video surveillance, healthcare surveillance, robotics, content-based information retrieval, and monitoring systems. Because of technological advancements, HAR trends change quickly and necessitate an up-to-date and broader perspective. This study offers an HAR taxonomy, which includes online/offline HAR, multimodal/unimodal HAR, handcrafted feature-based, and learning-based approaches. This study attempts to present the multidisciplinary nature of HAR, such as application areas, activity types, task complexities, benchmark datasets, and/methods. This research includes a comparative analysis of state-of-the-art HAR methods and a discussion of popular datasets. The selected studies have been categorized using taxonomy, and different attributes such as activity complexity, dataset size, and recognition rate have been used for their analysis. The comparative analysis of HAR approaches has also helped to highlight domain challenges and open research directions for HAR researchers to follow.
引用
收藏
页码:4145 / 4182
页数:38
相关论文
共 246 条
[71]   Going deeper into action recognition: A survey [J].
Herath, Samitha ;
Harandi, Mehrtash ;
Porikli, Fatih .
IMAGE AND VISION COMPUTING, 2017, 60 :4-21
[72]   Real-Time RGB-D Activity Prediction by Soft Regression [J].
Hu, Jian-Fang ;
Zheng, Wei-Shi ;
Ma, Lianyang ;
Wang, Gang ;
Lai, Jianhuang .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :280-296
[73]   Action Recognition Algorithm of Spatio-Temporal Differential LSTM Based on Feature Enhancement [J].
Hu, Kai ;
Zheng, Fei ;
Weng, Liguo ;
Ding, Yiwu ;
Jin, Junlan .
APPLIED SCIENCES-BASEL, 2021, 11 (17)
[74]   Action Detection in Complex Scenes with Spatial and Temporal Ambiguities [J].
Hu, Yuxiao ;
Cao, Liangliang ;
Lv, Fengjun ;
Yan, Shuicheng ;
Gong, Yihong ;
Huang, Thomas S. .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :128-135
[75]   An Advanced Motion Detection Algorithm with Video Quality Analysis for Video Surveillance Systems [J].
Huang, Shih-Chia .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (01) :1-14
[76]  
Ijsselmuiden J, 2010, LECT NOTES ARTIF INT, V6359, P426, DOI 10.1007/978-3-642-16111-7_49
[77]  
Ikizler-Cinbis N, 2010, LECT NOTES COMPUT SC, V6311, P494, DOI 10.1007/978-3-642-15549-9_36
[78]   Occlusion-aware optical flow estimation [J].
Ince, Serdar ;
Konrad, Janusz .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (08) :1443-1451
[79]  
Intille SS, 1999, SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), P518
[80]   Better exploiting motion for better action recognition [J].
Jain, Mihir ;
Jegou, Herve ;
Bouthemy, Patrick .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2555-2562