Semantic human activity recognition: A literature review

被引:160
作者
Ziaeefard, Maryarn [1 ]
Bergevin, Robert [1 ]
机构
[1] Univ Laval, Dept Elect & Comp Engn, Comp Vis & Syst Lab, Quebec City, PQ G1V 0A6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Human activity recognition; Pose; Poselet; Attribute; Human-object interaction; Scene; Survey; EVENT RECOGNITION; HUMAN MOVEMENT; OBJECT; BODY; REPRESENTATION; SELECTIVITY; TRACKING; MODEL;
D O I
10.1016/j.patcog.2015.03.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an overview of state-of-the-art methods in activity recognition using semantic features. Unlike low-level features, semantic features describe inherent characteristics of activities. Therefore, semantics make the recognition task more reliable especially when the same actions look visually different due to the variety of action executions. We define a semantic space including the most popular semantic features of an action namely the human body (pose and poselet), attributes, related objects, and scene context. We present methods exploiting these semantic features to recognize activities from still images and video data as well as four groups of activities: atomic actions, people interactions, human-object interactions, and group activities. Furthermore, we provide potential applications of semantic approaches along with directions for future research. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2329 / 2345
页数:17
相关论文
共 118 条
[21]  
Bourdev L, 2011, IEEE I CONF COMP VIS, P1543, DOI 10.1109/ICCV.2011.6126413
[22]   Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations [J].
Bourdev, Lubomir ;
Malik, Jitendra .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1365-1372
[23]  
Bourdev L, 2010, LECT NOTES COMPUT SC, V6316, P168, DOI 10.1007/978-3-642-15567-3_13
[24]  
Breiman L., 2001, Learn, V45, P5
[25]   Recognize Human Activities from Partially Observed Videos [J].
Cao, Yu ;
Barrett, Daniel ;
Barbu, Andrei ;
Narayanaswamy, Siddharth ;
Yu, Haonan ;
Michaux, Aaron ;
Lin, Yuewei ;
Dickinson, Sven ;
Siskind, Jeffrey Mark ;
Wang, Song .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2658-2665
[26]   MOTION-BASED RECOGNITION - A SURVEY [J].
CEDRAS, C ;
SHAH, M .
IMAGE AND VISION COMPUTING, 1995, 13 (02) :129-155
[27]  
Cheema S., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P1302, DOI 10.1109/ICCVW.2011.6130402
[28]   Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots [J].
Chen, Chao-Yeh ;
Grauman, Kristen .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :572-579
[29]   Uncertainty Reasoning Based Formal Framework for Big Video Data Understanding [J].
Chen, Shuwei ;
Clawson, Kathy ;
Jing, Min ;
Liu, Jun ;
Wang, Hui ;
Scotney, Bryan .
2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014, :487-494
[30]   A Hierarchical Human Activity Recognition Framework Based on Automated Reasoning [J].
Chen, Shuwei ;
Liu, Jun ;
Wang, Hui ;
Augusto, Juan Carlos .
2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, :3495-3499