EDCAR: A knowledge representation framework to enhance automatic video surveillance

被引:33
作者
Caruccio, Loredana [1 ]
Polese, Giuseppe [1 ]
Tortora, Genoveffa [1 ]
Iannone, Daniele [2 ]
机构
[1] Univ Salerno, Dept Comp Sci, Via Giovanni Paolo II 132, Fisciano, SA, Italy
[2] Datonix SpA, Via Francesco De Sanctis 2, Avella, AV, Italy
关键词
Knowledge representation framework; Video scenario; Action composition; Event recognition; Video surveillance; RECOGNITION; SPACE; WEB;
D O I
10.1016/j.eswa.2019.04.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main purpose of video-based event recognition is to interpret activities or behaviors within video sequences, in order to detect and isolate specific events, which have to be readily recognized and prompted to the people responsible for their monitoring. In this paper, we present a knowledge representation framework and a system for automatic video surveillance, which analyzes record scenes in order to detect the occurrence of specific events defined as targets. The framework, named Elements and Descriptors of Context and Action Representations (EDCAR), enables the representation of relevant elements, general descriptors of the context, and actions that have to be captured, including the definition of action compositions and sequences, in order to monitor and recognize abnormal situations. EDCAR and the associated system also support video summarization of relevant scenes, providing an inference engine to handle complex queries. They have been used experimentally on several video surveillance scenarios, which enabled us to prove their effectiveness with respect to similar solutions described in the literature. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:190 / 207
页数:18
相关论文
共 50 条
[1]   Automatic visual detection of human behavior: A review from 2000 to 2014 [J].
Afsar, Palwasha ;
Cortez, Paulo ;
Santos, Henrique .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (20) :6935-6956
[2]   Human Activity Analysis: A Review [J].
Aggarwal, J. K. ;
Ryoo, M. S. .
ACM COMPUTING SURVEYS, 2011, 43 (03)
[3]  
[Anonymous], UT-Interaction Dataset, ICPR contest on Semantic Description of Human Activities (SDHA)
[4]  
[Anonymous], 2006, P IEEE COMP SOC C CO
[5]   Coupled hidden Markov models for complex action recognition [J].
Brand, M ;
Oliver, N ;
Pentland, A .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :994-999
[6]   An Ontology for Event Detection and its Application in Surveillance Video [J].
Carlos SanMiguel, Juan ;
Martinez, Jose M. ;
Garcia, Alvaro .
AVSS: 2009 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, 2009, :220-225
[7]   Understanding user intent on the web through interaction mining [J].
Caruccio, Loredana ;
Deufemia, Vincenzo ;
Polese, Giuseppe .
JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2015, 31 :230-236
[8]   Intelligent surveillance system with integration of heterogeneous information for intrusion detection [J].
Castro, J. L. ;
Delgado, M. ;
Medina, J. ;
Ruiz-Lozano, M. D. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) :11182-11192
[9]   A survey of video datasets for human action and activity recognition [J].
Chaquet, Jose M. ;
Carmona, Enrique J. ;
Fernandez-Caballero, Antonio .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (06) :633-659
[10]   Facial expression recognition from video sequences: temporal and static modeling [J].
Cohen, I ;
Sebe, N ;
Garg, A ;
Chen, LS ;
Huang, TS .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 91 (1-2) :160-187