Determining the best suited semantic events for cognitive surveillance

被引：17

作者：

Fernandez, C. ^{[1
]}

Baiget, P. ^{[1
]}

Roca, F. X. ^{[1
]}

Gonzalez, J. ^{[1
]}

机构：

[1] UAB, Comp Vis Ctr, Barcelona 08193, Spain

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2011年 / 38卷 / 04期

关键词：

Cognitive surveillance; Event modeling; Content-based video retrieval; Ontologies; Advanced user interfaces; IMAGE; RETRIEVAL; TRACKING; OBJECT; SYSTEM;

D O I：

10.1016/j.eswa.2010.09.070

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

State-of-the-art systems on cognitive surveillance identify and describe complex events in selected domains, thus providing end-users with tools to easily access the contents of massive video footage. Nevertheless, as the complexity of events increases in semantics and the types of indoor/outdoor scenarios diversify, it becomes difficult to assess which events describe better the scene, and how to model them at a pixel level to fulfill natural language requests. We present an ontology-based methodology that guides the identification, step-by-step modeling, and generalization of the most relevant events to a specific domain. Our approach considers three steps: (1) end-users provide textual evidence from surveilled video sequences; (2) transcriptions are analyzed top-down to build the knowledge bases for event description; and (3) the obtained models are used to generalize event detection to different image sequences from the surveillance domain. This framework produces user-oriented knowledge that improves on existing advanced interfaces for video indexing and retrieval, by determining the best suited events for video understanding according to end-users. We have conducted experiments with outdoor and indoor scenes showing thefts, chases, and vandalism, demonstrating the feasibility and generalization of this proposal. (C) 2010 Elsevier Ltd. All rights reserved.

引用

页码：4068 / 4079

页数：12

共 27 条

[1] A Constrained Probabilistic Petri Net Framework for Human Activity Detection in Video [J].