A Spatiotemporal Saliency Model for Video Surveillance

Cited by: 62

Authors:

Tong, Yubing [1]

Cheikh, Faouzi Alaya [2]

Guraya, Fahad Fazal Elahi [2]

Konik, Hubert [1]

Tremeau, Alain [1]

Affiliations:

[1] Univ St Etienne, Lab Hubert Curien, UMR 5516, F-42000 St Etienne, France

[2] Gjovik Univ Coll, Fac Comp Sci & Media Technol, Gjovik, Norway

Source:

COGNITIVE COMPUTATION | 2011 / Vol. 3 / No. 01

Keywords:

Visual saliency; Motion saliency; Background subtraction; Center-surround saliency; Face detection; Video surveillance; ATTENTION; OBJECTS;

DOI:

10.1007/s12559-010-9094-8

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A video sequence is more than a sequence of still images. It contains a strong spatial-temporal correlation between the regions of consecutive frames. The most important characteristic of videos is the perceived motion of foreground objects across the frames. The motion of foreground objects dramatically changes the importance of the objects in a scene and leads to a different saliency map of the frame representing the scene. This makes the saliency analysis of videos much more complicated than that of still images. In this paper, we investigate saliency in video sequences and propose a novel spatiotemporal saliency model dedicated to video surveillance applications. Compared to classical saliency models based on still images, such as Itti's model, and to space-time saliency models, the proposed model correlates better with the visual saliency perception of surveillance videos. Both bottom-up and top-down attention mechanisms are involved in this model. Stationary saliency and motion saliency are analyzed separately. First, a new method for background subtraction and foreground extraction is developed based on content analysis of the scene in the domain of video surveillance. Then, a stationary saliency model is set up based on multiple features computed from the foreground. Every feature is analyzed with a multi-scale Gaussian pyramid, and all the feature conspicuity maps are combined using different weights. The stationary model integrates faces as a supplementary feature to other low-level features such as color, intensity and orientation. Second, a motion saliency map is calculated using the statistics of the motion vector field. Third, the motion saliency map and the stationary saliency map are merged based on a center-surround framework defined by an approximated Gaussian function. The video saliency maps computed from our model have been compared to the gaze maps obtained from subjective experiments with an SMI eye tracker for surveillance video sequences. The results show a strong correlation between the output of the proposed spatiotemporal saliency model and the experimental gaze maps.
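
The fusion steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no weights or formulas, so the function names, the weights, the `alpha` mixing factor, the `sigma_frac` width, and the use of a frame-centered Gaussian as a stand-in for the "approximated Gaussian function" are all assumptions made here for illustration. Background subtraction, the Gaussian pyramid, and face detection are omitted.

```python
import numpy as np

def normalize(m):
    """Rescale a map to [0, 1]; a constant map becomes all zeros."""
    m = np.asarray(m, dtype=float)
    span = m.max() - m.min()
    return (m - m.min()) / span if span > 0 else np.zeros_like(m)

def combine_conspicuity(maps, weights):
    """Weighted sum of normalized feature conspicuity maps (weights are assumed)."""
    total = sum(w * normalize(m) for m, w in zip(maps, weights))
    return normalize(total)

def motion_saliency(flow):
    """Simple statistic of the motion vector field (an assumption, not the
    paper's formula): deviation of each vector magnitude from the frame mean."""
    mag = np.hypot(flow[..., 0], flow[..., 1])
    return normalize(np.abs(mag - mag.mean()))

def gaussian_center_weight(shape, sigma_frac=0.3):
    """2-D Gaussian peaked at the frame center (hypothetical stand-in)."""
    h, w = shape
    ys = np.arange(h) - (h - 1) / 2.0
    xs = np.arange(w) - (w - 1) / 2.0
    sy, sx = sigma_frac * h, sigma_frac * w
    return np.exp(-(ys[:, None] ** 2) / (2 * sy ** 2)
                  - (xs[None, :] ** 2) / (2 * sx ** 2))

def spatiotemporal_saliency(stationary, motion, alpha=0.5):
    """Blend stationary and motion maps, then apply the Gaussian weighting."""
    fused = alpha * normalize(stationary) + (1 - alpha) * normalize(motion)
    return normalize(gaussian_center_weight(fused.shape) * fused)

# Usage with synthetic data: two feature maps and one moving pixel.
rng = np.random.default_rng(0)
color, intensity = rng.random((8, 10)), rng.random((8, 10))
stationary = combine_conspicuity([color, intensity], weights=[0.6, 0.4])
flow = np.zeros((8, 10, 2))
flow[4, 5] = [3.0, 4.0]  # a single motion vector of magnitude 5
saliency = spatiotemporal_saliency(stationary, motion_saliency(flow))
```

In this sketch every intermediate map is rescaled to [0, 1] before it is combined, so that no single feature dominates merely because of its numeric range; the actual normalization used in the paper may differ.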


Pages: 241-263

Number of pages: 23
