A Spatiotemporal Saliency Model for Video Surveillance

被引:62
|
作者
Tong Yubing [1 ]
Cheikh, Faouzi Alaya [2 ]
Guraya, Fahad Fazal Elahi [2 ]
Konik, Hubert [1 ]
Tremeau, Alain [1 ]
机构
[1] Univ St Etienne, Lab Hubert Crurien, UMR 5516, F-42000 St Etienne, France
[2] Gjovik Univ Coll, Fac Comp Sci & Media Technol, Gjovik, Norway
关键词
Visual saliency; Motion saliency; Background subtraction; Center-surround saliency; Face detection; Video surveillance; ATTENTION; OBJECTS;
D O I
10.1007/s12559-010-9094-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A video sequence is more than a sequence of still images. It contains a strong spatial-temporal correlation between the regions of consecutive frames. The most important characteristic of videos is the perceived motion foreground objects across the frames. The motion of foreground objects dramatically changes the importance of the objects in a scene and leads to a different saliency map of the frame representing the scene. This makes the saliency analysis of videos much more complicated than that of still images. In this paper, we investigate saliency in video sequences and propose a novel spatiotemporal saliency model devoted for video surveillance applications. Compared to classical saliency models based on still images, such as Itti's model, and space-time saliency models, the proposed model is more correlated to visual saliency perception of surveillance videos. Both bottom-up and top-down attention mechanisms are involved in this model. Stationary saliency and motion saliency are, respectively, analyzed. First, a new method for background subtraction and foreground extraction is developed based on content analysis of the scene in the domain of video surveillance. Then, a stationary saliency model is setup based on multiple features computed from the foreground. Every feature is analyzed with a multi-scale Gaussian pyramid, and all the features conspicuity maps are combined using different weights. The stationary model integrates faces as a supplement feature to other low level features such as color, intensity and orientation. Second, a motion saliency map is calculated using the statistics of the motion vectors field. Third, both motion saliency map and stationary saliency map are merged based on center-surround framework defined by an approximated Gaussian function. The video saliency maps computed from our model have been compared to the gaze maps obtained from subjective experiments with SMI eye tracker for surveillance video sequences. The results show strong correlation between the output of the proposed spatiotemporal saliency model and the experimental gaze maps.
引用
收藏
页码:241 / 263
页数:23
相关论文
共 50 条
  • [41] Review of background subtraction methods using Gaussian mixture model for video surveillance systems
    Goyal, Kalpana
    Singhai, Jyoti
    ARTIFICIAL INTELLIGENCE REVIEW, 2018, 50 (02) : 241 - 259
  • [42] Review of background subtraction methods using Gaussian mixture model for video surveillance systems
    Kalpana Goyal
    Jyoti Singhai
    Artificial Intelligence Review, 2018, 50 : 241 - 259
  • [43] High Efficiency Video Coding Compliant Perceptual Video Coding Using Entropy Based Visual Saliency Model
    Zeeshan, Muhammad
    Majid, Muhammad
    ENTROPY, 2019, 21 (10)
  • [44] Mining Visitors in Video Surveillance System
    Momin, B. F.
    Jere, Y. R.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [45] Smart Surveillance Based on Video Summarization
    Thomas, Sinnu Susan
    Gupta, Sumana
    Subramanian, Venkatesh K.
    2017 IEEE REGION 10 INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR SMART CITIES (IEEE TENSYMP 2017), 2017,
  • [46] Image categorization based on visual saliency and Bag-of-Words model
    Li, Wenxiang
    Chen, Yanfei
    Wu, Zecheng
    Peng, Hongsheng
    MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [47] Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video
    Li, Jia
    Tian, Yonghong
    Huang, Tiejun
    Gao, Wen
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 90 (02) : 150 - 165
  • [48] VIDEO PRE-ANALYZING AND CODING IN THE CONTEXT OF VIDEO SURVEILLANCE APPLICATIONS
    Ben Hamida, Amal
    Koubaa, Mohamed
    Nicolas, Henri
    Ben Amar, Chokri
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [49] Multi-distribution model for background subtraction in long-term video surveillance system
    Cen, F
    Qi, FH
    Chen, ML
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2002, 21 (01) : 59 - 63
  • [50] A deep learning behavior analysis model for efficient video surveillance using multi pose features
    Shana, L.
    Christopher, C. Seldev
    AIN SHAMS ENGINEERING JOURNAL, 2025, 16 (02)