A Spatiotemporal Saliency Model for Video Surveillance

被引:62
|
作者
Tong Yubing [1 ]
Cheikh, Faouzi Alaya [2 ]
Guraya, Fahad Fazal Elahi [2 ]
Konik, Hubert [1 ]
Tremeau, Alain [1 ]
机构
[1] Univ St Etienne, Lab Hubert Crurien, UMR 5516, F-42000 St Etienne, France
[2] Gjovik Univ Coll, Fac Comp Sci & Media Technol, Gjovik, Norway
关键词
Visual saliency; Motion saliency; Background subtraction; Center-surround saliency; Face detection; Video surveillance; ATTENTION; OBJECTS;
D O I
10.1007/s12559-010-9094-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A video sequence is more than a sequence of still images. It contains a strong spatial-temporal correlation between the regions of consecutive frames. The most important characteristic of videos is the perceived motion foreground objects across the frames. The motion of foreground objects dramatically changes the importance of the objects in a scene and leads to a different saliency map of the frame representing the scene. This makes the saliency analysis of videos much more complicated than that of still images. In this paper, we investigate saliency in video sequences and propose a novel spatiotemporal saliency model devoted for video surveillance applications. Compared to classical saliency models based on still images, such as Itti's model, and space-time saliency models, the proposed model is more correlated to visual saliency perception of surveillance videos. Both bottom-up and top-down attention mechanisms are involved in this model. Stationary saliency and motion saliency are, respectively, analyzed. First, a new method for background subtraction and foreground extraction is developed based on content analysis of the scene in the domain of video surveillance. Then, a stationary saliency model is setup based on multiple features computed from the foreground. Every feature is analyzed with a multi-scale Gaussian pyramid, and all the features conspicuity maps are combined using different weights. The stationary model integrates faces as a supplement feature to other low level features such as color, intensity and orientation. Second, a motion saliency map is calculated using the statistics of the motion vectors field. Third, both motion saliency map and stationary saliency map are merged based on center-surround framework defined by an approximated Gaussian function. The video saliency maps computed from our model have been compared to the gaze maps obtained from subjective experiments with SMI eye tracker for surveillance video sequences. The results show strong correlation between the output of the proposed spatiotemporal saliency model and the experimental gaze maps.
引用
收藏
页码:241 / 263
页数:23
相关论文
共 50 条
  • [1] A Spatiotemporal Saliency Model for Video Surveillance
    Tong Yubing
    Faouzi Alaya Cheikh
    Fahad Fazal Elahi Guraya
    Hubert Konik
    Alain Trémeau
    Cognitive Computation, 2011, 3 : 241 - 263
  • [2] PREDICTIVE VISUAL SALIENCY MODEL FOR SURVEILLANCE VIDEO
    Guraya, Fahad Fazal Elahi
    Cheikh, Faouzi Alaya
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 554 - 558
  • [3] Spatiotemporal Saliency Detection in Traffic Surveillance
    Li, Wei
    Setiawan, Dhoni Putra
    Zhao, Hua-An
    2017 INTERNATIONAL CONFERENCE ON CONTROL, ELECTRONICS, RENEWABLE ENERGY AND COMMUNICATIONS (ICCREC), 2017, : 139 - 142
  • [4] A New Method for Spatiotemporal Textual Saliency Detection in Video
    Shan, Susu
    Xu, Hailiang
    Su, Feng
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3240 - 3245
  • [5] VIDEO SALIENCY DETECTION BASED ON SPATIOTEMPORAL FEATURE LEARNING
    Lee, Se-Ho
    Kim, Jin-Hwan
    Choi, Kwang Pyo
    Sim, Jae-Young
    Kim, Chang-Su
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1120 - 1124
  • [6] Saliency-Based Spatiotemporal Attention for Video Captioning
    Chen, Yangyu
    Zhang, Weigang
    Wang, Shuhui
    Li, Liang
    Huang, Qingming
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [7] VIDEO SALIENCY INCORPORATING SPATIOTEMPORAL CUES AND UNCERTAINTY WEIGHTING
    Fang, Yuming
    Wang, Zhou
    Lin, Weisi
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [8] Multi-Feature Based Visual Saliency Detection in Surveillance Video
    Tong, Yubing
    Konik, Hubert
    Cheikh, Faouzi Alaya
    Guraya, Fahad Fazal Elahi
    Tremeau, Alain
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2010, 2010, 7744
  • [9] Robust segmentation of moving objects in video based on spatiotemporal visual saliency and active contour model
    Ramadan, Hiba
    Tairi, Hamid
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (06)
  • [10] Spatiotemporal Saliency in Dynamic Scenes
    Mahadevan, Vijay
    Vasconcelos, Nuno
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (01) : 171 - 177