A Spatiotemporal Saliency Model for Video Surveillance

被引:62
|
作者
Tong Yubing [1 ]
Cheikh, Faouzi Alaya [2 ]
Guraya, Fahad Fazal Elahi [2 ]
Konik, Hubert [1 ]
Tremeau, Alain [1 ]
机构
[1] Univ St Etienne, Lab Hubert Crurien, UMR 5516, F-42000 St Etienne, France
[2] Gjovik Univ Coll, Fac Comp Sci & Media Technol, Gjovik, Norway
关键词
Visual saliency; Motion saliency; Background subtraction; Center-surround saliency; Face detection; Video surveillance; ATTENTION; OBJECTS;
D O I
10.1007/s12559-010-9094-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A video sequence is more than a sequence of still images. It contains a strong spatial-temporal correlation between the regions of consecutive frames. The most important characteristic of videos is the perceived motion foreground objects across the frames. The motion of foreground objects dramatically changes the importance of the objects in a scene and leads to a different saliency map of the frame representing the scene. This makes the saliency analysis of videos much more complicated than that of still images. In this paper, we investigate saliency in video sequences and propose a novel spatiotemporal saliency model devoted for video surveillance applications. Compared to classical saliency models based on still images, such as Itti's model, and space-time saliency models, the proposed model is more correlated to visual saliency perception of surveillance videos. Both bottom-up and top-down attention mechanisms are involved in this model. Stationary saliency and motion saliency are, respectively, analyzed. First, a new method for background subtraction and foreground extraction is developed based on content analysis of the scene in the domain of video surveillance. Then, a stationary saliency model is setup based on multiple features computed from the foreground. Every feature is analyzed with a multi-scale Gaussian pyramid, and all the features conspicuity maps are combined using different weights. The stationary model integrates faces as a supplement feature to other low level features such as color, intensity and orientation. Second, a motion saliency map is calculated using the statistics of the motion vectors field. Third, both motion saliency map and stationary saliency map are merged based on center-surround framework defined by an approximated Gaussian function. The video saliency maps computed from our model have been compared to the gaze maps obtained from subjective experiments with SMI eye tracker for surveillance video sequences. The results show strong correlation between the output of the proposed spatiotemporal saliency model and the experimental gaze maps.
引用
收藏
页码:241 / 263
页数:23
相关论文
共 50 条
  • [21] A Multi-layer Scene Model for Video Surveillance Applications
    Huang, Chung-Hsien
    Wu, Ruei-Cheng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 68 - 79
  • [22] An efficient saliency prediction model for Unmanned Aerial Vehicle video
    Zhang, Kao
    Chen, Zhenzhong
    Li, Songnan
    Liu, Shan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 194 : 152 - 166
  • [23] COMPARISON OF VISUAL SALIENCY MODELS FOR COMPRESSED VIDEO
    Khatoonabadi, Sayed Hossein
    Bajic, Ivan V.
    Shan, Yufeng
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1081 - 1085
  • [24] Visual saliency guided video compression algorithm
    Gupta, Rupesh
    Khanna, Meera Thapar
    Chaudhury, Santanu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (09) : 1006 - 1022
  • [25] A DATASET AND EVALUATION METHODOLOGY FOR VISUAL SALIENCY IN VIDEO
    Li, Jia
    Tian, Yonghong
    Huang, Tiejun
    Gao, Wen
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 442 - +
  • [26] A Saliency Based Approach for Foreground Extraction from a Video
    Ahsan, Sk Md Masudul
    Nafew, Abu Naser Md
    Amit, Rifat Haque
    2017 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT 2017), 2017,
  • [27] Multiview human activity recognition system based on spatiotemporal template for video surveillance system
    Kushwaha, Alok Kumar Singh
    Srivastava, Rajeev
    JOURNAL OF ELECTRONIC IMAGING, 2015, 24 (05)
  • [28] Motion Detection for Video Surveillance
    Singh, Birmohan
    Singh, Dalwinder
    Singh, Gurwinder
    Sharma, Neeraj
    Sibbal, Vicky
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROPAGATION AND COMPUTER TECHNOLOGY (ICSPCT 2014), 2014, : 578 - 584
  • [29] Visual Saliency Detection Using Spatiotemporal Decomposition
    Bhattacharya, Saumik
    Venkatesh, K. Subramanian
    Gupta, Sumana
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (04) : 1665 - 1675
  • [30] Motion Saliency Maps from Spatiotemporal Filtering
    Belardinelli, Anna
    Pirri, Fiora
    Carbone, Andrea
    ATTENTION IN COGNITIVE SYSTEMS, 2009, 5395 : 112 - 123