Extensive Benchmark and Survey of Modeling Methods for Scene Background Initialization

被引:52
作者
Jodoin, Pierre-Marc [1 ]
Maddalena, Lucia [2 ]
Petrosino, Alfredo [3 ]
Wang, Yi [1 ]
机构
[1] Univ Sherbrooke, Sherbrooke, PQ J1K 2R1, Canada
[2] CNR, I-80131 Naples, Italy
[3] Univ Naples Parthenope, I-80143 Naples, Italy
基金
加拿大自然科学与工程研究理事会;
关键词
Background initialization; video analysis; survey; benchmarking; VIDEO; IMAGE; SEGMENTATION; SUBTRACTION; TRACKING; SHADOWS; OBJECTS;
D O I
10.1109/TIP.2017.2728181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene background initialization is the process by which a method tries to recover the background image of a video without foreground objects in it. Having a clear understanding about which approach is more robust and/or more suited to a given scenario is of great interest to many end users or practitioners. The aim of this paper is to provide an extensive survey of scene background initialization methods as well as a novel benchmarking framework. The proposed framework involves several evaluation metrics and state-of-the-art methods, as well as the largest video data set ever made for this purpose. The data set consists of several camera-captured videos that: 1) span categories focused on various background initialization challenges; 2) are obtained with different cameras of different lengths, frame rates, spatial resolutions, lighting conditions, and levels of compression; and 3) contain indoor and outdoor scenes. The wide variety of our data set prevents our analysis from favoring a certain family of background initialization methods over others. Our evaluation framework allows us to quantitatively identify solved and unsolved issues related to scene background initialization. We also identify scenarios for which state-of-the-art methods systematically fail.
引用
收藏
页码:5244 / 5256
页数:13
相关论文
共 85 条
[1]   Interactive digital photomontage [J].
Agarwala, A ;
Dontcheva, M ;
Agrawala, M ;
Drucker, S ;
Colburn, A ;
Curless, B ;
Salesin, D ;
Cohen, M .
ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03) :294-302
[2]  
Ali S., 2007, P IEEE COMP SOC C CO, P18
[3]   A robust video foreground segmentation by using generalized Gaussian mixture modeling [J].
Allili, Mohand Saied ;
Bouguila, Nizar ;
Ziou, Djemel .
FOURTH CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, PROCEEDINGS, 2007, :503-+
[4]   Linguistic summarization of video for fall detection using voxel person and fuzzy logic [J].
Anderson, Derek ;
Luke, Robert H. ;
Keller, James M. ;
Skubic, Marjorie ;
Rantz, Marilyn ;
Aud, Myra .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (01) :80-89
[5]  
[Anonymous], 2014, P IEEE C COMP VIS PA
[6]  
[Anonymous], IEEE T CIRCUITS SYST
[7]  
[Anonymous], 2008, COMP VIS PATT REC 20
[8]  
[Anonymous], 1949, Extrapolation, interpolation, and smoothing of stationary time series
[9]  
[Anonymous], 2000, P EUR C COMP VIS
[10]  
[Anonymous], 2005, P 3 ACM INT WORKSH V