Scene-Specific Pedestrian Detection for Static Video Surveillance

被引:108
作者
Wang, Xiaogang [1 ]
Wang, Meng [1 ]
Li, Wei [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Ho Sin Hang Engn Bldg, Shatin, Hong Kong, Peoples R China
关键词
Pedestrian detection; transfer learning; confidence-encoded SVM; domain adaptation; video surveillance;
D O I
10.1109/TPAMI.2013.124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of a generic pedestrian detector may drop significantly when it is applied to a specific scene due to the mismatch between the source training set and samples from the target scene. We propose a new approach of automatically transferring a generic pedestrian detector to a scene-specific detector in static video surveillance without manually labeling samples from the target scene. The proposed transfer learning framework consists of four steps. 1) Through exploring the indegrees from target samples to source samples on a visual affinity graph, the source samples are weighted to match the distribution of target samples. 2) It explores a set of context cues to automatically select samples from the target scene, predicts their labels, and computes confidence scores to guide transfer learning. 3) The confidence scores propagate among target samples according to their underlying visual structures. 4) Target samples with higher confidence scores have larger influence on training scene-specific detectors. All these considerations are formulated under a single objective function called confidence-encoded SVM, which avoids hard thresholding on confidence scores. During test, only the appearance-based detector is used without context cues. The effectiveness is demonstrated through experiments on two video surveillance data sets. Compared with a generic detector, it improves the detection rates by 48 and 36 percent at one false positive per image (FPPI) on the two data sets, respectively. The training process converges after one or two iterations on the data sets in experiments.
引用
收藏
页码:361 / 374
页数:14
相关论文
共 55 条
[21]  
Benenson R., 2012, P P IEEE C COMP VIS
[22]  
Bourdev L., 2009, P 12 IEEE INT C COMP
[23]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[24]  
Dalal N., CVPR, P886, DOI [10.1109/CVPR.2005.177, DOI 10.1109/CVPR.2005.177]
[25]  
Daume III H., 2010, P WORKSH DOM AD NAT
[26]   Pedestrian Detection: An Evaluation of the State of the Art [J].
Dollar, Piotr ;
Wojek, Christian ;
Schiele, Bernt ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761
[27]  
Duan LX, 2009, PROC CVPR IEEE, P1375, DOI [10.1109/CVPRW.2009.5206747, 10.1109/CVPR.2009.5206747]
[28]  
Enzweiler M., 2010, PROC IEEE CONF COMPU
[29]   Monocular Pedestrian Detection: Survey and Experiments [J].
Enzweiler, Markus ;
Gavrila, Dariu M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (12) :2179-2195
[30]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871