BEST: Benchmark and Evaluation of Surveillance Task

被引：2

作者：

Zhang, Chongyang ^{[1
]}

Ni, Bingbing ^{[1
]}

Song, Li ^{[1
]}

Zhai, Guangtao ^{[1
]}

Yang, Xiaokang ^{[1
]}

Zhang, Wenjun ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China

来源：

COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III | 2017年 / 10118卷

关键词：

D O I：

10.1007/978-3-319-54526-4_29

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Smart/Intelligent video surveillance technology plays the central role in the emerging smart city systems. Most intelligent visual algorithms require large-scale image/video datasets to train classifiers or acquire discriminative features using machine learning. However, most existing datasets are collected from non-surveillance conditions, which have significant differences as compared to the practical surveillance data. As a consequence, many existing intelligent visual algorithms trained on traditional datasets perform not so well in the real world surveillance applications. We believe the lack of high quality surveillance datasets has greatly limited the application of the computer vision algorithms in practical surveillance scenarios. To solve this problem, one large-scale and comprehensive surveillance image and video database and test platform, called Benchmark and Evaluation of Surveillance Task (abbreviated as BEST), is developed in this work. The original images and videos in BEST were all collected from on-using surveillance cameras, and have been carefully selected to cover a wide and balanced range of outdoor surveillance scenarios. Compared with the existing surveillance/non-surveillance datasets, the proposed BEST dataset provides a realistic, extensive and diversified testbed for a more comprehensive performance evaluation. Our experimental results show that, performance of seven pedestrian detection algorithms on BEST is worse than that on the existing datasets. This highlights the difference between non-surveillance data and real surveillance data, which is the major cause of the performance decreases. The dataset is open to the public and can be downloaded at: http://ivlab.sjtu.edu.cn/best/Data/List/Datasets.

引用

页码：393 / 407

页数：15

共 23 条

[1]

[Anonymous], P NEURAL INFORM PROC

[2]

[Anonymous], IEEE I CONF COMP VIS

[3]

[Anonymous], 13 AS C COMP VIS WOR

[4]

[Anonymous], JOINT C 4 INT C INF

[5]

[Anonymous], IEEE T SOFTW ENG

[6]

[Anonymous], 2014, ADV NEURAL INFORM PR

[7] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[8]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[9] Fast Feature Pyramids for Object Detection [J].

Dollar, Piotr ;

Appel, Ron ;

Belongie, Serge ;

Perona, Pietro .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545

[10] Pedestrian Detection: An Evaluation of the State of the Art [J].

Dollar, Piotr ;

Wojek, Christian ;

Schiele, Bernt ;

Perona, Pietro .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761

← 1 2 3 →