Benchmarking a large-scale FIR dataset for on-road pedestrian detection

被引:42
作者
Xu, Zhewei [1 ]
Zhuang, Jiajun [2 ]
Liu, Qiong [1 ]
Zhou, Jingkai [1 ]
Peng, Shaowu [1 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Zhongkai Univ Agr & Engn, Coll Computat Sci, Guangzhou 510225, Guangdong, Peoples R China
关键词
FIR pedestrian detection; Faster R-CNN; Convolutional neural networks; Dataset; ROBUST;
D O I
10.1016/j.infrared.2018.11.007
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
Far infrared (FIR) pedestrian detection is an essential module of advanced driver assistance systems (ADAS) at nighttime. Although recent deep learning-based detectors have achieved excellent results on visible images in the daytime, their performance on nighttime FIR images is still unidentified, due to the existing nighttime FIR data set is not sufficient to fully train a deep learning detector. To this end, a nighttime FIR pedestrian dataset with the largest scale at present is introduced in this paper, which is called SCUT (South China University of Technology) dataset. The dataset contains fine-grained annotated videos recorded from variable road scenes. In addition, a detailed statistical analysis of the dataset is provided and four representative pedestrian detection methods are evaluated. Benefit from the volume and diversity of training data, the experiment results show that convolutional neural networks (CNN) based detectors obtained good performance on FIR image. To further explore the performance of pedestrian detection on FIR images, four important modifications on Faster R-CNN were studied and a strong baseline for SCUT dataset was proposed, which achieves the best detection result by reducing log-average miss rate from 36.78% to 17.73%. The dataset will be public online (SCUT dataset will be published on: https://github.com/SCUT-CV/SCUT_FIR_Pedestrian_Dataset).
引用
收藏
页码:199 / 208
页数:10
相关论文
共 44 条
[1]  
[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.141
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], REPULSION LOSS DETEC
[4]  
[Anonymous], 2009, Integral channel featuresJ
[5]  
[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[6]  
[Anonymous], VERY DEEP CONVOLUTIO
[7]  
[Anonymous], YOLOV3 INCREMENTAL I
[8]   Illuminating Pedestrians via Simultaneous Detection & Segmentation [J].
Brazil, Garrick ;
Yin, Xi ;
Liu, Xiaoming .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4960-4969
[9]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[10]  
Choi H, 2016, INT C PATT RECOG, P621, DOI 10.1109/ICPR.2016.7899703