Linear Support Tensor Machine With LSK Channels: Pedestrian Detection in Thermal Infrared Images

被引:61
作者
Biswas, Sujoy Kumar [1 ]
Milanfar, Peyman [2 ]
机构
[1] IBM Corp, Almaden Res Ctr, 650 Harry Rd, San Jose, CA 95120 USA
[2] Google Res, Computat Imaging Image Proc Team, Mountain View, CA 94043 USA
关键词
Object detection; pedestrian detection; local steering kernel; lsk channels; maximum margin method; support tensor machine; OBJECT;
D O I
10.1109/TIP.2017.2705426
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian detection in thermal infrared images poses unique challenges because of the low resolution and noisy nature of the image. Here, we propose a mid-level attribute in the form of the multidimensional template, or tensor, using local steering kernel (LSK) as low-level descriptors for detecting pedestrians in far infrared images. LSK is specifically designed to deal with intrinsic image noise and pixel level uncertainty by capturing local image geometry succinctly instead of collecting local orientation statistics (e.g., histograms in histogram of oriented gradients). In order to learn the LSK tensor, we introduce a new image similarity kernel following the popular maximum margin framework of support vector machines facilitating a relatively short and simple training phase for building a rigid pedestrian detector. Tensor representation has several advantages, and indeed, LSK templates allow exact acceleration of the sluggish but de facto sliding window-based detection methodology with multichannel discrete Fourier transform, facilitating very fast and efficient pedestrian localization. The experimental studies on publicly available thermal infrared images justify our proposals and model assumptions. In addition, the proposed work also involves the release of our in-house annotations of pedestrians in more than 17 000 frames of OSU color thermal database for the purpose of sharing with the research community.
引用
收藏
页码:4229 / 4242
页数:14
相关论文
共 55 条
[1]  
[Anonymous], 2001, Learning with Kernels |
[2]  
[Anonymous], 2007, P IEEE C COMP VIS PA, DOI DOI 10.1109/CVPR.2007.383003
[3]  
[Anonymous], PROC CVPR IEEE
[4]  
[Anonymous], IEEE OTCBVC WS SERIE
[5]  
[Anonymous], 2008, VLFeat: An open and portable library of computer vision algorithms
[6]  
[Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199
[7]  
[Anonymous], 2016, PIOTRS COMPUTER VISI
[8]   Algorithm 862: MATLAB tensor classes for fast algorithm prototyping [J].
Bader, Brett W. ;
Kolda, Tamara G. .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2006, 32 (04) :635-653
[9]   Ten Years of Pedestrian Detection, What Have We Learned? [J].
Benenson, Rodrigo ;
Omran, Mohamed ;
Hosang, Jan ;
Schiele, Bernt .
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT II, 2015, 8926 :613-627
[10]   Seeking the strongest rigid detector [J].
Benenson, Rodrigo ;
Mathias, Markus ;
Tuytelaars, Tinne ;
Van Gool, Luc .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3666-3673