Learning by tracking: Siamese CNN for robust target association

被引:286
作者
Leal-Taixe, Laura [1 ]
Canton-Ferrer, Cristian [2 ]
Schindler, Konrad [3 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Microsoft Corp, Redmond, WA 98052 USA
[3] ETH, Zurich, Switzerland
来源
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016) | 2016年
关键词
D O I
10.1109/CVPRW.2016.59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel approach to the task of data association within the context of pedestrian tracking, by introducing a two-stage learning scheme to match pairs of detections. First, a Siamese convolutional neural network (CNN) is trained to learn descriptors encoding local spatio-temporal structures between the two input image patches, aggregating pixel values and optical flow information. Second, a set of contextual features derived from the position and size of the compared input patches are combined with the CNN output by means of a gradient boosting classifier to generate the final matching probability. This learning approach is validated by using a linear programming based multi-person tracker showing that even a simple and efficient tracker may outperform much more complex models when fed with our learned matching probabilities. Results on publicly available sequences show that our method meets state-of-the-art standards in multiple people tracking.
引用
收藏
页码:418 / 425
页数:8
相关论文
共 63 条
[1]  
[Anonymous], 2013, ICCV
[2]  
[Anonymous], 2005, CVPR
[3]  
[Anonymous], ARXIV E PRINTS
[4]  
[Anonymous], 2015, ICCV
[5]  
[Anonymous], ICRA
[6]  
[Anonymous], 2015, ICCV
[7]  
[Anonymous], 2012, NIPS
[8]  
[Anonymous], TPAMI
[9]  
[Anonymous], 2015, CVPR
[10]  
[Anonymous], 2011, CVPR