Learning by tracking: Siamese CNN for robust target association

被引:286
作者
Leal-Taixe, Laura [1 ]
Canton-Ferrer, Cristian [2 ]
Schindler, Konrad [3 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] Microsoft Corp, Redmond, WA 98052 USA
[3] ETH, Zurich, Switzerland
来源
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016) | 2016年
关键词
D O I
10.1109/CVPRW.2016.59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel approach to the task of data association within the context of pedestrian tracking, by introducing a two-stage learning scheme to match pairs of detections. First, a Siamese convolutional neural network (CNN) is trained to learn descriptors encoding local spatio-temporal structures between the two input image patches, aggregating pixel values and optical flow information. Second, a set of contextual features derived from the position and size of the compared input patches are combined with the CNN output by means of a gradient boosting classifier to generate the final matching probability. This learning approach is validated by using a linear programming based multi-person tracker showing that even a simple and efficient tracker may outperform much more complex models when fed with our learned matching probabilities. Results on publicly available sequences show that our method meets state-of-the-art standards in multiple people tracking.
引用
收藏
页码:418 / 425
页数:8
相关论文
共 63 条
[21]  
[Anonymous], TPAMI
[22]  
[Anonymous], BLOG ENTRY
[23]  
[Anonymous], GITHUB
[24]  
[Anonymous], TPAMI
[25]  
[Anonymous], 2015, CVPR
[26]  
[Anonymous], 2009, CVPR
[27]  
[Anonymous], 2014, CVPR
[28]  
[Anonymous], 2009, ICCV
[29]  
[Anonymous], ARXIV150606825
[30]  
[Anonymous], 2011, TPAMI