Spatiotemporal KSVD Dictionary Learning for Online Multi-target Tracking

被引:2
作者
Manh, Huunh [1 ]
Alaghband, Gita [1 ]
机构
[1] Univ Colorado, Dept Comp Sci & Engn, Denver, CO 80202 USA
来源
2018 15TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV) | 2018年
关键词
multi-target tracking; dictionary learning; online appearance learning; DISCRIMINATIVE DICTIONARY; K-SVD; SPARSE;
D O I
10.1109/CRV.2018.00030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present a new spatiotemporal discriminative KSVD dictionary algorithm (STKSVD) for learning target appearance in online multi-target tracking system. Different from other classification/recognition tasks (e.g. face, image recognition), learning target's appearance in online multi-target tracking is impacted by factors such as: posture/articulation changes, partial occlusion by background scene or other targets, background changes (human detection bounding box covers both human parts and part of the scene), etc. However, we observe that these variations occur gradually relative to spatial and temporal dynamics. We characterize the spatial and temporal information between target's samples through a new STKSVD appearance learning algorithm to better discriminate targets. Our STKSVD method is able to learn discriminative sparse code, linear classifier parameters, and minimize reconstruction error in single optimization system. Our appearance learning algorithm and tracking framework employs two different methods of calculating appearance similarity score in each stage of a two-stage association: a linear classifier in the first stage, and minimum residual errors in the second stage. The results tested using 2DMOT2015 dataset and its public Aggregated Channel Features (ACF) human detection for all comparisons show that our method outperforms the existing related learning methods.
引用
收藏
页码:150 / 157
页数:8
相关论文
共 30 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]   Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].
Alahi, Alexandre ;
Goel, Kratarth ;
Ramanathan, Vignesh ;
Robicquet, Alexandre ;
Li Fei-Fei ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :961-971
[3]   Robust Online Multi-Object Tracking based on Tracklet Confidence and Online Discriminative Appearance Learning [J].
Bae, Seung-Hwan ;
Yoon, Kuk-Jin .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1218-1225
[4]  
Bao CL, 2012, PROC CVPR IEEE, P1830, DOI 10.1109/CVPR.2012.6247881
[5]  
Benfold B., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3457, DOI 10.1109/CVPR.2011.5995667
[6]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[7]   Fast Feature Pyramids for Object Detection [J].
Dollar, Piotr ;
Appel, Ron ;
Belongie, Serge ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545
[8]   Image denoising via sparse and redundant representations over learned dictionaries [J].
Elad, Michael ;
Aharon, Michal .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (12) :3736-3745
[9]  
Fagot-Bouquet L, 2015, IEEE IMAGE PROC, P2414, DOI 10.1109/ICIP.2015.7351235
[10]  
Golub G. H., 2006, SIAM J MATRIX ANAL A