Aggregate Tracklet Appearance Features for Multi-Object Tracking

被引:42
作者
Chen, Long [1 ]
Ai, Haizhou [1 ]
Chen, Rui [1 ]
Zhuang, Zijie [1 ]
机构
[1] Tsinghua Univ, Comp Sci & Technol Dept, Beijing 100084, Peoples R China
关键词
Target tracking; Trajectory; Feature extraction; Aggregates; Training; Benchmark testing; Multi-object tracking; tracklet association; appearance model; spatial-temporal attention; ASSOCIATION;
D O I
10.1109/LSP.2019.2940922
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-object tracking (MOT) has wide applications in the fields of video analysis and signal processing. A major challenge in MOT is how to associate the noisy detections into long and continuous trajectories. In this letter, we address the association problem at the tracklet-level, and mainly focus on the appearance representation designed for tracklets. A multitask convolutional neural network is proposed to learn the discriminative features and spatial-temporal attentions jointly. In particular, we decompose an object in a static image with spatial attentions, and then aggregate multiple features in a tracklet based on the temporal attentions. Appearance misalignment that caused by occlusion and inaccurate bounding is then mitigated by multi-feature aggregation. Experimental results on two challenging MOT benchmarks have demonstrated the effectiveness of the proposed method and shown significant improvement on the quality of tracking identities.
引用
收藏
页码:1613 / 1617
页数:5
相关论文
共 37 条
[1]  
[Anonymous], 2018, ARXIV181107258
[2]   Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].
Bernardin, Keni ;
Stiefelhagen, Rainer .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Chen L, 2017, IEEE IMAGE PROC, P645, DOI 10.1109/ICIP.2017.8296360
[5]   Concurrent lattice infill with feature evolution optimization for additive manufactured heat conduction design [J].
Cheng, Lin ;
Liu, Jikai ;
To, Albert C. .
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2018, 58 (02) :511-535
[6]   Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor [J].
Choi, Wongun .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3029-3037
[7]   Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism [J].
Chu, Qi ;
Ouyang, Wanli ;
Li, Hongsheng ;
Wang, Xiaogang ;
Liu, Bin ;
Yu, Nenghai .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4846-4855
[8]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136