Aggregate Tracklet Appearance Features for Multi-Object Tracking

被引：42

作者：

Chen, Long ^{[1
]}

Ai, Haizhou ^{[1
]}

Chen, Rui ^{[1
]}

Zhuang, Zijie ^{[1
]}

机构：

[1] Tsinghua Univ, Comp Sci & Technol Dept, Beijing 100084, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2019年 / 26卷 / 11期

关键词：

Target tracking; Trajectory; Feature extraction; Aggregates; Training; Benchmark testing; Multi-object tracking; tracklet association; appearance model; spatial-temporal attention; ASSOCIATION;

D O I：

10.1109/LSP.2019.2940922

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Multi-object tracking (MOT) has wide applications in the fields of video analysis and signal processing. A major challenge in MOT is how to associate the noisy detections into long and continuous trajectories. In this letter, we address the association problem at the tracklet-level, and mainly focus on the appearance representation designed for tracklets. A multitask convolutional neural network is proposed to learn the discriminative features and spatial-temporal attentions jointly. In particular, we decompose an object in a static image with spatial attentions, and then aggregate multiple features in a tracklet based on the temporal attentions. Appearance misalignment that caused by occlusion and inaccurate bounding is then mitigated by multi-feature aggregation. Experimental results on two challenging MOT benchmarks have demonstrated the effectiveness of the proposed method and shown significant improvement on the quality of tracking identities.

引用

页码：1613 / 1617

页数：5

共 37 条

[1]

[Anonymous], 2018, ARXIV181107258

[2] Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics [J].

Bernardin, Keni ;

Stiefelhagen, Rainer .

EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)

[3] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[4]

Chen L, 2017, IEEE IMAGE PROC, P645, DOI 10.1109/ICIP.2017.8296360

[5] Concurrent lattice infill with feature evolution optimization for additive manufactured heat conduction design [J].

Cheng, Lin ;

Liu, Jikai ;

To, Albert C. .

STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2018, 58 (02) :511-535

[6] Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor [J].

Choi, Wongun .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3029-3037

[7] Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism [J].

Chu, Qi ;

Ouyang, Wanli ;

Li, Hongsheng ;

Wang, Xiaogang ;

Liu, Bin ;

Yu, Nenghai .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4846-4855

[8]

Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036

[9] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[10] The PASCAL Visual Object Classes Challenge: A Retrospective [J].

Everingham, Mark ;

Eslami, S. M. Ali ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136

← 1 2 3 4 →