OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance Scene

被引:20
作者
Gao, Xu [1 ]
Jiang, Tingting [1 ]
机构
[1] Peking Univ, Cooperat Medianet Innovat Ctr, Sch EECS, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
来源
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18) | 2018年
关键词
Multiple Object Tracking; Surveillance; Scene Structure Model; Attention-Based Appearance Model; Obstacle Map;
D O I
10.1145/3240508.3240548
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With demands of the intelligent monitoring, multiple object tracking (MOT) in surveillance scene has become an essential but challenging task. Occlusion is the primary difficulty in surveillance MOT, which can be categorized into the inter-object occlusion and the obstacle occlusion. Many current studies on general MOT focus on the former occlusion, but few studies have been conducted on the latter one. In fact, there are useful prior knowledge in surveillance videos, because the scene structure is fixed. Hence, we propose two models for dealing with these two kinds of occlusions. The attention-based appearance model is proposed to solve the inter-object occlusion, and the scene structure model is proposed to solve the obstacle occlusion. We also design an obstacle map segmentation method for segmenting obstacles from the surveillance scene. Furthermore, to evaluate our method, we propose four new surveillance datasets that contain videos with obstacles. Experimental results show the effectiveness of our two models.
引用
收藏
页码:201 / 210
页数:10
相关论文
共 43 条
[11]   Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor [J].
Choi, Wongun .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3029-3037
[12]   Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism [J].
Chu, Qi ;
Ouyang, Wanli ;
Li, Hongsheng ;
Wang, Xiaogang ;
Liu, Bin ;
Yu, Nenghai .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4846-4855
[13]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[14]   The Way They Move: Tracking Multiple Targets with Similar Appearance [J].
Dicle, Caglayan ;
Camps, Octavia I. ;
Sznaier, Mario .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :2304-2311
[15]  
Geng M., 2016, ABS161105244 CORR
[16]  
Hallman S, 2015, PROC CVPR IEEE, P1732, DOI 10.1109/CVPR.2015.7298782
[17]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[18]  
Hirzer M, 2011, LECT NOTES COMPUT SC, V6688, P91, DOI 10.1007/978-3-642-21227-7_9
[19]  
Kingma D. P., P 3 INT C LEARN REPR
[20]   DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification [J].
Li, Wei ;
Zhao, Rui ;
Xiao, Tong ;
Wang, Xiaogang .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :152-159