Multiple pedestrian tracking under first-person perspective using deep neural network and social force optimization

被引:12
作者
Xue, Yongjie [1 ]
Ju, Zhiyong [1 ]
机构
[1] Univ Shanghai Sci & technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
来源
OPTIK | 2021年 / 240卷 / 240期
基金
中国国家自然科学基金;
关键词
Detection; Multiple pedestrian tracking; Social force model; Deep learning; FASTER R-CNN; FEATURES; MODEL;
D O I
10.1016/j.ijleo.2021.166981
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Multiple pedestrian tracking in the first-person perspective is a challenging problem, obstacles of which are mainly caused by camera moving, frequent occlusions, and collision avoidance. To solve the mentioned issues, we proposed a novel deep learning-based approach. Firstly, a dense connection and attention based YOLO (DCA-YOLO) is proposed for ameliorating the detection performance. Then, the detection results are sent to a wide residual network for feature extraction. We use the Kuhn-Munkres algorithm to construct a similarity matrix and find the best match of two detection boxes. To tackle the frequent occlusion and ID-switch issues caused by collision avoidances or grouping behavior, we introduce a social force model into the proposed network to optimize the tracking results. The experimental results on widely used challenging MOT2015 and MOT2016 benchmarks demonstrate the effectiveness of our proposed algorithm.
引用
收藏
页数:16
相关论文
共 38 条
  • [1] Hierarchical modeling for first-person vision activity recognition
    Abebe, Girmaw
    Cavallaro, Andrea
    [J]. NEUROCOMPUTING, 2017, 267 : 362 - 377
  • [2] Social LSTM: Human Trajectory Prediction in Crowded Spaces
    Alahi, Alexandre
    Goel, Kratarth
    Ramanathan, Vignesh
    Robicquet, Alexandre
    Li Fei-Fei
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 961 - 971
  • [3] Non-texture image inpainting using histogram of oriented gradients
    Alilou, Vahid K.
    Yaghmaee, Farzin
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 43 - 53
  • [4] Tracking without bells and whistles
    Bergmann, Philipp
    Meinhardt, Tim
    Leal-Taixe, Laura
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 941 - 951
  • [5] Bose Biswajit., 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition, P1
  • [6] Aggregate Tracklet Appearance Features for Multi-Object Tracking
    Chen, Long
    Ai, Haizhou
    Chen, Rui
    Zhuang, Zijie
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (11) : 1613 - 1617
  • [7] Improved SIFT image registration algorithm on characteristic statistical distributions and consistency constraint
    Chen, Yong
    Shang, Lei
    [J]. OPTIK, 2016, 127 (02): : 900 - 911
  • [8] Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor
    Choi, Wongun
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3029 - 3037
  • [9] Online Multi-Object Tracking with Instance-Aware Tracker and Dynamic Model Refreshment
    Chu, Peng
    Fan, Heng
    Tan, Chiu C.
    Ling, Haibin
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 161 - 170
  • [10] Annular Sector Model for tracking multiple indistinguishable and deformable objects in occlusions
    Guo, Biao
    Luo, Guibo
    Weng, Zhenyu
    Zhu, Yuesheng
    [J]. NEUROCOMPUTING, 2019, 333 : 419 - 428