Two-Stream Multirate Recurrent Neural Network for Video-Based Pedestrian Reidentification

Cited by: 56
Authors
Zeng, Zhiqiang [1]
Li, Zhihui [2]
Cheng, De [3]
Zhang, Huaxiang [4]
Zhan, Kun [5]
Yang, Yi [6, 7]
Affiliations
[1] Xiamen Univ Technol, Coll Comp & Informat Engn, Xiamen 361005, Peoples R China
[2] Beijing Etrol Technol Co Ltd, Beijing 100095, Peoples R China
[3] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710000, Shaanxi, Peoples R China
[4] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Shandong, Peoples R China
[5] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Gansu, Peoples R China
[6] Univ Technol Sydney, Ctr Artificial Intelligence, Ultimo, NSW 2007, Australia
[7] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 430072, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Person reidentification; recurrent neural networks; video surveillance
DOI
10.1109/TII.2017.2767557
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Video-based pedestrian reidentification is an emerging task in video surveillance and is closely related to several real-world applications. Its goal is to match pedestrians across multiple nonoverlapping network cameras. Despite recent efforts, the performance of pedestrian reidentification needs further improvement. Hence, we propose a novel two-stream multirate recurrent neural network for video-based pedestrian reidentification with two inherent advantages: first, it captures both static spatial and temporal information; second, it deals with motion speed variance. Given video sequences of pedestrians, we start by extracting spatial and motion features using two different deep neural networks. Then, we explore the feature correlation, which results in a regularized fusion network integrating the two aforementioned networks. Considering that pedestrians, sometimes even the same pedestrian, move at different speeds across different camera views, we extend our approach by feeding the two networks into a multirate recurrent network to exploit the temporal correlations. Extensive experiments have been conducted on two real-world video-based pedestrian reidentification benchmarks: the iLIDS-VID and PRID 2011 datasets. The experimental results confirm the efficacy of the proposed method. Our code will be released upon acceptance.
Pages: 3179-3186
Number of pages: 8