Two-Stream Multirate Recurrent Neural Network for Video-Based Pedestrian Reidentification

被引：56

作者：

Zeng, Zhiqiang ^{[1
]}

Li, Zhihui ^{[2
]}

Cheng, De ^{[3
]}

Zhang, Huaxiang ^{[4
]}

Zhan, Kun ^{[5
]}

Yang, Yi ^{[6
,7
]}

机构：

[1] Xiamen Univ Technol, Coll Comp & Informat Engn, Xiamen 361005, Peoples R China

[2] Beijing Etrol Technol Co Ltd, Beijing 100095, Peoples R China

[3] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710000, Shaanxi, Peoples R China

[4] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Shandong, Peoples R China

[5] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Gansu, Peoples R China

[6] Univ Technol Sydney, Ctr Artificial Intelligence, Ultimo, NSW 2007, Australia

[7] Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing 430072, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2018年 / 14卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Person reidentification; recurrent neural networks; video surveillance; PERSON REIDENTIFICATION;

D O I：

10.1109/TII.2017.2767557

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video-based pedestrian reidentification is an emerging task in video surveillance and is closely related to several real-world applications. Its goal is to match pedestrians across multiple nonoverlapping network cameras. Despite the recent effort, the performance of pedestrian reidentification needs further improvement. Hence, we propose a novel two-stream multirate recurrent neural network for video-based pedestrian reidentification with two inherent advantages: First, capturing the static spatial and temporal information; Second, Author: Figure II is not cited in the text. Please cite it at the appropriate place. dealing with motion speed variance. Given video sequences of pedestrians, we start with extracting spatial and motion features using two different deep neural networks. Then, we explore the feature correlation which results in a regularized fusion network integrating the two aforementioned networks. Considering that pedestrians, sometimes even the same pedestrian, move in different speeds across different camera views, we extend our approach by feeding the two networks into a multirate recurrent network to exploit the temporal correlations. Extensive experiments have been conducted on two real-world video-based pedestrian reidentification benchmarks: iLIDS-VID and PRID 2011 datasets. The experimental results confirm the efficacy of the proposed method. Our code will be released upon acceptance.

引用

页码：3179 / 3186

页数：8

共 58 条

[1] Corner-Boundary Processor Allocation for 3D Mesh-Connected Multicomputers
Ababneh, Ismail
Bani-Mohammad, Saad
Al Smadi, Motasem
[J]. INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2015, 5 (01) : 1 - 13
[2] Automated wireless video surveillance: an evaluation framework
Alsmirat, Mohammad A.
Jararweh, Yaser
Obaidat, Islam
Gupta, Brij B.
[J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2017, 13 (03) : 527 - 546
[3] [Anonymous], 2012, Europe Conference Computer Vision on Workshops and Demonstrations
[4] [Anonymous], 2015, INT C LEAR REPR
[5] Bar-Hillel AB, 2005, J MACH LEARN RES, V6, P937
[6] Bi-Level Semantic Representation Analysis for Multimedia Event Detection
Chang, Xiaojun
Ma, Zhigang
Yang, Yi
Zeng, Zhiqiang
Hauptmann, Alexander G.
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1180 - 1197
[7] Semantic Pooling for Complex Event Analysis in Untrimmed Videos
Chang, Xiaojun
Yu, Yao-Liang
Yang, Yi
Xing, Eric P.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) : 1617 - 1632
[8] Relevance Metric Learning for Person Re-Identification by Exploiting Listwise Similarities
Chen, Jiaxin
Zhang, Zhaoxiang
Wang, Yunhong
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 4741 - 4755
[9] Deep Ranking for Person Re-Identification via Joint Representation Learning
Chen, Shi-Zhe
Guo, Chun-Chao
Lai, Jian-Huang
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) : 2353 - 2367
[10] Chen YC, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P3402

← 1 2 3 4 5 6 →