Vehicle Re-Identification by Deep Hidden Multi-View Inference

Cited by: 107
Authors
Zhou, Yi [1]
Liu, Li [1]
Shao, Ling [1]
Affiliations
[1] Inception Institute of Artificial Intelligence, Abu Dhabi, United Arab Emirates
Keywords
Vehicle re-identification; multi-view; spatially concatenated ConvNet; CNN-LSTM bi-directional loop; person reidentification; recognition; road
DOI
10.1109/TIP.2018.2819820
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Vehicle re-identification (re-ID) has received far less attention in the computer vision community than the prevalent person re-ID. Possible reasons for this slow progress are the lack of appropriate research data and the special 3D structure of a vehicle. Previous works have generally focused on specific views (e.g., front), but these methods are less effective in realistic scenarios, where vehicles usually appear to cameras in arbitrary views. In this paper, we focus on the uncertainty of vehicle viewpoint in re-ID and propose two end-to-end deep architectures: the Spatially Concatenated ConvNet and the convolutional neural network (CNN)-LSTM bi-directional loop. Our models exploit the advantages of the CNN and long short-term memory (LSTM) to learn transformations across different vehicle viewpoints. Thus, a multi-view vehicle representation containing information from all viewpoints can be inferred from a single input view and then used to learn a distance measure. To verify our models, we also introduce a Toy Car RE-ID data set with images of 200 vehicles from multiple viewpoints. We evaluate our proposed methods on the Toy Car RE-ID data set and the public Multi-View Car, VehicleID, and VeRi data sets. Experimental results show that our models achieve consistent improvements over state-of-the-art vehicle re-ID approaches.
Pages: 3275-3287
Number of pages: 13