Deep asymmetric video-based person re-identification

被引:24
作者
Meng, Jingke [1 ,4 ]
Wu, Ancong [3 ]
Zheng, Wei-Shi [1 ,2 ,4 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou, Guangdong, Peoples R China
[4] Sun Yat Sen Univ, Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Guangdong, Peoples R China
关键词
Person re-identification; Visual surveillance; REPRESENTATION;
D O I
10.1016/j.patcog.2019.04.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the problem of video-based person re-identification (re-id) which matches people's video clips across non-overlapping camera views at different time. A key challenge of video based person re-id is a person's appearance and motion would always display differently and take effects unequally at disjoint camera views due to the change of lighting, viewpoint, background and etc., which we call the "view-bias" problem. However, many previous video-based person re-id approaches have not quantified the importance of different types of features at different camera views, so that the two types of important features (i.e. appearance and motion features) do not collaborate effectively and thus the "view-bias" problem remains unsolved. To address this problem, we propose a Deep Asymmetric Metric learning (DAM) method that embeds a proposed asymmetric distance metric learning loss into a two-stream deep neural network for jointly learning view-specific and feature-specific transformations to overcome the "view-bias" problem in video-based person re-id. As learning these view-specific transformations become expensive when there are large amount of camera views, a clustering-based DAM method is developed to make our DAM scalable. Extensive evaluations have been carried out on three public datasets: PRID2011, iLIDS-VID and MARS. Our results verify that learning view-specific and feature-specific transformations are beneficial, and the presented DAM has empirically performed more effectively overall for video-based person re-id on challenging benchmarks. (C) 2019 Published by Elsevier Ltd.
引用
收藏
页码:430 / 441
页数:12
相关论文
共 93 条
[1]  
Ahmed E, 2015, PROC CVPR IEEE, P3908, DOI 10.1109/CVPR.2015.7299016
[2]   Person Re-Identification by Robust Canonical Correlation Analysis [J].
An, Le ;
Yang, Songfan ;
Bhanu, Bir .
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (08) :1103-1107
[3]  
[Anonymous], AMSTER658
[4]  
[Anonymous], P IEEE INT C COMP VI
[5]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[6]  
[Anonymous], 2019, P ASS ADV ART INT
[7]  
[Anonymous], P IEEE INT C COMP VI
[8]  
[Anonymous], ARXIV170310717
[9]  
[Anonymous], ARXIV160601609
[10]  
[Anonymous], 2017, IEEE T IMAGE PROCESS