PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data

被引:127
作者
Tang, Zheng [1 ,2 ]
Naphade, Milind [1 ]
Birchfield, Stan [1 ]
Tremblay, Jonathan [1 ]
Hodge, William [1 ]
Kumar, Ratnesh [1 ]
Wang, Shuo [1 ]
Yang, Xiaodong [1 ]
机构
[1] NVIDIA, Santa Clara, CA 95051 USA
[2] Amazon, Seattle, WA 98108 USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In comparison with person re-identification (ReID), which has been widely studied in the research community, vehicle ReID has received less attention. Vehicle ReID is challenging due to 1) high intra-class variability (caused by the dependency of shape and appearance on viewpoint), and 2) small inter-class variability (caused by the similarity in shape and appearance between vehicles produced by different manufacturers). To address these challenges, we propose a Pose-Aware Multi-Task Re-Identification (PAMTRI) framework. This approach includes two innovations compared with previous methods. First, it overcomes viewpointdependency by explicitly reasoning about vehicle pose and shape via keypoints, heatmaps and segments from pose estimation. Second, it jointly classifies semantic vehicle attributes (colors and types) while performing ReID, through multi-task learning with the embedded pose representations. Since manually labeling images with detailed pose and attribute information is prohibitive, we create a large-scale highly randomized synthetic dataset with automatically annotated vehicle attributes for training. Extensive experiments validate the effectiveness of each proposed component, showing that PAMTRI achieves significant improvement over state-of-the-art on two mainstream vehicle ReID benchmarks: VeRi and CityFlow-ReID.
引用
收藏
页码:211 / 220
页数:10
相关论文
共 46 条
[1]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[2]  
[Anonymous], 2016, CoRR abs/1512.00567, DOI DOI 10.1109/CVPR.2016.308
[3]  
Ansari JA, 2018, IEEE INT C INT ROBOT, P8404, DOI 10.1109/IROS.2018.8593698
[4]   Group-Sensitive Triplet Embedding for Vehicle Reidentification [J].
Bai, Yan ;
Lou, Yihang ;
Gao, Feng ;
Wang, Shiqi ;
Wu, Yuwei ;
Duan, Ling-Yu .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) :2385-2399
[5]  
Chebotar Yevgen., 2018, Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]  
Hermans Alexander, 2017, ARXIV
[8]  
Huang G, 2017, PROC CVPR IEEE, P4700, DOI [DOI 10.1109/CVPR.2017.243, 10.1109/CVPR.2017.243]
[9]  
James Stephen, 2017, PMLR
[10]  
Kingma DP, 2014, ARXIV