Joint Image and Feature Levels Disentanglement for Generalizable Vehicle Re-identification

被引:9
作者
Kuang, Zhenyu [1 ]
He, Chuchu [1 ]
Huang, Yue [1 ]
Ding, Xinghao [1 ]
Li, Huafeng [2 ]
机构
[1] Xiamen Univ, Sch Informat, Fujian Key Lab Sensing & Comp Smart City, Xiamen 361005, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming 650500, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Feature extraction; Image reconstruction; Cameras; Training; Neural networks; Interference; Vehicle re-identification; variational autoencoder; domain generalization; representation disentanglement; NETWORK;
D O I
10.1109/TITS.2023.3314213
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Domain generalization (DG), which doesn't require any data from target domains during training, is more challenging but practical than unsupervised domain adaptation (UDA). Since different vehicles of the same type have a similar appearance, neural networks always rely on a small amount of useful information to distinguish them, meaning that is more significant to remove ID-unrelated information for vehicle re-Identification (re-ID). Therefore, it is the key to eliminating the interference of a large amount of redundant information for the generalizable vehicle re-ID method. To address this unique challenge, we propose a novel disentanglement learning method that encourages variational autoencoder (VAE) network to reduce ID-unrelated features of vehicles by minimizing image reconstruction errors and providing sufficient representation to vehicle labels. To capture the intrinsic characteristics associated with the DG task, our core idea is to build the identity information streaming framework to separate ID-related and ID-unrelated information at the image and feature levels. In contrast with the general decoupling methods, our method leverages the decoupling of joint image and feature levels to extract more generalizable features. Furthermore, we present a brand-new vehicle dataset of truck types named "Optimus Prime (Opri)", which includes multiple images of each truck captured by cameras at different high-speed toll gates. Experimental results on public datasets demonstrate that our method can achieve promising results and outperform several state-of-the-art approaches. Our codes and models are available at JIFD.
引用
收藏
页码:15259 / 15273
页数:15
相关论文
共 109 条
[1]  
Ahuja K, 2021, ADV NEUR IN, V34
[2]  
Alemi A. A., 2016, ARXIV
[3]  
[Anonymous], 2009, BMVC
[4]  
[Anonymous], 2012, ACCV, DOI 10.1007/978-3-642-37331-23
[5]  
Arjovsky Martin, 2019, ARXIV
[6]   Disentangled Feature Learning Network and a Comprehensive Benchmark for Vehicle Re-Identification [J].
Bai, Yan ;
Liu, Jun ;
Lou, Yihang ;
Wang, Ce ;
Duan, Ling-yu .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :6854-6871
[7]   Person30K: A Dual-Meta Generalization Network for Person Re-Identification [J].
Bai, Yan ;
Jiao, Jile ;
Ce, Wang ;
Liu, Jun ;
Lou, Yihang ;
Feng, Xuetao ;
Duan, Ling-Yu .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2123-2132
[8]  
Balaji Y, 2018, ADV NEUR IN, V31
[9]  
Belghazi MI, 2018, PR MACH LEARN RES, V80
[10]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828