From Multi-Source Virtual to Real: Effective Virtual Data Search for Vehicle Re-Identification

被引:2
作者
Wan, Zhijing [1 ]
Xu, Xin [1 ]
Wang, Zheng [2 ]
Wang, Zhixiang [3 ]
Hu, Ruimin [2 ]
机构
[1] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430065, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430079, Peoples R China
[3] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138654, Japan
基金
中国国家自然科学基金;
关键词
Training; Data models; Redundancy; Solid modeling; Pipelines; Three-dimensional displays; Engines; Virtual-to-real vehicle re-identification; data redundancy; data search;
D O I
10.1109/TITS.2023.3329118
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Without tedious and time-consuming labeling processes, virtual datasets have recently shown their superiority for vehicle re-identification (re-ID). Existing virtual to real vehicle re-ID methods employ only a single virtual dataset for model training, while datasets from different generative sources are not jointly exploited. Multiple source virtual datasets contain more data diversity that can boost model performance. We thus propose a multi-source virtual to real vehicle re-ID pipeline, where multiple source virtual datasets are used during training. However, the multi-source virtual dataset suffers from more data redundancy than the single virtual dataset, which can affect the training efficiency. Intuitively, it can be mitigated by virtual data search. Unlike a single virtual dataset, a performance gap exists between multiple source virtual datasets, indicating their different contributions to model learning. Accordingly, we propose to split the multi-source virtual dataset into the main training set and the auxiliary training set, and then design the sampling strategy separately. For the main training set, the Consistent Attribute Distribution-FEature distance Trade-off (CAD-FET) strategy is designed to search for representative data. For the auxiliary training set, a cluster-based sampling strategy is further proposed to search for the most diverse subset. Besides, a simple yet effective two-stage training strategy is proposed to utilize these subsets reasonably. Extensive virtual-to-real vehicle re-ID experiments show that our data sampling method can reduce the volume of the multi-source virtual dataset by around 77%/96% and boost the model performance when tested on the VeRi776/VehicleID.
引用
收藏
页码:3433 / 3444
页数:12
相关论文
共 48 条
[1]  
Chen WB, 2021, Int C Auto Cont Robo, P1, DOI [10.1109/CACRE52464.2021.9501327, 10.1109/CLEO/Europe-EQEC52157.2021.9542206]
[2]  
Chen Wuyang, 2020, P MACHINE LEARNING R, V119
[3]   Vehicle Re-Identification Using Distance-Based Global and Partial Multi-Regional Feature Learning [J].
Chen, Xu ;
Sui, Haigang ;
Fang, Jian ;
Feng, Wenqing ;
Zhou, Mingting .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) :1276-1286
[4]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5]  
Ding YF, 2015, PR MACH LEARN RES, V37, P579
[6]   Low-Resolution Face Recognition in the Wild via Selective Knowledge Distillation [J].
Ge, Shiming ;
Zhao, Shengwei ;
Li, Chenyu ;
Li, Jia .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) :2051-2062
[7]  
Gretton A, 2012, J MACH LEARN RES, V13, P723
[8]  
Guo HY, 2018, AAAI CONF ARTIF INTE, P6853
[9]   Multi-Domain Learning and Identity Mining for Vehicle Re-Identification [J].
He, Shuting ;
Luo, Hao ;
Chen, Weihua ;
Zhang, Miao ;
Zhang, Yuqi ;
Wang, Fan ;
Li, Hao ;
Jiang, Wei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :2485-2493
[10]  
Heusel M, 2017, ADV NEUR IN, V30