Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification

Cited by: 18
Authors
Li, Ming [1]
Liu, Jun [2]
Zheng, Ce [3]
Huang, Xinming [4]
Zhang, Ziming [4]
Affiliations
[1] Natl Univ Singapore, Inst Data Sci, Singapore 119077, Singapore
[2] Singapore Univ Technol & Design, Informat Syst Technol & Design, Singapore 487372, Singapore
[3] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
[4] Worcester Polytech Inst, Dept Elect & Comp Engn, Worcester, MA 01609 USA
Keywords
Transformers; Correlation; Feature extraction; Visualization; Training; Benchmark testing; Task analysis; Correlation exploiting; multi-view learning; transformer; vehicle re-identification
DOI
10.1109/TMM.2021.3134839
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Image-based vehicle re-identification (ReID) has witnessed much progress in recent years. However, most existing works struggle to extract robust yet discriminative features from a single image to represent one vehicle instance. We argue that images taken from distinct viewpoints, e.g., front and back, have significantly different appearances and patterns for recognition. To identify each vehicle, these models have to capture consistent "ID codes" from totally different views, which makes learning difficult. Additionally, we claim that part-level correspondences among views, i.e., various vehicle parts observed in the same image and the same part visible from different viewpoints, contribute to instance-level feature learning as well. Motivated by these observations, we propose to extract comprehensive vehicle instance representations from multiple views by modelling part-wise correlations. To this end, we present an efficient transformer-based framework that exploits both inner- and inter-view correlations for vehicle ReID. Specifically, we first adopt a convnet encoder to condense each view into a series of patch embeddings. Then our efficient transformer, which uses a distillation token and a noise token in addition to a regular classification token, is constructed to make these patch embeddings interact with each other regardless of whether they come from the same view or from different views. We conduct extensive experiments on widely used vehicle ReID benchmarks, and our approach achieves state-of-the-art performance, demonstrating its effectiveness.
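The token layout described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the efficient attention variant, the encoder, or any dimensions, so the code below assumes plain scaled dot-product self-attention with identity query/key/value projections, random vectors standing in for convnet patch embeddings, and hypothetical names (`build_sequence`, `self_attention`, `DIM`). It only shows that once patches from all views share one sequence with the three special tokens, every patch attends to patches from its own view and from other views.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def build_sequence(views, special_tokens):
    """Concatenate special tokens with the patch embeddings of every view.

    Returns the token sequence and a parallel list of view ids
    (-1 marks a special token). Hypothetical helper, for illustration only.
    """
    sequence = list(special_tokens)
    view_ids = [-1] * len(special_tokens)
    for view_index, patches in enumerate(views):
        sequence.extend(patches)
        view_ids.extend([view_index] * len(patches))
    return sequence, view_ids


def self_attention(tokens):
    """Plain scaled dot-product self-attention (assumption: identity Q/K/V)."""
    dim = len(tokens[0])
    scale = 1.0 / math.sqrt(dim)
    weights = []
    for query in tokens:
        scores = [scale * sum(a * b for a, b in zip(query, key)) for key in tokens]
        weights.append(softmax(scores))
    outputs = [
        [sum(w * value[d] for w, value in zip(row, tokens)) for d in range(dim)]
        for row in weights
    ]
    return outputs, weights


rng = random.Random(0)
DIM = 8  # assumed embedding size, not taken from the paper


def random_vec():
    return [rng.gauss(0.0, 1.0) for _ in range(DIM)]


# Two views (e.g. front and back), four stand-in patch embeddings each.
views = [[random_vec() for _ in range(4)] for _ in range(2)]
# Classification, distillation, and noise tokens (random stand-ins here).
special_tokens = [random_vec() for _ in range(3)]

sequence, view_ids = build_sequence(views, special_tokens)
outputs, weights = self_attention(sequence)

# Attention mass that the first patch of view 0 places on each group.
first_patch = weights[3]
inner_view_mass = sum(w for w, v in zip(first_patch, view_ids) if v == 0)
inter_view_mass = sum(w for w, v in zip(first_patch, view_ids) if v == 1)
print("inner-view attention mass:", round(inner_view_mass, 3))
print("inter-view attention mass:", round(inter_view_mass, 3))
```

Because all patches sit in a single sequence, no separate cross-view module is needed in this sketch; inner-view and inter-view correlations come from the same attention matrix.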
Pages: 919-929
Number of pages: 11