Transformer With Regularized Dual Modal Meta Metric Learning for Attribute-Image Person Re-Identification

被引:0
作者
Xu, Xianri [1 ]
Xu, Rongxian [2 ]
机构
[1] Fujian Business Univ, Coll Informat Engn, Fuzhou 350012, Peoples R China
[2] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Transformers; Measurement; Identification of persons; Training; Pedestrians; Computer vision; Visualization; Encoding; Bonding; Transformer; meta learning; cross-model; metric learning; person retrieval; NETWORK;
D O I
10.1109/ACCESS.2024.3511034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attribute-image person re-identification (AIPR) is a meaningful and challenging task to retrieve images based on attribute descriptions. In this paper, we propose a regularized dual modal meta metric learning ((RDML)-L-3) method for AIPR, which employs meta-learning training methods to enhance the transformer's capacity to acquire latent knowledge. During training, data are initially divided into a single-modal support set with images and a dual-modal query set containing both attributes and images. The (RDML)-L-3 method introduces an attribute-image transformer (AIT) as a novel feature extraction backbone, extending the visual transformer concept. Utilizing the concept of hard sample mining, the method designs attribute-image cross-modal meta metrics and image-image intra-modal meta metrics. The triple loss function based on meta-metrics is then applied to converge the same category samples and diverge different categories, thereby enhancing cross-modal and intramodal discrimination abilities. Finally, a regularization term is used to aggregate samples of different modalities in the query set to prevent overfitting, ensuring that (RDML)-L-3 maintains the model's generalization ability while aligning the two modalities and identifying unseen classes. Experimental results on the PETA and Market-1501 attribute datasets demonstrate the superiority of the (RDML)-L-3 method, achieving mean average precision (mAP) scores of 36.7% on the Market-1501 Attributes dataset and 60.6% on the PETA dataset.
引用
收藏
页码:183344 / 183353
页数:10
相关论文
共 58 条
  • [11] Finn C, 2017, PR MACH LEARN RES, V70
  • [12] FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
    Gao, Dehong
    Jin, Linbo
    Chen, Ben
    Qiu, Minghui
    Li, Peng
    Wei, Yi
    Hu, Yi
    Wang, Hao
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2251 - 2260
  • [13] A new graph-preserving unsupervised feature selection embedding LLE with low-rank constraint and feature-level representation
    Han, Xiaohong
    Chai, Haishui
    Liu, Ping
    Li, Dengao
    Wang, Li
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (04) : 2875 - 2903
  • [14] Dual-alignment Feature Embedding for Cross-modality Person Re-identification
    Hao, Yi
    Wang, Nannan
    Gao, Xinbo
    Li, Jie
    Wang, Xiaoyu
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 57 - 65
  • [15] TransReID: Transformer-based Object Re-Identification
    He, Shuting
    Luo, Hao
    Wang, Pichao
    Wang, Fan
    Li, Hao
    Jiang, Wei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14993 - 15002
  • [16] Nguyen HV, 2011, LECT NOTES COMPUT SC, V6493, P709
  • [17] Learning Cross-Modal Retrieval with Noisy Labels
    Hu, Peng
    Peng, Xi
    Zhu, Hongyuan
    Zhen, Liangli
    Lin, Jie
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5399 - 5409
  • [18] Feedback Adversarial Learning: Spatial Feedback for Improving Generative Adversarial Networks
    Huh, Minyoung
    Sun, Shao-Hua
    Zhang, Ning
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1476 - 1485
  • [19] Iodice S., 2020, P BRIT MACH VIS C, P1
  • [20] RETRACTED: Automatic alert generation in a surveillance systems for smart city environment using deep learning algorithm (Retracted Article)
    Janakiramaiah, B.
    Kalyani, G.
    Jayalakshmi, A.
    [J]. EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 635 - 642