Source-Free Image-Text Matching via Uncertainty-Aware Learning

Cited by: 0
Authors
Tian, Mengxiao [1 ,2 ]
Yang, Shuo [3 ]
Wu, Xinxiao [1 ,2 ]
Jia, Yunde [3 ]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
[2] Shenzhen MSU BIT Univ, Guangdong Prov Lab Machine Percept & Intelligent C, Shenzhen 518172, Peoples R China
[3] Shenzhen MSU BIT Univ, Guangdong Prov Lab Machine Percept & Intelligent C, Shenzhen 518172, Peoples R China
Keywords
Adaptation models; Uncertainty; Noise measurement; Data models; Training; Noise; Visualization; Measurement uncertainty; Computational modeling; Testing; Image-text matching; source-free adaptation; uncertainty-aware learning;
DOI
10.1109/LSP.2024.3488521
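Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract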
When a trained image-text matching model is applied to a new scenario, its performance may degrade substantially due to domain shift, which makes it impractical in real-world applications. In this paper, we make the first attempt at adapting an image-text matching model well-trained on a labeled source domain to an unlabeled target domain in the absence of source data, namely, source-free image-text matching. This task is challenging because there is no direct access to the source data when learning to reduce the domain shift. To address this challenge, we propose a simple yet effective method that introduces uncertainty-aware learning to generate high-quality pseudo-pairs of image and text for target adaptation. Specifically, we first use the pre-trained source model to retrieve several top-ranked image-text pairs from the target domain as pseudo-pairs. We then model the uncertainty of each pseudo-pair by calculating the variance of the retrieved texts (resp. images) given the paired image (resp. text) as the query. Finally, we incorporate the uncertainty into an objective function to down-weight noisy pseudo-pairs for better training, thereby enhancing adaptation. This uncertainty-aware training approach can be generally applied to all existing models. Extensive experiments on the COCO and Flickr30K datasets demonstrate the effectiveness of the proposed method.
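The three steps in the abstract (retrieve top-ranked pseudo-pairs, measure the variance of the retrieved items, down-weight uncertain pairs in the loss) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract gives no formulas, so the cosine-similarity retrieval, the centroid-based variance, the `exp(-uncertainty)` weighting, and every function name here are assumptions. Only the image-to-text direction is shown; the text-to-image direction would be symmetric.

```python
import math

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def top_k(query, candidates, k):
    """Indices of the k candidates most similar to the query."""
    order = sorted(range(len(candidates)),
                   key=lambda j: cosine(query, candidates[j]),
                   reverse=True)
    return order[:k]

def embedding_variance(vectors):
    """Mean squared distance of the vectors to their centroid (assumed uncertainty measure)."""
    dim = len(vectors[0])
    centroid = [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]
    return sum(
        sum((v[d] - centroid[d]) ** 2 for d in range(dim)) for v in vectors
    ) / len(vectors)

def pseudo_pairs_with_weights(image_embs, text_embs, k=2):
    """For each image, pair it with its top-1 text and weight the pair by uncertainty.

    Returns a list of (image_index, text_index, uncertainty, weight).
    """
    pairs = []
    for i, img in enumerate(image_embs):
        retrieved = top_k(img, text_embs, k)
        uncertainty = embedding_variance([text_embs[j] for j in retrieved])
        weight = math.exp(-uncertainty)  # assumed weighting: higher variance, lower weight
        pairs.append((i, retrieved[0], uncertainty, weight))
    return pairs

def weighted_loss(per_pair_losses, weights):
    """Weighted average of per-pair matching losses; noisy pairs contribute less."""
    return sum(w * l for w, l in zip(weights, per_pair_losses)) / sum(weights)

# Toy target-domain embeddings (hypothetical values, as if from the source model).
image_embs = [[1.0, 0.0], [0.0, 1.0]]
text_embs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]]
pairs = pseudo_pairs_with_weights(image_embs, text_embs, k=2)
```

In this toy data the first image retrieves two nearly identical texts, so its pseudo-pair gets low variance and a weight close to 1, while the second image retrieves two dissimilar texts and its pseudo-pair is down-weighted.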
Pages: 3059-3063
Page count: 5
Related papers
50 in total
  • [1] Cross-Modal Remote Sensing Image-Text Retrieval via Context and Uncertainty-Aware Prompt
    Wang, Yijing
    Tang, Xu
    Ma, Jingjing
    Zhang, Xiangrong
    Liu, Fang
    Jiao, Licheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [2] UPL-SFDA: Uncertainty-Aware Pseudo Label Guided Source-Free Domain Adaptation for Medical Image Segmentation
    Wu, Jianghao
    Wang, Guotai
    Gu, Ran
    Lu, Tao
    Chen, Yinan
    Zhu, Wentao
    Vercauteren, Tom
    Ourselin, Sebastien
    Zhang, Shaoting
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3932 - 3943
  • [3] Reference-Aware Adaptive Network for Image-Text Matching
    Xiong, Guoxin
    Meng, Meng
    Zhang, Tianzhu
    Zhang, Dongming
    Zhang, Yongdong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9678 - 9691
  • [4] A NEIGHBOR-AWARE APPROACH FOR IMAGE-TEXT MATCHING
    Liu, Chunxiao
    Mao, Zhendong
    Zang, Wenyu
    Wang, Bin
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3970 - 3974
  • [5] Regularizing Visual Semantic Embedding With Contrastive Learning for Image-Text Matching
    Liu, Yang
    Liu, Hong
    Wang, Huaqiu
    Liu, Mengyuan
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1332 - 1336
  • [6] Enhanced Semantic Similarity Learning Framework for Image-Text Matching
    Zhang, Kun
    Hu, Bo
    Zhang, Huatian
    Li, Zhe
    Mao, Zhendong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2973 - 2988
  • [7] SELF-SUPERVISED LEARNING FOR SENTIMENT ANALYSIS VIA IMAGE-TEXT MATCHING
    Zhu, Haidong
    Zheng, Zhaoheng
    Soleymani, Mohammad
    Nevatia, Ram
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1710 - 1714
  • [8] Uncertainty-Induced Transferability Representation for Source-Free Unsupervised Domain Adaptation
    Pei, Jiangbo
    Jiang, Zhuqing
    Men, Aidong
    Chen, Liang
    Liu, Yang
    Chen, Qingchao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2033 - 2048
  • [9] An end-to-end image-text matching approach considering semantic uncertainty
    Tuerhong, Gulanbaier
    Dai, Xin
    Tian, Liwei
    Wushouer, Mairidan
    NEUROCOMPUTING, 2024, 607
  • [10] Uncertainty-Aware Reinforcement Learning for Portfolio Optimization
    Enkhsaikhan, Bayaraa
    Jo, Ohyun
    IEEE ACCESS, 2024, 12 : 166553 - 166563