Source-Free Image-Text Matching via Uncertainty-Aware Learning

Citations: 0

Authors:

Tian, Mengxiao [1,2]

Yang, Shuo [3]

Wu, Xinxiao [1,2]

Jia, Yunde [3]

Affiliations:

[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China

[2] Shenzhen MSU BIT Univ, Guangdong Prov Lab Machine Percept & Intelligent C, Shenzhen 518172, Peoples R China

[3] Shenzhen MSU BIT Univ, Guangdong Prov Lab Machine Percept & Intelligent C, Shenzhen 518172, Peoples R China

Source:

IEEE SIGNAL PROCESSING LETTERS | 2024 / Vol. 31

Keywords:

Adaptation models; Uncertainty; Noise measurement; Data models; Training; Noise; Visualization; Measurement uncertainty; Computational modeling; Testing; Image-text matching; source-free adaptation; uncertainty-aware learning;

DOI:

10.1109/LSP.2024.3488521

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

When applying a trained image-text matching model to a new scenario, the performance may largely degrade due to domain shift, which makes it impractical in real-world applications. In this paper, we make the first attempt on adapting the image-text matching model well-trained on a labeled source domain to an unlabeled target domain in the absence of source data, namely, source-free image-text matching. This task is challenging since it has no direct access to the source data when learning to reduce the domain shift. To address this challenge, we propose a simple yet effective method that introduces uncertainty-aware learning to generate high-quality pseudo-pairs of image and text for target adaptation. Specifically, starting with using the pre-trained source model to retrieve several top-ranked image-text pairs from the target domain as pseudo-pairs, we then model uncertainty of each pseudo-pair by calculating the variance of retrieved texts (resp. images) given the paired image (resp. text) as query, and finally incorporate the uncertainty into an objective function to down-weight noisy pseudo-pairs for better training, thereby enhancing adaptation. This uncertainty-aware training approach can be generally applied on all existing models. Extensive experiments on the COCO and Flickr30K datasets demonstrate the effectiveness of the proposed method.
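
The three steps named in the abstract (retrieve top-ranked pseudo-pairs, measure the variance of the retrieved items, down-weight uncertain pairs in the objective) can be illustrated with a rough NumPy sketch. This is not the authors' implementation: the abstract does not give the exact formulas, so the per-dimension embedding variance, the exponential weighting with a hypothetical `alpha`, the hinge loss with a hypothetical `margin`, and all function names below are assumptions made for illustration only.

```python
import numpy as np


def l2_normalize(x):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def retrieval_variance(queries, gallery, k):
    """For each query, retrieve the top-k gallery items and return the mean
    per-dimension variance of their embeddings, plus the top-k indices."""
    sims = queries @ gallery.T                  # (n_queries, n_gallery)
    topk = np.argsort(-sims, axis=1)[:, :k]     # indices of the k best matches
    retrieved = gallery[topk]                   # (n_queries, k, dim)
    return retrieved.var(axis=1).mean(axis=1), topk


def pseudo_pairs_with_weights(img_emb, txt_emb, k=5, alpha=10.0):
    """Build one pseudo-pair per image and a weight that shrinks as the
    retrieval variance (the assumed uncertainty measure) grows."""
    img, txt = l2_normalize(img_emb), l2_normalize(txt_emb)
    var_i2t, topk_txt = retrieval_variance(img, txt, k)   # image as query
    var_t2i, _ = retrieval_variance(txt, img, k)          # text as query
    pair_txt = topk_txt[:, 0]                             # top-1 text per image
    uncertainty = var_i2t + var_t2i[pair_txt]
    weights = np.exp(-alpha * uncertainty)                # assumed weighting, in (0, 1]
    return pair_txt, uncertainty, weights


def weighted_hinge_loss(img_emb, txt_emb, pair_txt, weights, margin=0.2):
    """Hinge loss against the hardest negative text, averaged with the
    uncertainty-based weights so that noisy pseudo-pairs contribute less."""
    img, txt = l2_normalize(img_emb), l2_normalize(txt_emb)
    sims = img @ txt.T
    rows = np.arange(len(img))
    pos = sims[rows, pair_txt]
    masked = sims.copy()
    masked[rows, pair_txt] = -np.inf                      # exclude the pseudo-positive
    hardest_neg = masked.max(axis=1)
    per_pair = np.maximum(0.0, margin + hardest_neg - pos)
    return float((weights * per_pair).sum() / weights.sum())
```

In an actual adaptation loop the embeddings would come from the source-pretrained model applied to unlabeled target data, and the weighted loss would be minimized by gradient descent; the sketch only shows how uncertainty could enter the objective as a per-pair weight.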


Pages: 3059-3063

Page count: 5
