Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations

被引:0
|
作者
Yanagi, Rintaro [1 ]
Togo, Ren [2 ]
Ogawa, Takahiro [3 ]
Haseyama, Miki [3 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Hokkaido Univ, Educ & Res Ctr Math & Data Sci, Sapporo, Hokkaido 0600812, Japan
[3] Hokkaido Univ, Fac Informat Sci & Technol, Div Media & Network Technol, Sapporo, Hokkaido 0600814, Japan
关键词
cross-modal retrieval; text-to-image generative adversarial network; style transfer; domain adaptation;
D O I
10.1587/transfun.2020IMP0011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various cross-modal retrieval methods that can retrieve images related to a query sentence without text annotations have been proposed. Although a high level of retrieval performance is achieved by these methods, they have been developed for a single domain retrieval setting. When retrieval candidate images come from various domains, the retrieval performance of these methods might be decreased. To deal with this problem, we propose a new domain adaptive cross-modal retrieval method. By translating a modality and domains of a query and candidate images, our method can retrieve desired images accurately in a different domain retrieval setting. Experimental results for clipart and painting datasets showed that the proposed method has better retrieval performance than that of other conventional and state-of-the-art methods.
引用
收藏
页码:866 / 875
页数:10
相关论文
共 50 条
  • [1] Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation
    Zhao, Wentian
    Wu, Xinxiao
    Luo, Jiebo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1180 - 1192
  • [2] Multi-level Alignment Network for Domain Adaptive Cross-modal Retrieval
    Dong, Jianfeng
    Long, Zhongzi
    Mao, Xiaofeng
    Lin, Changting
    He, Yuan
    Ji, Shouling
    NEUROCOMPUTING, 2021, 440 : 207 - 219
  • [3] Domain Invariant Subspace Learning for Cross-Modal Retrieval
    Liu, Chenlu
    Xu, Xing
    Yang, Yang
    Lu, Huimin
    Shen, Fumin
    Ji, Yanli
    MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 94 - 105
  • [4] Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning
    Wang, Jian
    He, Yonghao
    Kang, Cuicui
    Xiang, Shiming
    Pan, Chunhong
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 347 - 354
  • [5] Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems
    Pereira, Jose Costa
    Vasconcelos, Nuno
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 124 : 123 - 135
  • [6] Modality-specific adaptive scaling and attention network for cross-modal retrieval
    Ke, Xiao
    Chen, Baitao
    Cai, Yuhang
    Liu, Hao
    Guo, Wenzhong
    Chen, Weibin
    NEUROCOMPUTING, 2025, 612
  • [7] FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
    Gao, Dehong
    Jin, Linbo
    Chen, Ben
    Qiu, Minghui
    Li, Peng
    Wei, Yi
    Hu, Yi
    Wang, Hao
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2251 - 2260
  • [8] Continual learning for cross-modal image-text retrieval based on domain-selective attention
    Yang, Rui
    Wang, Shuang
    Gu, Yu
    Wang, Jihui
    Sun, Yingzhi
    Zhang, Huan
    Liao, Yu
    Jiao, Licheng
    PATTERN RECOGNITION, 2024, 149
  • [9] Adaptive Adversarial Learning based cross-modal retrieval
    Li, Zhuoyi
    Lu, Huibin
    Fu, Hao
    Wang, Zhongrui
    Gu, Guanghun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [10] Modality-specific matrix factorization hashing for cross-modal retrieval
    Xiong, Haixia
    Ou, Weihua
    Yan, Zengxian
    Gou, Jianping
    Zhou, Quan
    Wang, Anzhi
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (11) : 5067 - 5081