Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations

被引：0

作者：

Yanagi, Rintaro ^{[1
]}

Togo, Ren ^{[2
]}

Ogawa, Takahiro ^{[3
]}

Haseyama, Miki ^{[3
]}

机构：

[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan

[2] Hokkaido Univ, Educ & Res Ctr Math & Data Sci, Sapporo, Hokkaido 0600812, Japan

[3] Hokkaido Univ, Fac Informat Sci & Technol, Div Media & Network Technol, Sapporo, Hokkaido 0600814, Japan

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2021年 / E104A卷 / 06期

关键词：

cross-modal retrieval; text-to-image generative adversarial network; style transfer; domain adaptation;

D O I：

10.1587/transfun.2020IMP0011

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Various cross-modal retrieval methods that can retrieve images related to a query sentence without text annotations have been proposed. Although a high level of retrieval performance is achieved by these methods, they have been developed for a single domain retrieval setting. When retrieval candidate images come from various domains, the retrieval performance of these methods might be decreased. To deal with this problem, we propose a new domain adaptive cross-modal retrieval method. By translating a modality and domains of a query and candidate images, our method can retrieve desired images accurately in a different domain retrieval setting. Experimental results for clipart and painting datasets showed that the proposed method has better retrieval performance than that of other conventional and state-of-the-art methods.

引用

页码：866 / 875

页数：10

共 50 条

[1] Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation
Zhao, Wentian
Wu, Xinxiao
Luo, Jiebo
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1180 - 1192
[2] Multi-level Alignment Network for Domain Adaptive Cross-modal Retrieval
Dong, Jianfeng
Long, Zhongzi
Mao, Xiaofeng
Lin, Changting
He, Yuan
Ji, Shouling
NEUROCOMPUTING, 2021, 440 : 207 - 219
[3] Domain Invariant Subspace Learning for Cross-Modal Retrieval
Liu, Chenlu
Xu, Xing
Yang, Yang
Lu, Huimin
Shen, Fumin
Ji, Yanli
MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 94 - 105
[4] Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning
Wang, Jian
He, Yonghao
Kang, Cuicui
Xiang, Shiming
Pan, Chunhong
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 347 - 354
[5] Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems
Pereira, Jose Costa
Vasconcelos, Nuno
COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 124 : 123 - 135
[6] Modality-specific adaptive scaling and attention network for cross-modal retrieval
Ke, Xiao
Chen, Baitao
Cai, Yuhang
Liu, Hao
Guo, Wenzhong
Chen, Weibin
NEUROCOMPUTING, 2025, 612
[7] FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
Gao, Dehong
Jin, Linbo
Chen, Ben
Qiu, Minghui
Li, Peng
Wei, Yi
Hu, Yi
Wang, Hao
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2251 - 2260
[8] Continual learning for cross-modal image-text retrieval based on domain-selective attention
Yang, Rui
Wang, Shuang
Gu, Yu
Wang, Jihui
Sun, Yingzhi
Zhang, Huan
Liao, Yu
Jiao, Licheng
PATTERN RECOGNITION, 2024, 149
[9] Adaptive Adversarial Learning based cross-modal retrieval
Li, Zhuoyi
Lu, Huibin
Fu, Hao
Wang, Zhongrui
Gu, Guanghun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[10] Modality-specific matrix factorization hashing for cross-modal retrieval
Xiong, Haixia
Ou, Weihua
Yan, Zengxian
Gou, Jianping
Zhou, Quan
Wang, Anzhi
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (11) : 5067 - 5081

← 1 2 3 4 5 →