Heterogeneous Interactive Learning Network for Unsupervised Cross-Modal Retrieval

被引:0
|
作者
Zheng, Yuanchao [1 ]
Zhang, Xiaowei [1 ]
机构
[1] Qingdao Univ, Qingdao, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Cross-modal hashing; Heterogeneous interactive; Adversarial loss;
D O I
10.1007/978-3-031-26316-3_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modal hashing has received a lot of attention because of its unique characteristic of low storage cost and high retrieval efficiency. However, these existing cross-modal retrieval approaches often fail to align effectively semantic information due to information asymmetry between image and text modality. To address this issue, we propose Heterogeneous Interactive Learning Network (HILN) for unsupervised cross-modal retrieval to alleviate the problem of the heterogeneous semantic gap. Specifically, we introduce a multi-head self-attention mechanism to capture the global dependencies of semantic features within the modality. Moreover, since the semantic relations among object entities from different modalities exist consistency, we perform heterogeneous feature fusion through the heterogeneous feature interaction module, especially through the cross attention in it to learn the interaction between different modal features. Finally, to further maintain semantic consistency, we introduce adversarial loss into network learning to generate more robust hash codes. Extensive experiments demonstrate that the proposed HILN improves the accuracy of T -> I and I -> T cross-modal retrieval tasks by 7.6% and 5.5% over the best competitor DGCPN on the NUS-WIDE dataset, respectively. Code is available at https://github.com/Z000204/HILN.
引用
收藏
页码:692 / 707
页数:16
相关论文
共 50 条
  • [41] High-order nonlocal Hashing for unsupervised cross-modal retrieval
    Peng-Fei Zhang
    Yadan Luo
    Zi Huang
    Xin-Shun Xu
    Jingkuan Song
    World Wide Web, 2021, 24 : 563 - 583
  • [42] Unsupervised Cross-Modal Medical Image Retrieval with Ensemble Prototype Alignment
    Yao, Yishan
    Liu, Xiaoqing
    Yu, Zhiwen
    Lv, Jianming
    Hu, Yang
    Yang, Kaixiang
    2024 IEEE INTERNATIONAL CONFERENCE ON MEDICAL ARTIFICIAL INTELLIGENCE, MEDAI 2024, 2024, : 161 - 167
  • [43] Cross-Modal Retriever: Unsupervised Image Retrieval with Text and Reference Images
    Desai, Padmashree
    Kumar, Vivek
    Srivastava, Chandan
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [44] Unsupervised cross-modal hashing retrieval via Dynamic Contrast and Optimization
    Xie, Xiumin
    Li, Zhixin
    Li, Bo
    Zhang, Canlong
    Ma, Huifang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [45] Structure-aware contrastive hashing for unsupervised cross-modal retrieval
    Cui, Jinrong
    He, Zhipeng
    Huang, Qiong
    Fu, Yulu
    Li, Yuting
    Wen, Jie
    NEURAL NETWORKS, 2024, 174
  • [46] Self-Attentive CLIP Hashing for Unsupervised Cross-Modal Retrieval
    Yu, Heng
    Ding, Shuyan
    Li, Lunbo
    Wu, Jiexin
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [47] High-order nonlocal Hashing for unsupervised cross-modal retrieval
    Zhang, Peng-Fei
    Luo, Yadan
    Huang, Zi
    Xu, Xin-Shun
    Song, Jingkuan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (02): : 563 - 583
  • [48] Cross-Modal Interaction Network for Video Moment Retrieval
    Ping, Shen
    Jiang, Xiao
    Tian, Zean
    Cao, Ronghui
    Chi, Weiming
    Yang, Shenghong
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (08)
  • [49] Event-Driven Network for Cross-Modal Retrieval
    Zeng, Zhixiong
    Xu, Nan
    Mao, Wenji
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2297 - 2300
  • [50] DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL
    Zhou, Yu
    Feng, Yong
    Zhou, Mingliang
    Qiang, Baohua
    Hou, Leong U.
    Zhu, Jiajie
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4325 - 4329