Cross-modal retrieval based on deep regularized hashing constraints

被引：10

作者：

Khan, Asad ^{[1
]}

Hayat, Sakander ^{[2
]}

Ahmad, Muhammad ^{[3
]}

Wen, Jinyu ^{[1
]}

Farooq, Muhammad Umar ^{[4
]}

Fang, Meie ^{[1
]}

Jiang, Wenchao ^{[5
]}

机构：

[1] Guangzhou Univ, Sch Comp Sci & Cyber Engn, Guangzhou 510006, Peoples R China

[2] Guangzhou Univ, Sch Math & Informat Sci, Guangzhou, Peoples R China

[3] Natl Univ Comp & Emerging Sci NUCES FAST, Dept Comp Sci, Faisalabad Campus, Chiniot, Pakistan

[4] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei, Peoples R China

[5] Guangdong Univ Technol, Sch Comp, Guangzhou, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS | 2022年 / 37卷 / 09期

基金：

中国国家自然科学基金;

关键词：

cross-modal retrieval; hashing learning; image search; multilabel information; neural network; ranking model; triplet loss; SIMILARITY; NETWORK;

D O I：

10.1002/int.22853

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-modal retrieval has attracted great attention due to the increasing demand for tremendous amounts of multimodal data in recent years. These retrievals could either be text-to-image or image-to-text. To address the problem of inappropriate information included between images and texts, we propose two cross-modal recovery techniques established on a dual-branch neural network defined on a common subspace and the hashing learning method. First, a cross-modal recovery technique established on a multilabel information deep ranking model (MIDRM) is provided. In this method, we introduce a triplet-loss function into the dual-branch neural network model. This function takes advantage of the semantic information of the bimodal components, focusing on not only the similarities between similar images and text features but also the distances between dissimilar images and texts. Second, we establish a new cross-modal hashing technique said to be the deep regularized hashing constraint (DRHC). In this method, the regularized function is used to replace the binary constraint, and the discrete value is constrained to a certain numerical range so that the network can achieve end-to-end training. Overall, the time complexity is greatly improved, and the occupied storage space is also greatly reduced. Different experiments on our proposed MIDRM and DRHC models demonstrate their superior performance to those of the state-of-the-art methods on two widely used data sets. The experimental results show that our approach also increases the mean average precision of cross-modal recovery.

引用

页码：6508 / 6530

页数：23

共 50 条

[1] Cross-Modal Hashing Retrieval Based on Deep Residual Network
Li, Zhiyi
Xu, Xiaomian
Zhang, Du
Zhang, Peng
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 36 (02): : 383 - 405
[2] Deep Hashing Similarity Learning for Cross-Modal Retrieval
Ma, Ying
Wang, Meng
Lu, Guangyun
Sun, Yajun
IEEE ACCESS, 2024, 12 : 8609 - 8618
[3] Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval
Zhan, Yu-Wei
Luo, Xin
Wang, Yongxin
Xu, Xin-Shun
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3386 - 3394
[4] Deep Multiscale Fusion Hashing for Cross-Modal Retrieval
Nie, Xiushan
Wang, Bowei
Li, Jiajia
Hao, Fanchang
Jian, Muwei
Yin, Yilong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 401 - 410
[5] Triplet-Based Deep Hashing Network for Cross-Modal Retrieval
Deng, Cheng
Chen, Zhaojia
Liu, Xianglong
Gao, Xinbo
Tao, Dacheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) : 3893 - 3903
[6] Label-Based Deep Semantic Hashing for Cross-Modal Retrieval
Weng, Weiwei
Wu, Jiagao
Yang, Lu
Liu, Linfeng
Hu, Bin
NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 24 - 36
[7] Semantic Constraints Matrix Factorization Hashing for cross-modal retrieval
Li, Weian
Xiong, Haixia
Ou, Weihua
Gou, Jianping
Deng, Jiaxing
Liang, Linqing
Zhou, Quan
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
[8] Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval
Zhong, Fangming
Chen, Zhikui
Min, Geyong
PATTERN RECOGNITION, 2018, 83 : 64 - 77
[9] Deep Cross-Modal Hashing
Jiang, Qing-Yuan
Li, Wu-Jun
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3270 - 3278
[10] Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval
Lu, Kangkang
Yu, Yanhua
Liang, Meiyu
Zhang, Min
Cao, Xiaowen
Zhao, Zehua
Yin, Mengran
Xue, Zhe
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 126 - 131

← 1 2 3 4 5 →