Joint Specifics and Dual-Semantic Hashing Learning for Cross-Modal Retrieval

被引:10
作者
Teng, Shaohua [1 ]
Lin, Shengjie [1 ]
Teng, Luyao [3 ]
Wu, Naiqi [2 ]
Zheng, Zefeng [1 ]
Fei, Lunke [1 ]
Zhang, Wei [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China
[2] Macau Univ Sci & Technol, Inst Syst Engn, Macao Special Adm Reg China, Taipa, Peoples R China
[3] Guangzhou Panyu Polytech, Sch Informat Engn, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal; Similarity searching; Label semantic; Sample semantic; BINARY-CODES; FRAMEWORK;
D O I
10.1016/j.neucom.2023.126993
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to its low memory and computational requirements, hashing techniques are widely applied for cross modal retrieval. However, there are still two unresolved issues: 1) the class-wise similarity of samples for each modality is not well exploited, and 2) most methods ignore the discriminative capacity of modality-specific information. To solve these two issues, we propose a novel supervised cross-modal hashing method called Joint Specifics and Dual-Semantic Hashing Learning for Cross-Modal Retrieval (SDSHL). SDSHL consists of three methods, i.e., Semantic Embedded Triple Matrix Factorization (SETMF), Modality Specific Dual Semantic Learning (MSDSL) and Modality Consistent Dual Semantic Learning (MCDSL). SETMF utilizes triple matrix factorization to fully explore modality features. MSDSL applies clustering to find the class-wise similarity for each modality, preserving modality-specific information well. MCDSL adopts asymmetric distance-distance difference minimization to capture modality-consistent information among modalities. By using SDSHL, the discrepancies between features and labels are reduced, while both modality-specific and modality-consistent information is well preserved in a shared hash code. Comprehensive experimentation on three benchmark datasets demonstrates the superior performance of SDSHL.
引用
收藏
页数:15
相关论文
共 63 条
[1]   Enhanced Discrete Multi-Modal Hashing: More Constraints Yet Less Time to Learn [J].
Chen, Yong ;
Zhang, Hui ;
Tian, Zhibao ;
Wang, Jun ;
Zhang, Dell ;
Li, Xuelong .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) :1177-1190
[2]   SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval [J].
Chen, Zhen-Duo ;
Li, Chuan-Xiang ;
Luo, Xin ;
Nie, Liqiang ;
Zhang, Wei ;
Xu, Xin-Shun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :2262-2275
[3]   Collective Matrix Factorization Hashing for Multimodal Data [J].
Ding, Guiguang ;
Guo, Yuchen ;
Zhou, Jile .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2083-2090
[4]   Dynamic Double Classifiers Approximation for Cross-Domain Recognition [J].
Fang, Xiaozhao ;
Han, Na ;
Zhou, Guoxu ;
Teng, Shohua ;
Xu, Yong ;
Xie, Shenli .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (04) :2618-2629
[5]   Flexible Affinity Matrix Learning for Unsupervised and Semisupervised Classification [J].
Fang, Xiaozhao ;
Han, Na ;
Wong, Wai Keung ;
Teng, Shaohua ;
Wu, Jigang ;
Xie, Shengli ;
Li, Xuelong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (04) :1133-1149
[6]   UCMH: Unpaired cross-modal hashing with matrix factorization [J].
Gao, Jing ;
Zhang, Wenjun ;
Zhong, Fangming ;
Chen, Zhikui .
NEUROCOMPUTING, 2020, 418 :178-190
[7]   Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].
Gong, Yunchao ;
Lazebnik, Svetlana ;
Gordo, Albert ;
Perronnin, Florent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929
[8]   Collective Reconstructive Embeddings for Cross-Modal Hashing [J].
Hu, Mengqiu ;
Yang, Yang ;
Shen, Fumin ;
Xie, Ning ;
Hong, Richang ;
Shen, Heng Tao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) :2770-2784
[9]   Unsupervised Contrastive Cross-Modal Hashing [J].
Hu, Peng ;
Zhu, Hongyuan ;
Lin, Jie ;
Peng, Dezhong ;
Zhao, Yin-Ping ;
Peng, Xi .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) :3877-3889
[10]   Discrete Latent Factor Model for Cross-Modal Hashing [J].
Jiang, Qing-Yuan ;
Li, Wu-Jun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) :3490-3501