EDMH: Efficient discrete matrix factorization hashing for multi-modal similarity retrieval

被引:15
作者
Yang, Fan [1 ]
Ding, Xiaojian [1 ]
Ma, Fumin [1 ]
Tong, Deyu [1 ]
Cao, Jie [1 ]
机构
[1] Nanjing Univ Finance & Econ, Coll Informat Engn, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; Similarity search; Supervised hashing; Discrete optimization; Matrix factorization; ROBUST;
D O I
10.1016/j.ipm.2023.103301
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hashing has been an emerging topic and has recently attracted widespread attention in multi -modal similarity search applications. However, most existing approaches rely on relaxation schemes to generate binary codes, leading to large quantization errors. In addition, amounts of existing approaches embed labels into the pairwise similarity matrix, leading to expensive time and space costs and losing category information. To address these issues, we propose an Efficient Discrete Matrix factorization Hashing (EDMH). Specifically, EDMH first learns the latent subspaces for individual modality through matrix factorization strategy, which preserves the semantic structure representation information of each modality. In particular, we develop a semantic label offset embedding learning strategy, improving the stability of label embedding regression. Furthermore, we design an efficient discrete optimization scheme to generate compact binary codes discretely. Eventually, we present two efficient learning strategies EDMH-L and EDMH-S to pursue high-quality hash functions. Extensive experiments on various widely-used databases verify that the proposed algorithms produce significant performance and outperform some state-of-the-art approaches, with an average improvement of 2.50% (for Wiki), 2.66% (for MIRFlickr) and 2.25% (for NUS-WIDE) over the best available results, respectively.
引用
收藏
页数:19
相关论文
共 40 条
[1]   Cognitive multi-modal consistent hashing with flexible semantic transformation [J].
An, Junfeng ;
Luo, Haoyang ;
Zhang, Zheng ;
Zhu, Lei ;
Lu, Guangming .
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
[2]   Multimodal Machine Learning: A Survey and Taxonomy [J].
Baltrusaitis, Tadas ;
Ahuja, Chaitanya ;
Morency, Louis-Philippe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443
[3]   Robust Unsupervised Cross-modal Hashing for Multimedia Retrieval [J].
Cheng, Miaomiao ;
Jing, Liping ;
Ng, Michael K. .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 38 (03)
[4]   Collective Matrix Factorization Hashing for Multimodal Data [J].
Ding, Guiguang ;
Guo, Yuchen ;
Zhou, Jile .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2083-2090
[5]   Unsupervised cross-modal retrieval via Multi-modal graph regularized Smooth Matrix Factorization Hashing [J].
Fang, Yixian ;
Zhang, Huaxiang ;
Ren, Yuwei .
KNOWLEDGE-BASED SYSTEMS, 2019, 171 :69-80
[6]   Discrete Latent Factor Model for Cross-Modal Hashing [J].
Jiang, Qing-Yuan ;
Li, Wu-Jun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) :3490-3501
[7]  
Li C, 2019, AAAI CONF ARTIF INTE, P176
[8]   Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval [J].
Li, Chuan-Xiang ;
Yan, Ting-Kun ;
Luo, Xin ;
Nie, Liqiang ;
Xu, Xin-Shun .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (11) :2863-2877
[9]  
Lin ZJ, 2015, PROC CVPR IEEE, P3864, DOI 10.1109/CVPR.2015.7299011
[10]   Cross-Modality Binary Code Learning via Fusion Similarity Hashing [J].
Liu, Hong ;
Ji, Rongrong ;
Wu, Yongjian ;
Huang, Feiyue ;
Zhang, Baochang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6345-6353