Unsupervised Deep Cross-modality Spectral Hashing

被引:23
作者
Hoang, Tuan [1 ]
Do, Thanh-Toan [2 ]
Nguyen, Tam V. [3 ]
Cheung, Ngai-Man [1 ]
机构
[1] Singapore Univ Technol & Design SUTD, Informat Syst Technol & Design, Singapore 487372, Singapore
[2] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, Merseyside, England
[3] Univ Dayton, Dept Comp Sci, Dayton, OH 45469 USA
基金
新加坡国家研究基金会;
关键词
Binary codes; Semantics; Optimization; Correlation; Sparse matrices; Task analysis; Training data; Cross-modal retrieval; spectral hashing; image search; constraint optimization; BINARY-CODES; QUANTIZATION; SIMILARITY;
D O I
10.1109/TIP.2020.3014727
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel framework, namely Deep Cross-modality Spectral Hashing (DCSH), to tackle the unsupervised learning problem of binary hash codes for efficient cross-modal retrieval. The framework is a two-step hashing approach which decouples the optimization into (1) binary optimization and (2) hashing function learning. In the first step, we propose a novel spectral embedding-based algorithm to simultaneously learn single-modality and binary cross-modality representations. While the former is capable of well preserving the local structure of each modality, the latter reveals the hidden patterns from all modalities. In the second step, to learn mapping functions from informative data inputs (images and word embeddings) to binary codes obtained from the first step, we leverage the powerful CNN for images and propose a CNN-based deep architecture to learn text modality. Quantitative evaluations on three standard benchmark datasets demonstrate that the proposed DCSH method consistently outperforms other state-of-the-art methods.
引用
收藏
页码:8391 / 8406
页数:16
相关论文
共 95 条
[1]  
[Anonymous], 2016, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-319-46454-1_14
[2]  
[Anonymous], 2016, C IND ELECT APPL
[3]  
[Anonymous], 2020, COMPUT VIS IMAGE UND, DOI DOI 10.1016/J.CVIU.2019.102852
[4]  
[Anonymous], 2015, IEEE T IMAGE PROCESS, DOI DOI 10.1109/TIP.2015.2467315
[5]  
[Anonymous], 2017, IEEE IJCNN
[6]   Beyond diffusion process: Neighbor set similarity for fast re-ranking [J].
Bai, Xiang ;
Bai, Song ;
Wang, Xinggang .
INFORMATION SCIENCES, 2015, 325 :342-354
[7]  
Boutsidis C, 2015, PR MACH LEARN RES, V37, P40
[8]   Cross-Modal Hamming Hashing [J].
Cao, Yue ;
Liu, Bin ;
Long, Mingsheng ;
Wang, Jianmin .
COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 :207-223
[9]   Deep Cauchy Hashing for Hamming Space Retrieval [J].
Cao, Yue ;
Long, Mingsheng ;
Liu, Bin ;
Wang, Jianmin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1229-1237
[10]   Deep Visual-Semantic Hashing for Cross-Modal Retrieval [J].
Cao, Yue ;
Long, Mingsheng ;
Wang, Jianmin ;
Yang, Qiang ;
Yu, Philip S. .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1445-1454