Semi-supervised Cross-Modal Hashing with Graph Convolutional Networks

被引:8
作者
Duan, Jiasheng [1 ]
Luo, Yadan [1 ]
Wang, Ziwei [1 ]
Huang, Zi [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
来源
DATABASES THEORY AND APPLICATIONS, ADC 2020 | 2020年 / 12008卷
关键词
Cross-modal hashing; GCN; Semi-supervised learning; CODES;
D O I
10.1007/978-3-030-39469-1_8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal hashing for large-scale approximate neighbor search has attracted great attention recently because of its significant computational and storage efficiency. However, it is still challenging to generate high-quality binary codes to preserve inter-modal and intra-modal semantics, especially in a semi-supervised manner. In this paper, we propose a semi-supervised cross-modal discrete code learning framework. This is the very first work of applying asymmetric graph convolutional networks (GCNs) for scalable cross-modal retrieval. Specifically, the architecture contains multiple GCN branches, each of which is for one data modality to extract modality-specific features and then to generate unified binary hash codes across different modalities, so that the underlying correlations and similarities across modalities are simultaneously preserved into the hash values. Moreover, the branches are built with asymmetric graph convolutional layers, which employ randomly sampled anchors to tackle the scalability and out-of-sample issue in graph learning, and reduce the complexity of cross-modal similarity calculation. Extensive experiments conducted on benchmark datasets demonstrate that our method can achieve superior retrieval performance in comparison with the state-of-the-art methods.
引用
收藏
页码:93 / 104
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 2008, ICML
[2]  
[Anonymous], 2017, PROC INT C LEARN REP
[3]  
[Anonymous], 2010, P 27 INT C MACH LEAR, DOI 10.5555/3104322.3104425
[4]  
Bronstein MM, 2010, PROC CVPR IEEE, P3594, DOI 10.1109/CVPR.2010.5539928
[5]  
Chua T. -S., 2009, P ACM INT C IM VID R, V1, P9
[6]   Collective Matrix Factorization Hashing for Multimodal Data [J].
Ding, Guiguang ;
Guo, Yuchen ;
Zhou, Jile .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2083-2090
[7]   Canonical correlation analysis: An overview with application to learning methods [J].
Hardoon, DR ;
Szedmak, S ;
Shawe-Taylor, J .
NEURAL COMPUTATION, 2004, 16 (12) :2639-2664
[8]   Iterative Multi-View Hashing for Cross Media Indexing [J].
Hu, Yao ;
Jin, Zhongming ;
Ren, Hongyi ;
Cai, Deng ;
He, Xiaofei .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :527-536
[9]  
Huiskes MJ, 2008, P 1 ACM INT C MULT I, P39
[10]   Deep Cross-Modal Hashing [J].
Jiang, Qing-Yuan ;
Li, Wu-Jun .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3270-3278