ExchNet: A Unified Hashing Network for Large-Scale Fine-Grained Image Retrieval

Cited by: 39
Authors
Cui, Quan [1]
Jiang, Qing-Yuan [2]
Wei, Xiu-Shen [3]
Li, Wu-Jun [2]
Yoshie, Osamu [1]
Affiliations
[1] Waseda Univ, Grad Sch IPS, Fukuoka, Japan
[2] Nanjing Univ, Dept Comp Sci & Technol, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[3] Megvii Technol, Megvii Res Nanjing, Nanjing, Peoples R China
Source
COMPUTER VISION - ECCV 2020, PT III | 2020 / Vol. 12348
Keywords
Fine-Grained Image Retrieval; Learning to hash; Feature alignment; Large-scale image search; QUANTIZATION
DOI
10.1007/978-3-030-58580-8_12
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Retrieving content-relevant images from a large-scale fine-grained dataset can suffer from intolerably slow query speed and highly redundant storage cost, because distinguishing the subtle visual differences between fine-grained objects relies on high-dimensional real-valued embeddings. In this paper, we study the novel topic of fine-grained hashing, which generates compact binary codes for fine-grained images and leverages the search and storage efficiency of hash learning to alleviate these problems. Specifically, we propose a unified end-to-end trainable network, termed ExchNet. Based on attention mechanisms and the proposed attention constraints, ExchNet first obtains both local and global features, which represent object parts and whole fine-grained objects, respectively. Furthermore, to ensure the discriminative ability and semantic consistency of these part-level features across images, we design a local feature alignment approach that performs a feature exchanging operation. An alternating learning algorithm is then employed to optimize the whole ExchNet and generate the final binary hash codes. Validated by extensive experiments, ExchNet consistently outperforms state-of-the-art generic hashing methods on five fine-grained datasets. Moreover, compared with other approximate nearest neighbor methods, ExchNet achieves the best speed-up and storage reduction, demonstrating its efficiency and practicality.
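The storage and search-speed argument in the abstract can be illustrated with a minimal sketch. This is not ExchNet itself: it assumes a generic sign-threshold binarization of random stand-in embeddings, and the function names `binarize` and `hamming_distances`, the 64-bit code length, and the database size are all hypothetical choices made for illustration.

```python
import numpy as np


def binarize(embeddings: np.ndarray) -> np.ndarray:
    """Threshold real-valued embeddings at zero and pack 8 bits per byte.

    Assumption: a plain sign threshold stands in for a learned hash layer.
    """
    bits = (embeddings > 0).astype(np.uint8)
    return np.packbits(bits, axis=1)


def hamming_distances(query_code: np.ndarray, db_codes: np.ndarray) -> np.ndarray:
    """Hamming distance from one packed code to every packed database code."""
    xor = np.bitwise_xor(db_codes, query_code)  # differing bits are set to 1
    return np.unpackbits(xor, axis=1).sum(axis=1)


# Stand-in data: 1000 items with 64-dimensional float32 embeddings.
rng = np.random.default_rng(0)
db_real = rng.standard_normal((1000, 64)).astype(np.float32)

db_codes = binarize(db_real)   # 64 bits -> 8 bytes per item
query_code = db_codes[42]      # query with a code already in the database

dists = hamming_distances(query_code, db_codes)
top5 = np.argsort(dists, kind="stable")[:5]  # indices of the 5 nearest codes

# Storage comparison: 4 bytes per float dimension versus 1 bit per dimension.
print(db_real.nbytes, db_codes.nbytes)
```

In this sketch each item shrinks from 256 bytes of float32 values to 8 bytes of packed bits, and ranking reduces to XOR and bit counting rather than floating-point distance computation.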
Pages: 189-205
Page count: 17