Learning non-metric visual similarity for image retrieval

被引:28
作者
Garcia, Noa [1 ]
Vogiatzis, George [1 ]
机构
[1] Aston Univ, Birmingham B4 7ET, W Midlands, England
关键词
Image retrieval; Visual similarity; Non-metric learning; FEATURES;
D O I
10.1016/j.imavis.2019.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Measuring visual similarity between two or more instances within a data distribution is a fundamental task in image retrieval. Theoretically, non-metric distances are able to generate a more complex and accurate similarity model than metric distances, provided that the non-linear data distribution is precisely captured by the system. In this work, we explore neural networks models for learning a non-metric similarity function for instance search. We argue that non-metric similarity functions based on neural networks can build a better model of human visual perception than standard metric distances. As our proposed similarity function is differentiable, we explore a real end-to-end trainable approach for image retrieval, i.e. we learn the weights from the input image pixels to the final similarity score. Experimental evaluation shows that non-metric similarity networks are able to learn visual similarities between images and improve performance on top of state-of-the-art image representations, boosting results in standard image retrieval datasets with respect standard metric distances. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:18 / 25
页数:8
相关论文
共 62 条
[41]  
Razavian A.S., 2016, ITE Trans. Media Technol. Appl., V4, P251
[42]   CNN Features off-the-shelf: an Astounding Baseline for Recognition [J].
Razavian, Ali Sharif ;
Azizpour, Hossein ;
Sullivan, Josephine ;
Carlsson, Stefan .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, :512-519
[43]   ImageNet Large Scale Visual Recognition Challenge [J].
Russakovsky, Olga ;
Deng, Jia ;
Su, Hao ;
Krause, Jonathan ;
Satheesh, Sanjeev ;
Ma, Sean ;
Huang, Zhiheng ;
Karpathy, Andrej ;
Khosla, Aditya ;
Bernstein, Michael ;
Berg, Alexander C. ;
Fei-Fei, Li .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252
[44]   Faster R-CNN Features for Instance Search [J].
Salvador, Amaia ;
Giro-i-Nieto, Xavier ;
Marques, Ferran ;
Satoh, Shin'ichi .
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, :394-401
[45]  
Santoro Adam, 2017, Advances in neural information processing systems, P4967
[46]  
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[47]   Video Google: A text retrieval approach to object matching in videos [J].
Sivic, J ;
Zisserman, A .
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, :1470-+
[48]   Deep Metric Learning via Lifted Structured Feature Embedding [J].
Song, Hyun Oh ;
Xiang, Yu ;
Jegelka, Stefanie ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4004-4012
[49]   Learning to Compare: Relation Network for Few-Shot Learning [J].
Sung, Flood ;
Yang, Yongxin ;
Zhang, Li ;
Xiang, Tao ;
Torr, Philip H. S. ;
Hospedales, Timothy M. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1199-1208
[50]  
Szegedy C, 2014, Arxiv, DOI [arXiv:1312.6199, DOI 10.1109/CVPR.2015.7298594]