Learning non-metric visual similarity for image retrieval

Cited by: 28

Authors:

Garcia, Noa [1]

Vogiatzis, George [1]

Affiliations:

[1] Aston Univ, Birmingham B4 7ET, W Midlands, England

Source:

Image and Vision Computing | 2019 / Vol. 82

Keywords:

Image retrieval; Visual similarity; Non-metric learning; Features

DOI:

10.1016/j.imavis.2019.01.001

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Measuring visual similarity between two or more instances within a data distribution is a fundamental task in image retrieval. Theoretically, non-metric distances are able to generate a more complex and accurate similarity model than metric distances, provided that the non-linear data distribution is precisely captured by the system. In this work, we explore neural network models for learning a non-metric similarity function for instance search. We argue that non-metric similarity functions based on neural networks can build a better model of human visual perception than standard metric distances. As our proposed similarity function is differentiable, we explore a real end-to-end trainable approach for image retrieval, i.e., we learn the weights from the input image pixels to the final similarity score. Experimental evaluation shows that non-metric similarity networks are able to learn visual similarities between images and improve performance on top of state-of-the-art image representations, boosting results on standard image retrieval datasets with respect to standard metric distances. © 2019 Elsevier B.V. All rights reserved.
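The contrast the abstract draws between a metric-style score and a learned non-metric one can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the class name `SimilarityMLP`, the layer sizes, the random untrained weights, and the 8-dimensional toy vectors are hypothetical and are not taken from the paper, whose architecture and training procedure are not described in this record.

```python
import numpy as np


def cosine_similarity(a, b):
    """Conventional baseline score: symmetric in its two arguments."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


class SimilarityMLP:
    """Toy learned similarity: a two-layer network over a concatenated pair.

    Hypothetical illustration only. The weights are random and untrained;
    in a real system they would be fitted to pairs labelled similar or not.
    """

    def __init__(self, dim, hidden=16, seed=0):
        rng = np.random.default_rng(seed)
        self.w1 = rng.normal(scale=0.5, size=(2 * dim, hidden))
        self.b1 = np.zeros(hidden)
        self.w2 = rng.normal(scale=0.5, size=hidden)
        self.b2 = 0.0

    def __call__(self, a, b):
        x = np.concatenate([a, b])                   # order of the pair matters
        h = np.maximum(0.0, x @ self.w1 + self.b1)   # ReLU hidden layer
        return float(h @ self.w2 + self.b2)          # unconstrained scalar score


# Two toy "image representations" (random vectors standing in for features).
rng = np.random.default_rng(1)
a, b = rng.normal(size=8), rng.normal(size=8)
net = SimilarityMLP(dim=8)

print("cosine(a, b), cosine(b, a):", cosine_similarity(a, b), cosine_similarity(b, a))
print("net(a, b),    net(b, a):   ", net(a, b), net(b, a))
```

Nothing in the network forces symmetry, non-negativity, or the triangle inequality, so its two scores will generally differ, whereas the cosine scores agree. Because every operation is differentiable almost everywhere, such a scoring function could in principle be trained jointly with a feature extractor, which is the end-to-end property the abstract refers to.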

Citation

Pages: 18-25

Page count: 8

References (62 in total)

[41] Razavian A.S., 2016, ITE Trans. Media Technol. Appl., V4, P251

[42] CNN Features off-the-shelf: an Astounding Baseline for Recognition. Razavian, Ali Sharif; Azizpour, Hossein; Sullivan, Josephine; Carlsson, Stefan. 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2014: 512-519

[43] ImageNet Large Scale Visual Recognition Challenge. Russakovsky, Olga; Deng, Jia; Su, Hao; Krause, Jonathan; Satheesh, Sanjeev; Ma, Sean; Huang, Zhiheng; Karpathy, Andrej; Khosla, Aditya; Bernstein, Michael; Berg, Alexander C.; Fei-Fei, Li. International Journal of Computer Vision, 2015, 115 (3): 211-252

[44] Faster R-CNN Features for Instance Search. Salvador, Amaia; Giro-i-Nieto, Xavier; Marques, Ferran; Satoh, Shin'ichi. Proceedings of 29th IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2016), 2016: 394-401

[45] Santoro Adam, 2017, Advances in Neural Information Processing Systems, P4967

[46] Simonyan K, 2015, arXiv, DOI arXiv:1409.1556

[47] Video Google: A text retrieval approach to object matching in videos. Sivic, J; Zisserman, A. Ninth IEEE International Conference on Computer Vision, Vols I and II, Proceedings, 2003: 1470-+

[48] Deep Metric Learning via Lifted Structured Feature Embedding. Song, Hyun Oh; Xiang, Yu; Jegelka, Stefanie; Savarese, Silvio. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 4004-4012

[49] Learning to Compare: Relation Network for Few-Shot Learning. Sung, Flood; Yang, Yongxin; Zhang, Li; Xiang, Tao; Torr, Philip H. S.; Hospedales, Timothy M. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 1199-1208

[50] Szegedy C, 2014, arXiv, DOI [arXiv:1312.6199, DOI 10.1109/CVPR.2015.7298594]
