Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval

被引:113
作者
Brown, Andrew [1 ]
Xie, Weidi [1 ]
Kalogeiton, Vicky [1 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Visual Geometry Grp, Oxford, England
来源
COMPUTER VISION - ECCV 2020, PT IX | 2020年 / 12354卷
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1007/978-3-030-58545-7_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optimising a ranking-based metric, such as Average Precision (AP), is notoriously challenging due to the fact that it is non-differentiable, and hence cannot be optimised directly using gradient-descent methods. To this end, we introduce an objective that optimises instead a smoothed approximation of AP, coined Smooth-AP. Smooth-AP is a plug-and-play objective function that allows for end-to-end training of deep networks with a simple and elegant implementation. We also present an analysis for why directly optimising the ranking based metric of AP offers benefits over other deep metric learning losses. We apply Smooth-AP to standard retrieval benchmarks: Stanford Online products and VehicleID, and also evaluate on larger-scale datasets: INaturalist for fine-grained category retrieval, and VGGFace2 and IJB-C for face retrieval. In all cases, we improve the performance over the state-of-the-art, especially for larger-scale datasets, thus demonstrating the effectiveness and scalability of Smooth-AP to real-world scenarios.
引用
收藏
页码:677 / 694
页数:18
相关论文
共 74 条
[1]  
[Anonymous], 2008, P 2008 INT C WEB SEA, DOI DOI 10.1145/1341531.1341544
[2]  
[Anonymous], 2007, Proceedings of the Advances in Neural Information Processing Systems, DOI DOI 10.1007/S10994-010-5185-8
[3]  
[Anonymous], 2005, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2010, ICML
[5]  
Arandjelovic R., 2016, P CVPR
[6]   All about VLAD [J].
Arandjelovic, Relja ;
Zisserman, Andrew .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :1578-1585
[7]   Neural Codes for Image Retrieval [J].
Babenko, Artem ;
Slesarev, Anton ;
Chigorin, Alexandr ;
Lempitsky, Victor .
COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599
[8]  
Cakir F., 2019, Fastap: deep metric learning to rank
[9]   Deep Metric Learning to Rank [J].
Cakir, Fatih ;
He, Kun ;
Xia, Xide ;
Kulis, Brian ;
Sclaroff, Stan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1861-1870
[10]   VGGFace2: A dataset for recognising faces across pose and age [J].
Cao, Qiong ;
Shen, Li ;
Xie, Weidi ;
Parkhi, Omkar M. ;
Zisserman, Andrew .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74