Recall@k Surrogate Loss with Large Batches and Similarity Mixup

被引：17

作者：

Patel, Yash ^{[1
]}

Tolias, Giorgos ^{[1
]}

Matas, Jiri ^{[1
]}

机构：

[1] Czech Tech Univ, Visual Recognit Grp, Prague, Czech Republic

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.00735

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work focuses on learning deep visual representation models for retrieval by exploring the interplay between a new loss function, the batch size, and a new regularization approach. Direct optimization, by gradient descent, of an evaluation metric, is not possible when it is non-differentiable, which is the case for recall in retrieval. A differentiable surrogate loss for the recall is proposed in this work. Using an implementation that sidesteps the hardware constraints of the GPU memory, the method trains with a very large batch size, which is essential for metrics computed on the entire retrieval database. It is assisted by an efficient mixup regularization approach that operates on pairwise scalar similarities and virtually increases the batch size further. The suggested method achieves state-of-the-art performance in several image retrieval benchmarks when used for deep metric learning. For instance-level recognition, the method outperforms similar approaches that train using an approximation of average precision.

引用

页码：7492 / 7501

页数：10

共 73 条

[1]

Ba J L., LAYER NORMALIZATION, DOI [DOI 10.48550/ARXIV.1607.06450, 10.48550/arXiv.1607.06450]

[2] Neural Codes for Image Retrieval [J].

Babenko, Artem ;

Slesarev, Anton ;

Chigorin, Alexandr ;

Lempitsky, Victor .

COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599

[3]

Bahdanau D., 2017, INT C LEARNING REPRE

[4]

Balle J., 2018, PROC INT C LEARN REP

[5]

Boudiaf Malik, 2020, ECCV

[6] Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval [J].

Brown, Andrew ;

Xie, Weidi ;

Kalogeiton, Vicky ;

Zisserman, Andrew .

COMPUTER VISION - ECCV 2020, PT IX, 2020, 12354 :677-694

[7] Deep Metric Learning to Rank [J].

Cakir, Fatih ;

He, Kun ;

Xia, Xide ;

Kulis, Brian ;

Sclaroff, Stan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1861-1870

[8]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[9] ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].

Deng, Jiankang ;

Guo, Jia ;

Xue, Niannan ;

Zafeiriou, Stefanos .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694

[10]

Dosovitskiy A, 2021, P INT C LEARN REPR, DOI [DOI 10.48550/ARXIV.2010.11929, 10.48550/arXiv.2010.11929]

← 1 2 3 4 5 6 7 8 →