Repeatability Is Not Enough: Learning Affine Regions via Discriminability

被引:166
作者
Mishkin, Dmytro [1 ]
Radenovic, Filip [1 ]
Matas, Jiri [1 ]
机构
[1] Czech Tech Univ, FEE, Visual Recognit Grp, Ctr Machine Percept, Prague, Czech Republic
来源
COMPUTER VISION - ECCV 2018, PT IX | 2018年 / 11213卷
关键词
Local features; Affine shape; Loss function; Image retrieval; QUERY EXPANSION; SCALE; DESCRIPTORS; GEOMETRY;
D O I
10.1007/978-3-030-01240-3_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method for learning local affine-covariant regions is presented. We show that maximizing geometric repeatability does not lead to local regions, a.k.a features, that are reliably matched and this necessitates descriptor-based learning. We explore factors that influence such learning and registration: the loss function, descriptor type, geometric parametrization and the trade-off between matchability and geometric accuracy and propose a novel hard negative-constant loss function for learning of affine regions. The affine shape estimator - AffNet - trained with the hard negative-constant loss outperforms the state-of-the-art in bag-of-words image retrieval and wide baseline stereo. The proposed training process does not require precisely geometrically aligned patches. The source codes and trained weights are available at https://github.com/ducha-aiki/affnet.
引用
收藏
页码:287 / 304
页数:18
相关论文
共 60 条
[31]  
Mishkin D, 2015, BMVC
[32]   Systematic evaluation of convolution neural network advances on the Imagenet [J].
Mishkin, Dmytro ;
Sergievskiy, Nikolay ;
Matas, Jiri .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 161 :11-19
[33]   MODS: Fast and robust method for two-view matching [J].
Mishkin, Dmytro ;
Matas, Jiri ;
Perdoch, Michal .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 141 :81-93
[34]  
Muja M, 2009, VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, P331
[35]  
Nair V., 2010, P 27 INT C MACH LEAR, P807
[36]   Large-Scale Image Retrieval with Attentive Deep Local Features [J].
Noh, Hyeonwoo ;
Araujo, Andre ;
Sim, Jack ;
Weyand, Tobias ;
Han, Bohyung .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3476-3485
[37]  
Perd'och M, 2009, PROC CVPR IEEE, P9, DOI 10.1109/CVPRW.2009.5206529
[38]  
Philbin J, 2008, PROC CVPR IEEE, P2285
[39]   Radially-Distorted Conjugate Translations [J].
Pritts, James ;
Kukelova, Zuzana ;
Larsson, Viktor ;
Chum, Ondrej .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1993-2001
[40]   Fine-Tuning CNN Image Retrieval with No Human Annotation [J].
Radenovic, Filip ;
Tolias, Giorgos ;
Chum, Ondrej .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (07) :1655-1668