SIFT-AID: BOOSTING SIFT WITH AN AFFINE INVARIANT DESCRIPTOR BASED ON CONVOLUTIONAL NEURAL NETWORKS

被引：0

作者：

Rodriguez, M. ^{[1
]}

Facciolo, G. ^{[1
]}

von Gioi, R. Grompone ^{[1
]}

Muse, P. ^{[2
]}

Morel, J. -M. ^{[1
]}

Delon, J. ^{[3
]}

机构：

[1] Univ Paris Saclay, CNRS, ENS Paris Saclay, CMLA, F-94235 Cachan, France

[2] Univ Republica, IIE, Montevideo, Uruguay

[3] Univ Paris 05, MAP5, Paris, France

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年

关键词：

image comparison; affine invariance; IMAS; SIFT; RootSIFT; convolutional neural networks; SCALE; STEREO;

D O I：

10.1109/icip.2019.8803425

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

The classic approach to image matching consists in the detection, description and matching of keypoints. The descriptor encodes the local information around the keypoint. An advantage of local approaches is that viewpoint deformations are well approximated by affine maps. This motivated the quest for affine invariant local descriptors. Despite numerous efforts, such descriptors remained elusive, ultimately resulting in the compromise of using viewpoint simulations to attain affine invariance. In this work we propose a CNN-based patch descriptor which captures affine invariance without the need for viewpoint simulations. This is achieved by training a neural network to associate similar vectorial representations to patches related by affine transformations. During matching, these vectors are compared very efficiently. The invariance to translation, rotation and scale is still obtained by the first stages of SIFT, which produce the keypoints. The proposed descriptor outperforms the state-of-the-art in retaining affine invariant properties.

引用

页码：4225 / 4229

页数：5

共 30 条

[1] [Anonymous], EUR C COMP VIS ECCV
[2] ARANDJELOVIC R, 2012, PROC CVPR IEEE, P2911, DOI DOI 10.1109/CVPR.2012.6248018
[3] Baumberg A, 2000, PROC CVPR IEEE, P774, DOI 10.1109/CVPR.2000.855899
[4] Cao F., 2008, A Theory of Shape Identification
[5] DeTone Daniel, 2016, Deep Image Homography Estimation
[6] Kadir T, 2004, LECT NOTES COMPUT SC, V3021, P228
[7] Microsoft COCO: Common Objects in Context
Lin, Tsung-Yi
Maire, Michael
Belongie, Serge
Hays, James
Perona, Pietro
Ramanan, Deva
Dollar, Piotr
Zitnick, C. Lawrence
[J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
[8] Lindeberg T., 1994, Computer Vision - ECCV'94. Third European Conference on Computer Vision. Proceedings. Vol.I, P389
[9] Lowe D.G., 2004, International journal of computer vision, V60, DOI [DOI 10.1023/B:VISI.0000029664.99615.94, 10.1023/B:VISI.0000029664.99615.94]
[10] Robust wide-baseline stereo from maximally stable extremal regions
Matas, J
Chum, O
Urban, M
Pajdla, T
[J]. IMAGE AND VISION COMPUTING, 2004, 22 (10) : 761 - 767

← 1 2 3 →