SIFT-AID: BOOSTING SIFT WITH AN AFFINE INVARIANT DESCRIPTOR BASED ON CONVOLUTIONAL NEURAL NETWORKS

被引:0
作者
Rodriguez, M. [1 ]
Facciolo, G. [1 ]
von Gioi, R. Grompone [1 ]
Muse, P. [2 ]
Morel, J. -M. [1 ]
Delon, J. [3 ]
机构
[1] Univ Paris Saclay, CNRS, ENS Paris Saclay, CMLA, F-94235 Cachan, France
[2] Univ Republica, IIE, Montevideo, Uruguay
[3] Univ Paris 05, MAP5, Paris, France
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
关键词
image comparison; affine invariance; IMAS; SIFT; RootSIFT; convolutional neural networks; SCALE; STEREO;
D O I
10.1109/icip.2019.8803425
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
The classic approach to image matching consists in the detection, description and matching of keypoints. The descriptor encodes the local information around the keypoint. An advantage of local approaches is that viewpoint deformations are well approximated by affine maps. This motivated the quest for affine invariant local descriptors. Despite numerous efforts, such descriptors remained elusive, ultimately resulting in the compromise of using viewpoint simulations to attain affine invariance. In this work we propose a CNN-based patch descriptor which captures affine invariance without the need for viewpoint simulations. This is achieved by training a neural network to associate similar vectorial representations to patches related by affine transformations. During matching, these vectors are compared very efficiently. The invariance to translation, rotation and scale is still obtained by the first stages of SIFT, which produce the keypoints. The proposed descriptor outperforms the state-of-the-art in retaining affine invariant properties.
引用
收藏
页码:4225 / 4229
页数:5
相关论文
共 30 条
  • [1] [Anonymous], EUR C COMP VIS ECCV
  • [2] ARANDJELOVIC R, 2012, PROC CVPR IEEE, P2911, DOI DOI 10.1109/CVPR.2012.6248018
  • [3] Baumberg A, 2000, PROC CVPR IEEE, P774, DOI 10.1109/CVPR.2000.855899
  • [4] Cao F., 2008, A Theory of Shape Identification
  • [5] DeTone Daniel, 2016, Deep Image Homography Estimation
  • [6] Kadir T, 2004, LECT NOTES COMPUT SC, V3021, P228
  • [7] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
  • [8] Lindeberg T., 1994, Computer Vision - ECCV'94. Third European Conference on Computer Vision. Proceedings. Vol.I, P389
  • [9] Lowe D.G., 2004, International journal of computer vision, V60, DOI [DOI 10.1023/B:VISI.0000029664.99615.94, 10.1023/B:VISI.0000029664.99615.94]
  • [10] Robust wide-baseline stereo from maximally stable extremal regions
    Matas, J
    Chum, O
    Urban, M
    Pajdla, T
    [J]. IMAGE AND VISION COMPUTING, 2004, 22 (10) : 761 - 767