Relative Camera Pose Estimation Using Convolutional Neural Networks

被引:102
作者
Melekhov, Iaroslav [1 ]
Ylioinas, Juha [1 ]
Kannala, Juho [1 ]
Rahtu, Esa [2 ]
机构
[1] Aalto Univ, Helsinki, Finland
[2] Tampere Univ Technol, Tampere, Finland
来源
ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017) | 2017年 / 10617卷
关键词
Relative camera pose estimation; Deep neural networks; Spatial pyramid pooling;
D O I
10.1007/978-3-319-70353-4_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a convolutional neural network based approach for estimating the relative pose between two cameras. The proposed network takes RGB images from both cameras as input and directly produces the relative rotation and translation as output. The system is trained in an end-to-end manner utilising transfer learning from a large scale classification dataset. The introduced approach is compared with widely used local feature based methods (SURF, ORB) and the results indicate a clear improvement over the baseline. In addition, a variant of the proposed architecture containing a spatial pyramid pooling (SPP) layer is evaluated and shown to further improve the performance.
引用
收藏
页码:675 / 687
页数:13
相关论文
共 23 条
  • [1] [Anonymous], 2011, ICCV
  • [2] [Anonymous], 2016, CoRR, abs/1606.03798
  • [3] [Anonymous], ABS14126574 CORR
  • [4] [Anonymous], 2015, ICCV
  • [5] [Anonymous], BIGLEARN NIPS WORKSH
  • [6] [Anonymous], 2016, ABS161202401 CORR
  • [7] [Anonymous], 2014, NIPS
  • [8] [Anonymous], 2015, ICCV
  • [9] Azizpour H., 2015, CVPRW
  • [10] SURF: Speeded up robust features
    Bay, Herbert
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417