Linear-PoseNet: A Real-Time Camera Pose Estimation System Using Linear Regression and Principal Component Analysis

被引：0

作者：

Elmoogy, Ahmed ^{[1
]}

Dong, Xiaodai ^{[1
]}

Lu, Tao ^{[1
]}

Westendorp, Robert ^{[2
]}

Reddy, Kishore ^{[2
]}

机构：

[1] Univ Victoria, Elect & Comp Engn, Victoria, BC, Canada

[2] Fortinet, Burnaby, BC, Canada

来源：

2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL) | 2020年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Image localization; Linear regression; PCA;

D O I：

10.11019/VTC2020-Fall49728.2020.9348762

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Neural networks-based camera pose estimation systems rely on fine tuning very large networks to regress the camera position and orientation with very complex training procedure. In this paper, we explore the following question: do we need to fine tune and train such complex networks to reach the desired accuracy? We show that we can reach comparable or better accuracy for the single image indoor localization systems with using only one layer of ridge regression and pretrained features of ResNet-50 architecture with training time less than a second on CPU instead of hours of GPU training needed by the state of the art. For outdoor scenes, we show that using only 3 fully connected layers on top of pretrained ResNet50 features without fine-tuning can perform well compared to the state of the art with only minutes of training. For more complexity reduction, we show that downsampling the pretrained ResNet-50 features by more than 10 times using principal component analysis (PCA) has a little effect on the performance but can save both training time and storage space.

引用

页数：6

共 30 条

[1] Speeded-Up Robust Features (SURF) [J].

Bay, Herbert ;

Ess, Andreas ;

Tuytelaars, Tinne ;

Van Gool, Luc .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359

[2] DSAC - Differentiable RANSAC for Camera Localization [J].

Brachmann, Eric ;

Krull, Alexander ;

Nowozin, Sebastian ;

Shotton, Jamie ;

Michel, Frank ;

Gumhold, Stefan ;

Rother, Carsten .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2492-2500

[3]

Cai M., 2018, HYBRID PROBABILISTIC

[4]

Choi S., 2009, PUBLIC LAW RES PAPER, DOI 10.5244/C.23.81

[5] Faster Visual-Based Localization with Mobile-PoseNet [J].

Cimarelli, Claudio ;

Cazzato, Dario ;

Olivares-Mendez, Miguel A. ;

Voos, Holger .

COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT II, 2019, 11679 :219-230

[6] VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization [J].

Clark, Ronald ;

Wang, Sen ;

Markham, Andrew ;

Trigoni, Niki ;

Wen, Hongkai .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2652-2660

[7]

Elmoogy A. M., 2020, SURF LSTM DESCRIPTOR

[8] SurfCNN: A Descriptor Accelerated Convolutional Neural Network for Image-Based Indoor Localization [J].

Elmoogy, Ahmed M. ;

Dong, Xiaodai ;

Lu, Tao ;

Westendorp, Robert ;

Tarimala, Kishore Reddy .

IEEE ACCESS, 2020, 8 :59750-59759

[9]

Fengxi Song, 2010, Proceedings of the 2010 International Conference on System Science, Engineering Design and Manufacturing Informatization (ICSEM 2010), P27, DOI 10.1109/ICSEM.2010.14

[10]

Glocker B, 2013, INT SYM MIX AUGMENT, P173, DOI 10.1109/ISMAR.2013.6671777

← 1 2 3 →