Improved Bilinear Pooling for Real-Time Pose Event Camera Relocalisation

被引：0

作者：

Tabia, Ahmed ^{[1
]}

Bonardi, Fabien ^{[1
]}

Bouchafa, Samia ^{[1
]}

机构：

[1] Univ Paris Saclay, Univ Evry, IBISC, F-91025 Evry, France

来源：

IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I | 2023年 / 14233卷

关键词：

6-DOF; Deep Learning; Event-based Camera; Pose Estimation;

D O I：

10.1007/978-3-031-43148-7_19

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Traditional methods for estimating camera pose have been replaced by more advanced camera relocalization methods that utilize both CNNs and LSTMs in the field of simultaneous localization and mapping. However, the reliance on LSTM layers in these methods can lead to overfitting and slow convergence. In this paper, a novel approach for estimating the six degree of freedom (6DOF) pose of an event camera using deep learning is presented. Our method begins by preprocessing the events captured by the event camera to generate a set of images. These images are then passed through two CNNs to extract relevant features. These features are multiplied using an outer product and aggregated across different regions of the image after adding L2 normalization to normalize the combining vector. The final step of the model is a regression layer that predicts the position and orientation of the event camera. The effectiveness of this approach has been tested on various datasets, and the results demonstrate its superiority compared to existing state-of-the-art methods.

引用

页码：222 / 231

页数：10

共 20 条

[1] Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks [J].

Anh Nguyen ;

Thanh-Toan Do ;

Caldwell, Darwin G. ;

Tsagarakis, Nikos G. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1638-1645

[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[3]

Clevert DA, 2016, Arxiv, DOI [arXiv:1511.07289, 10.48550/arXiv.1511.07289]

[4]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[5]

Eitel A, 2015, IEEE INT C INT ROBOT, P681, DOI 10.1109/IROS.2015.7353446

[6] Accurate Angular Velocity Estimation With an Event Camera [J].

Gallego, Guillermo ;

Scaramuzza, Davide .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02) :632-639

[7]

Kendall A, 2016, IEEE INT CONF ROBOT, P4762, DOI 10.1109/ICRA.2016.7487679

[8] PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization [J].

Kendall, Alex ;

Grimes, Matthew ;

Cipolla, Roberto .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2938-2946

[9]

King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001

[10] EPnP: An Accurate O(n) Solution to the PnP Problem [J].

Lepetit, Vincent ;

Moreno-Noguer, Francesc ;

Fua, Pascal .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 81 (02) :155-166

← 1 2 →