Generalizable Sequential Camera Pose Learning Using Surf Enhanced 3D CNN

被引:0
|
作者
Elmoogy, Ahmed [1 ]
Dong, Xiaodai [1 ]
Lu, Tao [1 ]
Westendorp, Robert [2 ]
Reddy, Kishore [2 ]
机构
[1] Univ Victoria, Elect & Comp Engn, Victoria, BC, Canada
[2] Fortinet, Burnaby, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/VTC2020-Fall49728.2020.9348447
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image based localization is a key block of visual simultaneous localization and mapping (SLAM) system where image data is used to localize the camera relative to an arbitrary reference frame. Although finding the location from one image or between two images is well studied in the literature, few works study the problem of finding the pose of multiple images in videos of different frame lengths. Here, we propose two different architectures to address this problem, one using a combination of 2D convolutional neural network (CNN) and recurrent neural networks (RNN) and the other using 3D CNN. We demonstrate that 3D CNN is better for pose estimation problem than CNN-RNN by visualizing the learned features per layer of both architectures and the accuracy performance. Further, instead of using RGB images as input to the networks, we use SURF descriptors to reduce the image dimension of 480x640x3 by more than 48 folds, making the training time much faster and the learning model less complex. Both architectures show competitive performance in comparison to the state of the art on indoor localization dataset with the ability to generalize to test scenes that are completely different from the training scenes.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] 3D Hand Pose Estimation with a Single Infrared Camera via Domain Transfer Learning
    Park, Gabyong
    Kim, Tae-Kyun
    Woo, Woontack
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR 2020), 2020, : 588 - 599
  • [32] Reducing the device complexity for 3D human pose estimation: A deep learning approach using monocular camera and IMUs
    Zhao, Changyu
    Uchitomi, Hirotaka
    Ogata, Taiki
    Ming, Xianwen
    Miyake, Yoshihiro
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [33] PTZ Camera Pose Estimation by Tracking a 3D Target
    Hrabar, Stefan
    Corke, Peter
    Hilsenstein, Volker
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [34] Recovering Accurate 3D Human Pose in the Wild Using IMUs and a Moving Camera
    von Marcard, Timo
    Henschel, Roberto
    Black, Michael J.
    Rosenhahn, Bodo
    Pons-Moll, Gerard
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 614 - 631
  • [35] 3D Object Pose Estimation Using Viewpoint Generative Learning
    Thachasongtham, Dissaphong
    Yoshida, Takumi
    de Sorbier, Francois
    Saito, Hideo
    IMAGE ANALYSIS, SCIA 2013: 18TH SCANDINAVIAN CONFERENCE, 2013, 7944 : 512 - 521
  • [36] 3D Hand Pose Estimation Using Kinect Enhanced with Recovery Method
    Thanyadit, Santawat
    Madarasami, Suthep
    FIFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2013), 2013, 8878
  • [37] ESD-Pose: Enhanced Semantic Discrimination for Generalizable 6D Pose Estimation
    Deng, Xingyuan
    Wang, Kangru
    Wang, Lei
    Zhu, Dongchen
    Li, Jiamao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VI, 2025, 15036 : 371 - 386
  • [38] 3D Pose Tracking Using a Recovered 3D Model
    Chen, Wei
    Zhao, Yuelong
    Chen, Shu
    Ouyang, Jianquan
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2018, 27 (04)
  • [39] Efficient 3D Reconstruction of NeRF using Camera Pose Interpolation and Photometric Bundle Adjustment
    Takeda, Tsukasa
    Yamaguchi, Shugo
    Sato, Kazuhito
    Fukazawa, Kosuke
    Morishima, Shigeo
    PROCEEDINGS OF SIGGRAPH 2023 POSTERS, SIGGRAPH 2023, 2023,
  • [40] Alignment of Deep Features in 3D Models for Camera Pose Estimation
    Su, Jui-Yuan
    Cheng, Shyi-Chyi
    Chang, Chin-Chun
    Hsieh, Jun-Wei
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 440 - 452