Generalizable Sequential Camera Pose Learning Using Surf Enhanced 3D CNN

被引:0
|
作者
Elmoogy, Ahmed [1 ]
Dong, Xiaodai [1 ]
Lu, Tao [1 ]
Westendorp, Robert [2 ]
Reddy, Kishore [2 ]
机构
[1] Univ Victoria, Elect & Comp Engn, Victoria, BC, Canada
[2] Fortinet, Burnaby, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/VTC2020-Fall49728.2020.9348447
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image based localization is a key block of visual simultaneous localization and mapping (SLAM) system where image data is used to localize the camera relative to an arbitrary reference frame. Although finding the location from one image or between two images is well studied in the literature, few works study the problem of finding the pose of multiple images in videos of different frame lengths. Here, we propose two different architectures to address this problem, one using a combination of 2D convolutional neural network (CNN) and recurrent neural networks (RNN) and the other using 3D CNN. We demonstrate that 3D CNN is better for pose estimation problem than CNN-RNN by visualizing the learned features per layer of both architectures and the accuracy performance. Further, instead of using RGB images as input to the networks, we use SURF descriptors to reduce the image dimension of 480x640x3 by more than 48 folds, making the training time much faster and the learning model less complex. Both architectures show competitive performance in comparison to the state of the art on indoor localization dataset with the ability to generalize to test scenes that are completely different from the training scenes.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Learning camera viewpoint using CNN to improve 3D body pose estimation
    Ghezelghieh, Mona Fathollahi
    Kasturi, Rangachar
    Sarkar, Sudeep
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 685 - 693
  • [2] Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation
    Khaleghi, Leyla
    Marshall, Joshua
    Etemad, Ali
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 535 - 541
  • [3] VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data
    Su, Jiajun
    Wang, Chunyu
    Ma, Xiaoxuan
    Zeng, Wenjun
    Wang, Yizhou
    COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 55 - 71
  • [4] 3D Pose Estimation of Vehicles Using a Stereo Camera
    Barrois, Bjoern
    Hristova, Stela
    Woehler, Christian
    Kummert, Franz
    Hermes, Christoph
    2009 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1 AND 2, 2009, : 267 - 272
  • [5] 3D Palmprint Pose Estimation Using Stereo Camera
    Bingol, Ozkan
    Ekinci, Murat
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [6] Human Pose Estimation in Space and Time Using 3D CNN
    Grinciunaite, Agne
    Gudi, Amogh
    Tasli, Emrah
    den Uyl, Marten
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 32 - 39
  • [7] Automatically Smoothing Camera Pose Using Cross Validation for Sequential Vision-Based 3D Mapping
    Farenzena, M.
    Bartoli, A.
    Mezouar, Y.
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 3616 - 3621
  • [8] Camera Pose Optimization for 3D Mapping
    Lluvia, Iker
    Lazkano, Elena
    Ansuategi, Ander
    IEEE ACCESS, 2023, 11 : 9122 - 9135
  • [9] 3D Camera Pose History Visualization
    Machuca, Mayra Donaji Barrera
    Stuerzlinger, Wolfgang
    SUI'16: PROCEEDINGS OF THE 2016 SYMPOSIUM ON SPATIAL USER INTERACTION, 2016, : 183 - 183
  • [10] Research on 3D Human Pose Estimation Using RGBD Camera
    Tang, Hui
    Wang, Qing
    Chen, Hong
    PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 538 - 541