Generalizable Sequential Camera Pose Learning Using Surf Enhanced 3D CNN

Cited by: 0
Authors
Elmoogy, Ahmed [1 ]
Dong, Xiaodai [1 ]
Lu, Tao [1 ]
Westendorp, Robert [2 ]
Reddy, Kishore [2 ]
Affiliations
[1] Univ Victoria, Elect & Comp Engn, Victoria, BC, Canada
[2] Fortinet, Burnaby, BC, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
DOI
10.1109/VTC2020-Fall49728.2020.9348447
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Image-based localization is a key building block of a visual simultaneous localization and mapping (SLAM) system, in which image data are used to localize the camera relative to an arbitrary reference frame. Although estimating the location from one image or between two images is well studied in the literature, few works study the problem of estimating the pose of multiple images in videos of different frame lengths. Here, we propose two architectures to address this problem: one combining a 2D convolutional neural network (CNN) with recurrent neural networks (RNN), and the other using a 3D CNN. We demonstrate that the 3D CNN is better suited to the pose estimation problem than the CNN-RNN by visualizing the features learned at each layer of both architectures and by comparing their accuracy. Further, instead of using RGB images as input to the networks, we use SURF descriptors, reducing the 480x640x3 input dimension by more than 48-fold, which makes training much faster and the learned model less complex. Both architectures show competitive performance in comparison with the state of the art on an indoor localization dataset, with the ability to generalize to test scenes that are completely different from the training scenes.
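As a rough check on the input-size claim in the abstract, the arithmetic can be sketched as follows. The 64-value descriptor length is an assumption (the standard, non-extended SURF descriptor); the abstract does not state the descriptor length or how many descriptors are kept per frame, so the last figure is only an upper bound under that assumption.

```python
# Raw RGB input size stated in the abstract: 480 x 640 pixels, 3 channels.
raw_values = 480 * 640 * 3

# A reduction of "more than 48-fold" means the SURF-based input
# holds fewer than raw_values / 48 numbers per frame.
max_reduced_values = raw_values // 48

# Assumption (not stated in the abstract): 64-value SURF descriptors,
# the standard non-extended descriptor length.
DESCRIPTOR_DIM = 64
max_descriptors = max_reduced_values // DESCRIPTOR_DIM

print(raw_values, max_reduced_values, max_descriptors)
```

Under these assumptions, a frame would be represented by fewer than 19,200 values in place of 921,600, which corresponds to at most about 300 descriptors of 64 values each.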
Pages: 6
Related papers
50 in total
  • [21] 3D Pose Estimation and Tracking in Handball Actions Using a Monocular Camera
    Sajina, Romeo
    Ivasic-Kos, Marina
    JOURNAL OF IMAGING, 2022, 8 (11)
  • [22] 3D Head Pose and Facial Expression Tracking using a Single Camera
    Terissi, Lucas D.
    Gomez, Juan C.
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2010, 16 (06) : 903 - 920
  • [23] 3D Hand Pose Estimation Using a Single Camera for Unspecified Users
    Hoshino, Kiyoshi
    Tomida, Motomasa
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2009, 21 (06) : 749 - 757
  • [24] 3D Head Pose Estimation in Monocular Video Sequences by Sequential Camera Self-Calibration
    Marras, Ioannis
    Nikolaidis, Nikos
    Pitas, Ioannis
    2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 68 - 73
  • [25] A generalizable approach for multi-view 3D human pose regression
    Abdolrahim Kadkhodamohammadi
    Nicolas Padoy
    Machine Vision and Applications, 2021, 32
  • [26] A generalizable approach for multi-view 3D human pose regression
    Kadkhodamohammadi, Abdolrahim
    Padoy, Nicolas
    MACHINE VISION AND APPLICATIONS, 2020, 32 (01)
  • [27] Joint Camera Pose Estimation and 3D Human Pose Estimation in a Multi-camera Setup
    Puwein, Jens
    Ballan, Luca
    Ziegler, Remo
    Pollefeys, Marc
    COMPUTER VISION - ACCV 2014, PT II, 2015, 9004 : 473 - 487
  • [28] Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision
    Mehta, Dushyant
    Rhodin, Helge
    Casas, Dan
    Fua, Pascal
    Sotnychenko, Oleksandr
    Xu, Weipeng
    Theobalt, Christian
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 506 - 516
  • [29] 3D Camera Pose Estimation Using Line Correspondences and 1D Homographies
    Reisner-Kollmann, Irene
    Reichinger, Andreas
    Purgathofer, Werner
    ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 41 - +
  • [30] Head Pose Free 3D Gaze Estimation Using RGB-D Camera
    Kacete, Amine
    Seguier, Renaud
    Collobert, Michel
    Royan, Jerome
    EIGHTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2016), 2017, 10225