Geometry-Aware Network for Unsupervised Learning of Monocular Camera's Ego-Motion

被引:2
|
作者
Zhou, Beibei [1 ,2 ]
Xie, Jin [1 ,2 ]
Jin, Zhong [1 ,2 ]
Kong, Hui [3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens Inf, Nanjing 210094, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Laboratoryof Image & Video Understandi, Nanjing 210094, Jiangsu, Peoples R China
[3] Univ Macau, Dept Electromech Engn EME, State Key Lab Internet Things Smart City SKL IOTSC, Macau, Peoples R China
关键词
Index Terms-Monocular visual odometry; geometry-aware; point clouds; visual appearance; 6-DoF poses; VISUAL ODOMETRY; DEPTH;
D O I
10.1109/TITS.2023.3298715
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Deep neural networks have been shown to be effective for unsupervised monocular visual odometry that can predict the camera's ego-motion based on an input of monocular video sequence. However, most existing unsupervised monocular methods haven't fully exploited the extracted information from both local geometric structure and visual appearance of the scenes, resulting in degraded performance. In this paper, a novel geometry-aware network is proposed to predict the camera's ego-motion by learning representations in both 2D and 3D space. First, to extract geometry-aware features, we design an RGB-PointCloud feature fusion module to capture information from both geometric structure and the visual appearance of the scenes by fusing local geometric features from depth-map-derived point clouds and visual features from RGB images. Furthermore, the fusion module can adaptively allocate different weights to the two types of features to emphasize important regions. Then, we devise a relevant feature filtering module to build consistency between the two views and preserve informative features with high relevance. It can capture the correlation of frame pairs in the feature-embedding space by attention mechanisms. Finally, the obtained features are fed into the pose estimator to recover the 6-DoF poses of the camera. Extensive experiments show that our method achieves promising results among the unsupervised monocular deep learning methods on the KITTI odometry and TUM-RGBD datasets.
引用
收藏
页码:14226 / 14236
页数:11
相关论文
共 50 条
  • [21] Simultaneous estimation of ego-motion and vehicle distance by using a monocular camera
    YANG DongFang
    SUN FuChun
    WANG ShiCheng
    ZHANG JinSheng
    ScienceChina(InformationSciences), 2014, 57 (05) : 272 - 281
  • [22] Simultaneous estimation of ego-motion and vehicle distance by using a monocular camera
    DongFang Yang
    FuChun Sun
    ShiCheng Wang
    JinSheng Zhang
    Science China Information Sciences, 2014, 57 : 1 - 10
  • [23] Unsupervised Learning of Depth and Ego-Motion from Video
    Zhou, Tinghui
    Brown, Matthew
    Snavely, Noah
    Lowe, David G.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6612 - +
  • [24] Simultaneous estimation of ego-motion and vehicle distance by using a monocular camera
    Yang DongFang
    Sun FuChun
    Wang ShiCheng
    Zhang JinSheng
    SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (05) : 1 - 10
  • [25] Geometry-Aware Learning of Maps for Camera Localization
    Brahmbhatt, Samarth
    Gu, Jinwei
    Kim, Kihwan
    Hays, James
    Kautz, Jan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2616 - 2625
  • [26] PoseConvGRU: A Monocular Approach for Visual Ego-motion Estimation by Learning
    Zhai, Guangyao
    Liu, Liang
    Zhang, Linjian
    Liu, Yong
    Jiang, Yunliang
    PATTERN RECOGNITION, 2020, 102
  • [27] A geometry-aware deep network for depth estimation in monocular endoscopy
    Yang, Yongming
    Shao, Shuwei
    Yang, Tao
    Wang, Peng
    Yang, Zhuo
    Wu, Chengdong
    Liu, Hao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [28] UnDEMoN: Unsupervised Deep Network for Depth and Ego-Motion Estimation
    Babu, Madhu, V
    Das, Kaushik
    Majumdar, Anima
    Kumar, Swagat
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1082 - 1088
  • [29] Vehicle ego-motion estimation and moving object detection using a monocular camera
    Yamaguchi, Koichiro
    Kato, Takeo
    Ninomiya, Yoshiki
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 610 - +
  • [30] Learning Monocular Visual Odometry through Geometry-Aware Curriculum Learning
    Saputra, Muhamad Risqi U.
    de Gusmao, Pedro P. B.
    Wang, Sen
    Markham, Andrew
    Trigoni, Niki
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 3549 - 3555