Geometry-Aware Network for Unsupervised Learning of Monocular Camera's Ego-Motion

被引:2
|
作者
Zhou, Beibei [1 ,2 ]
Xie, Jin [1 ,2 ]
Jin, Zhong [1 ,2 ]
Kong, Hui [3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens Inf, Nanjing 210094, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Laboratoryof Image & Video Understandi, Nanjing 210094, Jiangsu, Peoples R China
[3] Univ Macau, Dept Electromech Engn EME, State Key Lab Internet Things Smart City SKL IOTSC, Macau, Peoples R China
关键词
Index Terms-Monocular visual odometry; geometry-aware; point clouds; visual appearance; 6-DoF poses; VISUAL ODOMETRY; DEPTH;
D O I
10.1109/TITS.2023.3298715
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Deep neural networks have been shown to be effective for unsupervised monocular visual odometry that can predict the camera's ego-motion based on an input of monocular video sequence. However, most existing unsupervised monocular methods haven't fully exploited the extracted information from both local geometric structure and visual appearance of the scenes, resulting in degraded performance. In this paper, a novel geometry-aware network is proposed to predict the camera's ego-motion by learning representations in both 2D and 3D space. First, to extract geometry-aware features, we design an RGB-PointCloud feature fusion module to capture information from both geometric structure and the visual appearance of the scenes by fusing local geometric features from depth-map-derived point clouds and visual features from RGB images. Furthermore, the fusion module can adaptively allocate different weights to the two types of features to emphasize important regions. Then, we devise a relevant feature filtering module to build consistency between the two views and preserve informative features with high relevance. It can capture the correlation of frame pairs in the feature-embedding space by attention mechanisms. Finally, the obtained features are fed into the pose estimator to recover the 6-DoF poses of the camera. Extensive experiments show that our method achieves promising results among the unsupervised monocular deep learning methods on the KITTI odometry and TUM-RGBD datasets.
引用
收藏
页码:14226 / 14236
页数:11
相关论文
共 50 条
  • [41] Discriminative and Geometry-Aware Unsupervised Domain Adaptation
    Luo, Lingkun
    Chen, Liming
    Hu, Shiqiang
    Lu, Ying
    Wang, Xiaofang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3914 - 3927
  • [42] Unsupervised Geometry-Aware Deep LiDAR Odometry
    Cho, Younggun
    Kim, Giseop
    Kim, Ayoung
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2145 - 2152
  • [43] Rigid-aware self-supervised GAN for camera ego-motion estimation
    Lin, Lili
    Luo, Wan
    Yan, Zhengmao
    Zhou, Wenhui
    DIGITAL SIGNAL PROCESSING, 2022, 126
  • [44] UNSUPERVISED LEARNING OF DEPTH AND EGO-MOTION WITH SPATIAL-TEMPORAL GEOMETRIC CONSTRAINTS
    Wang, Anjie
    Gao, Yongbin
    Fang, Zhijun
    Jiang, Xiaoyan
    Wang, Shanshe
    Ma, Siwei
    Hwang, Jenq-Neng
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1798 - 1803
  • [45] Joint ego-motion and road geometry estimation
    Lundquist, Christian
    Schon, Thomas B.
    INFORMATION FUSION, 2011, 12 (04) : 253 - 263
  • [46] Recognizing Physical Activity from Ego-motion of A Camera
    Zhang, Hong
    Li, Lu
    Jia, Wenyan
    Fernstrom, John D.
    Sclabassi, Robert J.
    Sun, Mingui
    2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 5569 - 5572
  • [47] Regularised Energy Model for Robust Monocular Ego-motion Estimation
    Chien, Hsiang-Jen
    Klette, Reinhard
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 6, 2017, : 361 - 368
  • [48] Geometry-Aware Unsupervised Domain Adaptation for Stereo Matching
    Sakuma, Hiroki
    Konishi, Yoshinori
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 6116 - 6123
  • [49] A direct method on camera's ego-motion estimation using normal flows
    Liu, Miao
    Yuan, Ding
    MIPPR 2013: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2013, 8918
  • [50] Video Demo: Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video
    Sharma, Alisha
    Ventura, Jonathan
    2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR), 2019, : 255 - 256