APAC-Net: Unsupervised Learning of Depth and Ego-Motion from Monocular Video

Authors
Lin, Rui [1]
Lu, Yao [1]
Lu, Guangming [1]
Affiliations
[1] Harbin Institute of Technology, Shenzhen, Shenzhen 518055, People's Republic of China
Keywords
Depth estimation; Ego-motion estimation; Attention mechanism
DOI
10.1007/978-3-030-36189-1_28
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel method, the Attention-Pixel and Attention-Channel Network (APAC-Net), for unsupervised learning of scene depth and ego-motion from monocular video. Our model uses only monocular image sequences and needs no additional sensor information, such as IMU or GPS, for supervision. APAC-Net employs the attention mechanism to improve the networks' efficiency. Specifically, three attention modules are proposed to adjust feature weights during training. Moreover, to minimize the effect of noise produced during reconstruction, an image-reconstruction loss based on PSNR, L_PSNR, is used to evaluate reconstruction quality. In addition, to address failed depth estimation for objects close to the camera, a temporal-consistency loss L_Temp between adjacent frames and a scale-based loss L_Scale across different scales are proposed. Experimental results show that APAC-Net performs well on both the depth and ego-motion tasks, and that it even outperforms the compared methods on several metrics on KITTI and Cityscapes.
Pages: 336-348
Page count: 13
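
The abstract mentions an image-reconstruction loss based on PSNR but this record does not give its formula. As a rough illustration only, the sketch below computes the standard peak signal-to-noise ratio between a target frame and its reconstruction and negates it so that better reconstructions give a lower value. The names `psnr` and `psnr_loss`, the flat-list pixel representation, and the negation are assumptions made for this example, not the paper's definition of L_PSNR.

```python
import math


def psnr(target, reconstructed, max_value=1.0):
    """Standard PSNR in dB between two equally sized sequences of pixel values."""
    if len(target) != len(reconstructed) or not target:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((t - r) ** 2 for t, r in zip(target, reconstructed)) / len(target)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


def psnr_loss(target, reconstructed, max_value=1.0):
    """Hypothetical loss: negative PSNR, so higher quality means lower loss."""
    return -psnr(target, reconstructed, max_value)
```

In a training setting the same quantity would normally be computed on tensors by an automatic-differentiation framework, and identical inputs would need special handling because the value is unbounded there.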