APAC-Net: Unsupervised Learning of Depth and Ego-Motion from Monocular Video

被引:0
|
作者
Lin, Rui [1 ]
Lu, Yao [1 ]
Lu, Guangming [1 ]
机构
[1] Harbin Inst Technol ShenZhen, Shenzhen 518055, Peoples R China
关键词
Depth estimation; Ego-motion estimation; Attention mechanism;
D O I
10.1007/978-3-030-36189-1_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an unsupervised novel method, Attention-Pixel and Attention-Channel Network (APAC-Net), for unsupervised monocular learning of estimating scene depth and ego-motion. Our model only utilizes monocular image sequences and does not need additional sensor information, such as IMU and GPS, for supervising. The attention mechanism is employed in APAC-Net to improve the networks' efficiency. Specifically, three attention modules are proposed to adjust feature weights when training. Moreover, to minimum the effect of noise, which is produced in the reconstruction processing, the Image-reconstruction loss based on PSNR LPSNR is used to evaluation the reconstruction quality. In addition, due to the fail depth estimation of the objects closed to camera, the Temporal-consistency loss LTemp between adjacent frames and the Scale-based loss LScale among different scales are proposed. Experimental results showed APAC-Net can perform well in both the depth and ego-motion tasks, and it even behaved better in several items on KITTI and Cityscapes.
引用
收藏
页码:336 / 348
页数:13
相关论文
共 50 条
  • [11] Video Demo: Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video
    Sharma, Alisha
    Ventura, Jonathan
    2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR), 2019, : 255 - 256
  • [12] Improving Unsupervised Learning of Monocular Depth and Ego-Motion via Stereo Network
    He, Mu
    Xie, Jin
    Yang, Jian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 421 - 433
  • [13] DiPE: Deeper into Photometric Errors for Unsupervised Learning of Depth and Ego-motion from Monocular Videos
    Jiang, Hualie
    Ding, Laiyan
    Sun, Zhenglong
    Huang, Rui
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10061 - 10067
  • [14] Unsupervised Learning of Monocular Depth and Ego-Motion with Optical Flow Features and Multiple Constraints
    Zhao, Baigan
    Huang, Yingping
    Ci, Wenyan
    Hu, Xing
    SENSORS, 2022, 22 (04)
  • [15] Unsupervised learning of monocular depth and ego-motion with space–temporal-centroid loss
    Junning Zhang
    Qunxing Su
    Pengyuan Liu
    Chao Xu
    Yanlong Chen
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 615 - 627
  • [16] Unsupervised learning of monocular depth and ego-motion with space-temporal-centroid loss
    Zhang, Junning
    Su, Qunxing
    Liu, Pengyuan
    Xu, Chao
    Chen, Yanlong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 615 - 627
  • [17] Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video with Applications for Virtual Reality
    Sharma, Alisha
    Nett, Ryan
    Ventura, Jonathan
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2020, 14 (03) : 333 - 356
  • [18] Joint self-supervised learning of interest point, descriptor, depth, and ego-motion from monocular video
    Wang, Zhongyi
    Shen, Mengjiao
    Chen, Qijun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (32) : 77529 - 77547
  • [19] Depth Estimation with Ego-Motion Assisted Monocular Camera
    Mansour M.
    Davidson P.
    Stepanov O.
    Raunio J.-P.
    Aref M.M.
    Piché R.
    Gyroscopy Navig., 3 (111-123): : 111 - 123
  • [20] Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images
    Yang, Delong
    Luo, Zhaohui
    Shang, Peng
    Hu, Zhigang
    2021 9TH INTERNATIONAL CONFERENCE ON TRAFFIC AND LOGISTIC ENGINEERING (ICTLE), 2021, : 51 - 56