Joint Soft-Hard Attention for Self-Supervised Monocular Depth Estimation

Cited by: 5
Authors
Fan, Chao [1, 2, 3]
Yin, Zhenyu [2, 3]
Xu, Fulong [1, 2, 3]
Chai, Anying [1, 2, 3]
Zhang, Feiqing [1, 2, 3]
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Comp Technol, Shenyang 110168, Peoples R China
[3] Liaoning Key Lab Domest Ind Control Platform Tech, Shenyang 110168, Peoples R China
Keywords
monocular depth estimation; self-supervised learning; attention mechanism; VISION
DOI
10.3390/s21216956
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
In recent years, self-supervised monocular depth estimation has gained popularity among researchers because it requires only a single camera, at a much lower cost than acquiring depth directly with laser sensors. Although monocular self-supervised methods can obtain dense depth, their estimation accuracy needs further improvement before they can be applied well in scenarios such as autonomous driving and robot perception. In this paper, we combine soft attention and hard attention through two new ideas to improve self-supervised monocular depth estimation: (1) a soft attention module and (2) a hard attention strategy. We integrate the soft attention module into the model architecture to enhance feature extraction in both the spatial and channel dimensions, adding only a small number of parameters. Unlike traditional fusion approaches, we use the hard attention strategy to enhance the fusion of the generated multi-scale depth predictions. Experiments demonstrate that our method achieves the best self-supervised performance on both the standard KITTI benchmark and the Make3D dataset.
Pages: 17
Related papers
50 in total
  • [11] Monocular Depth Estimation via Self-Supervised Self-Distillation
    Hu, Haifeng
    Feng, Yuyang
    Li, Dapeng
    Zhang, Suofei
    Zhao, Haitao
    SENSORS, 2024, 24 (13)
  • [12] Self-supervised Monocular Depth Estimation on Unseen Synthetic Cameras
    Diana-Albelda, Cecilia
    Bravo Perez-Villar, Juan Ignacio
    Montalvo, Javier
    Garcia-Martin, Alvaro
    Bescos Cano, Jesus
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 449 - 463
  • [13] Self-Supervised Deep Monocular Depth Estimation With Ambiguity Boosting
    Bello, Juan Luis Gonzalez
    Kim, Munchurl
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9131 - 9149
  • [14] Constant Velocity Constraints for Self-Supervised Monocular Depth Estimation
    Zhou, Hang
    Greenwood, David
    Taylor, Sarah
    Gong, Han
    CVMP 2020: THE 17TH ACM SIGGRAPH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2020,
  • [15] Transferring knowledge from monocular completion for self-supervised monocular depth estimation
    Sun, Lin
    Li, Yi
    Liu, Bingzheng
    Xu, Liying
    Zhang, Zhe
    Zhu, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42485 - 42495
  • [17] RENA-Depth: toward recursion representation enhancement in neighborhood attention guided lightweight self-supervised monocular depth estimation
    Yang, Chaochao
    Lu, Yuanyao
    Qiu, Yongsheng
    Wang, Yuantao
    OPTICAL ENGINEERING, 2024, 63 (08)
  • [19] Self-supervised recurrent depth estimation with attention mechanisms
    Makarov, Ilya
    Bakhanova, Maria
    Nikolenko, Sergey
    Gerasimova, Olga
    PEERJ COMPUTER SCIENCE, 2022, 8