Joint Soft-Hard Attention for Self-Supervised Monocular Depth Estimation

Cited by: 5
Authors
Fan, Chao [1, 2, 3]
Yin, Zhenyu [2, 3]
Xu, Fulong [1, 2, 3]
Chai, Anying [1, 2, 3]
Zhang, Feiqing [1, 2, 3]
Affiliations
[1] University of Chinese Academy of Sciences, Beijing 100049, People's Republic of China
[2] Chinese Academy of Sciences, Shenyang Institute of Computing Technology, Shenyang 110168, People's Republic of China
[3] Liaoning Key Laboratory of Domestic Industrial Control Platform Technology, Shenyang 110168, People's Republic of China
Keywords
monocular depth estimation; self-supervised learning; attention mechanism; vision
DOI
10.3390/s21216956
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
In recent years, self-supervised monocular depth estimation has gained popularity among researchers because it requires only a single camera, which costs far less than acquiring depth directly with laser sensors. Although monocular self-supervised methods can produce dense depth maps, their accuracy must improve further before they are well suited to applications such as autonomous driving and robot perception. In this paper, we combine soft attention and hard attention through two new ideas to improve self-supervised monocular depth estimation: (1) a soft attention module and (2) a hard attention strategy. We integrate the soft attention module into the model architecture to enhance feature extraction in both the spatial and channel dimensions while adding only a small number of parameters. Unlike traditional fusion approaches, we use the hard attention strategy to enhance the fusion of the generated multi-scale depth predictions. Experiments demonstrate that our method achieves the best self-supervised performance on both the standard KITTI benchmark and the Make3D dataset.
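The abstract names the two components but gives no implementation details, so the following is only an illustrative sketch of the general ideas, not the authors' method. It assumes that soft attention means sigmoid gates applied over channels and then over pixels, and that hard attention means selecting, for each pixel, exactly one of the multi-scale depth candidates according to some per-pixel score. The function names, the weight shapes, and the scoring input are all hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def soft_attention(feat, w_channel, w_spatial):
    """Re-weight a (C, H, W) feature map with soft gates in (0, 1).

    Assumption: a channel gate followed by a spatial gate; the paper's
    actual module may be structured differently.
    """
    # Channel gate: global average pool, linear map, sigmoid -> (C,)
    pooled = feat.mean(axis=(1, 2))
    channel_gate = sigmoid(w_channel @ pooled)
    feat = feat * channel_gate[:, None, None]

    # Spatial gate: weighted sum over channels at each pixel, sigmoid -> (H, W)
    spatial_gate = sigmoid(np.einsum("c,chw->hw", w_spatial, feat))
    return feat * spatial_gate[None, :, :]


def hard_attention_fuse(depths, scores):
    """Fuse (S, H, W) depth candidates by a discrete per-pixel choice.

    Assumption: `scores` is any (S, H, W) per-pixel quality measure where
    higher is better; how the paper ranks scales is not stated in the abstract.
    """
    best_scale = scores.argmax(axis=0)  # (H, W) index of the chosen scale
    return np.take_along_axis(depths, best_scale[None], axis=0)[0]
```

The contrast the sketch is meant to show is that the soft gates scale every feature by a continuous weight, whereas the hard fusion copies each output pixel from exactly one scale rather than averaging across scales.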
Pages: 17