Pyramid frequency network with spatial attention residual refinement module for monocular depth estimation

被引:13
|
作者
Lu, Zhengyang [1 ]
Chen, Ying [1 ]
机构
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
monocular depth estimation; three-dimensional reconstruction; frequency domain; convolutional neural network;
D O I
10.1117/1.JEI.31.2.023005
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep-learning-based approaches to depth estimation are rapidly advancing, offering superior performance over existing methods. To estimate the depth in real-world scenarios, depth estimation models require the robustness of various noise environments. We propose a pyramid frequency network (PFN) with spatial attention residual refinement module (SARRM) to deal with the weak robustness of existing deep-learning methods. To reconstruct depth maps with accurate details, the SARRM constructs a residual fusion method with an attention mechanism to refine the blur depth. The frequency division strategy is designed, and the frequency pyramid network is developed to extract features from multiple frequency bands. With the frequency strategy, PFN achieves better visual accuracy than state-of-the-art methods in both indoor and outdoor scenes on Make3D, KITTI depth, and NYUv2 datasets. Additional experiments on the noisy NYUv2 dataset demonstrate that PFN is more reliable than existing deep-learning methods in high noise scenes. (C) 2022 SPIE and IS&T
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Triaxial Squeeze Attention Module and Mutual-Exclusion Loss Based Unsupervised Monocular Depth Estimation
    Jiansheng Wei
    Shuguo Pan
    Wang Gao
    Tao Zhao
    Neural Processing Letters, 2022, 54 : 4375 - 4390
  • [42] CATNet: Convolutional attention and transformer for monocular depth estimation
    Tang, Shuai
    Lu, Tongwei
    Liu, Xuanxuan
    Zhou, Huabing
    Zhang, Yanduo
    PATTERN RECOGNITION, 2024, 145
  • [43] Attention Mechanism Used in Monocular Depth Estimation: An Overview
    Li, Yundong
    Wei, Xiaokun
    Fan, Hanlu
    APPLIED SCIENCES-BASEL, 2023, 13 (17):
  • [44] Dual-Attention Mechanism for Monocular Depth Estimation
    Chiu, Chui-Hong
    Astuti, Lia
    Lin, Yu-Chen
    Hung, Ming-Ku
    2024 16TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING, ICCAE 2024, 2024, : 456 - 460
  • [45] Residual-Shuffle Network with Spatial Pyramid Pooling Module for COVID-19 Screening
    Zulkifley, Mohd Asyraf
    Abdani, Siti Raihanah
    Zulkifley, Nuraisyah Hani
    Shahrimin, Mohamad Ibrani
    DIAGNOSTICS, 2021, 11 (08)
  • [46] ADAPTIVE WEIGHTED NETWORK WITH EDGE ENHANCEMENT MODULE FOR MONOCULAR SELF-SUPERVISED DEPTH ESTIMATION
    Liu, Hong
    Zhu, Ying
    Hua, Guoliang
    Huang, Weibo
    Ding, Runwei
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2340 - 2344
  • [47] Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8811 - 8822
  • [48] Dynamic Guided Network for Monocular Depth Estimation
    Xing, Xiaoxia
    Cai, Yinghao
    Wang, Yanqing
    Lu, Tao
    Yang, Yiping
    Wen, Dayong
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5459 - 5465
  • [49] IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation
    Feng, Cheng
    Chen, Zhen
    Zhang, Congxuan
    Hu, Weiming
    Li, Bing
    Lu, Feng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 329 - 341
  • [50] DTTNet: Depth Transverse Transformer Network for Monocular Depth Estimation
    Kamath, Shreyas K. M.
    Rajeev, Srijith
    Panetta, Karen
    Agaian, Sos S.
    MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2022, 2022, 12100