Pyramid frequency network with spatial attention residual refinement module for monocular depth estimation

被引:13
|
作者
Lu, Zhengyang [1 ]
Chen, Ying [1 ]
机构
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
monocular depth estimation; three-dimensional reconstruction; frequency domain; convolutional neural network;
D O I
10.1117/1.JEI.31.2.023005
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep-learning-based approaches to depth estimation are rapidly advancing, offering superior performance over existing methods. To estimate the depth in real-world scenarios, depth estimation models require the robustness of various noise environments. We propose a pyramid frequency network (PFN) with spatial attention residual refinement module (SARRM) to deal with the weak robustness of existing deep-learning methods. To reconstruct depth maps with accurate details, the SARRM constructs a residual fusion method with an attention mechanism to refine the blur depth. The frequency division strategy is designed, and the frequency pyramid network is developed to extract features from multiple frequency bands. With the frequency strategy, PFN achieves better visual accuracy than state-of-the-art methods in both indoor and outdoor scenes on Make3D, KITTI depth, and NYUv2 datasets. Additional experiments on the noisy NYUv2 dataset demonstrate that PFN is more reliable than existing deep-learning methods in high noise scenes. (C) 2022 SPIE and IS&T
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Attention-based context aggregation network for monocular depth estimation
    Yuru Chen
    Haitao Zhao
    Zhengwei Hu
    Jingchao Peng
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1583 - 1596
  • [22] Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement
    Wu, Jipeng
    Ji, Rongrong
    Wang, Qiang
    Zhang, Shengchuan
    Sun, Xiaoshuai
    Wang, Yan
    Xu, Mingliang
    Huang, Feiyue
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1204 - 1216
  • [23] Erratum to: A Lightweight Network Based on Pyramid Residual Module for Human Pose Estimation
    Bingkun Gao
    Ke Ma
    Hongbo Bi
    Ling Wang
    Pattern Recognition and Image Analysis, 2020, 30 : 565 - 565
  • [24] An Extremely Effective Spatial Pyramid and Pixel Shuffle Upsampling Decoder for Multiscale Monocular Depth Estimation
    Luo, Huilan
    Chen, Yuan
    Zhou, Yifeng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [25] Self-Supervised Monocular Depth Estimation With Frequency-Based Recurrent Refinement
    Li, Rui
    Xue, Danna
    Zhu, Yu
    Wu, Hao
    Sun, Jinqiu
    Zhang, Yanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5626 - 5637
  • [26] ATTENTION-BASED SELF-SUPERVISED LEARNING MONOCULAR DEPTH ESTIMATION WITH EDGE REFINEMENT
    Jiang, Chenweinan
    Liu, Haichun
    Li, Lanzhen
    Pan, Changchun
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3218 - 3222
  • [27] Transfer2Depth: Dual Attention Network With Transfer Learning for Monocular Depth Estimation
    Yeh, Chia-Hung
    Huang, Yao-Pao
    Lin, Chih-Yang
    Chang, Chuan-Yu
    IEEE ACCESS, 2020, 8 : 86081 - 86090
  • [28] Monocular Depth Estimation with Adaptive Geometric Attention
    Naderi, Taher
    Sadovnik, Amir
    Hayward, Jason
    Qi, Hairong
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 617 - 627
  • [29] Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals
    Song, Minsoo
    Lim, Seokjae
    Kim, Wonjun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (11) : 4381 - 4393
  • [30] Depth-Relative Self Attention for Monocular Depth Estimation
    Shim, Kyuhong
    Kim, Jiyoung
    Lee, Gusang
    Shim, Byonghyo
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1396 - 1404