FPattNet: A Multi-Scale Feature Fusion Network with Occlusion Awareness for Depth Estimation of Light Field Images

被引:4
|
作者
Xiao, Min [1 ]
Lv, Chen [1 ]
Liu, Xiaomin [1 ]
机构
[1] Zhengzhou Univ, Sch Phys & Microelect, Zhengzhou 450001, Peoples R China
关键词
light field; depth estimation; deep learning; occlusion handling;
D O I
10.3390/s23177480
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A light field camera can capture light information from various directions within a scene, allowing for the reconstruction of the scene. The light field image inherently contains the depth information of the scene, and depth estimations of light field images have become a popular research topic. This paper proposes a depth estimation network of light field images with occlusion awareness. Since light field images contain many views from different viewpoints, identifying the combinations that contribute the most to the depth estimation of the center view is critical to improving the depth estimation accuracy. Current methods typically rely on a fixed set of views, such as vertical, horizontal, and diagonal, which may not be optimal for all scenes. To address this limitation, we propose a novel approach that considers all available views during depth estimation while leveraging an attention mechanism to assign weights to each view dynamically. By inputting all views into the network and employing the attention mechanism, we enable the model to adaptively determine the most informative views for each scene, thus achieving more accurate depth estimation. Furthermore, we introduce a multi-scale feature fusion strategy that amalgamates contextual information and expands the receptive field to enhance the network's performance in handling challenging scenarios, such as textureless and occluded regions.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Occlusion-aware depth estimation for light field using multi-orientation EPIs
    Sheng, Hao
    Zhao, Pan
    Zhang, Shuo
    Zhang, Jun
    Yang, Da
    PATTERN RECOGNITION, 2018, 74 : 587 - 599
  • [42] ACCURATE LIGHT FIELD DEPTH ESTIMATION VIA AN OCCLUSION-AWARE NETWORK
    Guo, Chunle
    Jin, Jing
    Hou, Junhui
    Chen, Jie
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [43] MLFFNet: multilevel feature fusion network for monocular depth estimation from aerial images
    Xu, Huihui
    Li, Fei
    Feng, Zhiquan
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (02)
  • [44] Liver segmentation network based on detail enhancement and multi-scale feature fusion
    Lu, Tinglan
    Qin, Jun
    Qin, Guihe
    Shi, Weili
    Zhang, Wentao
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [45] Point Cloud Semantic Segmentation Network Based on Multi-Scale Feature Fusion
    Du, Jing
    Jiang, Zuning
    Huang, Shangfeng
    Wang, Zongyue
    Su, Jinhe
    Su, Songjian
    Wu, Yundong
    Cai, Guorong
    SENSORS, 2021, 21 (05) : 1 - 20
  • [46] MAFBLiF: Multi-Scale Attention Feature Fusion-Based Blind Light Field Image Quality Assessment
    Zhou, Rui
    Jiang, Gangyi
    Cui, Yueli
    Chen, Yeyao
    Xu, Haiyong
    Luo, Ting
    Yu, Mei
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (04) : 1266 - 1278
  • [47] Swin-Depth: Using Transformers and Multi-Scale Fusion for Monocular-Based Depth Estimation
    Cheng, Zeyu
    Zhang, Yi
    Tang, Chengkai
    IEEE SENSORS JOURNAL, 2021, 21 (23) : 26912 - 26920
  • [48] Monocular Image Depth Estimation Based on Multi-Scale Attention Oriented Network
    Liu J.
    Wen J.
    Liang Y.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (12): : 52 - 62
  • [49] Multi-Scale Feature Fusion Convolutional Neural Network for Concurrent Segmentation of Left Ventricle and Myocardium in Cardiac MR Images
    Qi, Lin
    Zhang, Haoran
    Cao, Xuehao
    Lyu, Xuyang
    Xu, Lisheng
    Yang, Benqiang
    Ou, Yangming
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2020, 10 (05) : 1023 - 1032
  • [50] Attention-based multi-scale feature fusion network for myopia grading using optical coherence tomography images
    Huang, Gengyou
    Wen, Yang
    Qian, Bo
    Bi, Lei
    Chen, Tingli
    Sheng, Bin
    VISUAL COMPUTER, 2024, 40 (09): : 6627 - 6638