FPattNet: A Multi-Scale Feature Fusion Network with Occlusion Awareness for Depth Estimation of Light Field Images

被引:4
|
作者
Xiao, Min [1 ]
Lv, Chen [1 ]
Liu, Xiaomin [1 ]
机构
[1] Zhengzhou Univ, Sch Phys & Microelect, Zhengzhou 450001, Peoples R China
关键词
light field; depth estimation; deep learning; occlusion handling;
D O I
10.3390/s23177480
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A light field camera can capture light information from various directions within a scene, allowing for the reconstruction of the scene. The light field image inherently contains the depth information of the scene, and depth estimations of light field images have become a popular research topic. This paper proposes a depth estimation network of light field images with occlusion awareness. Since light field images contain many views from different viewpoints, identifying the combinations that contribute the most to the depth estimation of the center view is critical to improving the depth estimation accuracy. Current methods typically rely on a fixed set of views, such as vertical, horizontal, and diagonal, which may not be optimal for all scenes. To address this limitation, we propose a novel approach that considers all available views during depth estimation while leveraging an attention mechanism to assign weights to each view dynamically. By inputting all views into the network and employing the attention mechanism, we enable the model to adaptively determine the most informative views for each scene, thus achieving more accurate depth estimation. Furthermore, we introduce a multi-scale feature fusion strategy that amalgamates contextual information and expands the receptive field to enhance the network's performance in handling challenging scenarios, such as textureless and occluded regions.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Multi-scale hierarchical feature fusion network for change detection
    Zheng, Hanhong
    Zhang, Mingyang
    Gong, Maoguo
    Qin, A. K.
    Liu, Tongfei
    Jiang, Fenlong
    PATTERN RECOGNITION, 2025, 161
  • [22] Siamese Network Tracker Based on Multi-Scale Feature Fusion
    Zhao, Jiaxu
    Niu, Dapeng
    SYSTEMS, 2023, 11 (08):
  • [23] MFANet: Multi-scale feature fusion network with attention mechanism
    Wang, Gaihua
    Gan, Xin
    Cao, Qingcheng
    Zhai, Qianyu
    VISUAL COMPUTER, 2023, 39 (07): : 2969 - 2980
  • [24] MFANet: Multi-scale feature fusion network with attention mechanism
    Gaihua Wang
    Xin Gan
    Qingcheng Cao
    Qianyu Zhai
    The Visual Computer, 2023, 39 : 2969 - 2980
  • [25] Component Identification and Depth Estimation for Structural Images Based on Multi-Scale Task Interaction Network
    Ye, Jianlong
    Yu, Hongchuan
    Liu, Gaoyang
    Zhou, Jiong
    Shu, Jiangpeng
    BUILDINGS, 2024, 14 (04)
  • [26] Multi-scale feature fusion based DOA and range estimation for near-field sources
    Liu, Ke
    Fu, Yanyan
    Ma, Junda
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [27] Unsupervised Light Field Depth Estimation via Multi-View Feature Matching With Occlusion Prediction
    Zhang, Shansi
    Meng, Nan
    Lam, Edmund Y.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2261 - 2273
  • [28] Light Field Depth from Multi-scale Particle Filtering
    Chen, Jie
    Chau, Lap-Pui
    Li, He
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [29] Light Field Depth Estimation based on Occlusion Optimization
    Zhang, Long
    Deng, Huiping
    Xiang, Sen
    Li, Shuang
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1635 - 1639
  • [30] Depth Estimation from Light Field Images via Convolutional Residual Network
    Mun, Ji-Hun
    Ho, Yo-Sung
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1495 - 1498