Attention Networks for the Quality Enhancement of Light Field Images

被引:2
作者
Schiopu, Ionut [1 ]
Munteanu, Adrian [1 ]
机构
[1] Vrije Univ Brussel VUB, Dept Elect & Informat ETRO, Pl Laan 2, B-1050 Brussels, Belgium
关键词
attention network; quality enhancement; light field images; video coding; DEPTH ESTIMATION;
D O I
10.3390/s21093246
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, we propose a novel filtering method based on deep attention networks for the quality enhancement of light field (LF) images captured by plenoptic cameras and compressed using the High Efficiency Video Coding (HEVC) standard. The proposed architecture was built using efficient complex processing blocks and novel attention-based residual blocks. The network takes advantage of the macro-pixel (MP) structure, specific to LF images, and processes each reconstructed MP in the luminance (Y) channel. The input patch is represented as a tensor that collects, from an MP neighbourhood, four Epipolar Plane Images (EPIs) at four different angles. The experimental results on a common LF image database showed high improvements over HEVC in terms of the structural similarity index (SSIM), with an average Y-Bjontegaard Delta (BD)-rate savings of 36.57%, and an average Y-BD-PSNR improvement of 2.301 dB. Increased performance was achieved when the HEVC built-in filtering methods were skipped. The visual results illustrate that the enhanced image contains sharper edges and more texture details. The ablation study provides two robust solutions to reduce the inference time by 44.6% and the network complexity by 74.7%. The results demonstrate the potential of attention networks for the quality enhancement of LF images encoded by HEVC.
引用
收藏
页数:19
相关论文
共 66 条
[1]  
[Anonymous], 2015, ACS SYM SER
[2]   Coding of Light Fields Using Disparity-Based Sparse Prediction [J].
Astola, Pekka ;
Tabus, Ioan .
IEEE ACCESS, 2019, 7 :176820-176837
[3]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[4]  
Bjontegaard Gisle, 2001, Calculation of average PSNR differ
[5]  
Bossen F., 2012, JCTVCJ1100, V12
[6]  
Cavigelli L, 2017, IEEE IJCNN, P752, DOI 10.1109/IJCNN.2017.7965927
[7]   Light field compression using disparity-compensated lifting and shape adaptation [J].
Chang, CL ;
Zhu, XQ ;
Ramanathan, P ;
Girod, B .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (04) :793-806
[8]   High-accuracy multi-camera reconstruction enhanced by adaptive point cloud correction algorithm [J].
Chen, Mingyou ;
Tang, Yunchao ;
Zou, Xiangjun ;
Huang, Kuangyu ;
Li, Lijuan ;
He, Yuxin .
OPTICS AND LASERS IN ENGINEERING, 2019, 122 :170-183
[9]  
Conti C, 2012, IEEE IMAGE PROC, P1325, DOI 10.1109/ICIP.2012.6467112
[10]   A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding [J].
Dai, Yuanying ;
Liu, Dong ;
Wu, Feng .
MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :28-39