Deep Learning-Based Perceptual Video Quality Enhancement for 3D Synthesized View

被引:15
作者
Zhang, Huan [1 ,2 ]
Zhang, Yun [2 ]
Zhu, Linwei [2 ]
Lin, Weisi [3 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Noise reduction; Three-dimensional displays; Distortion; Image denoising; Convolutional neural networks; Solid modeling; Rendering (computer graphics); View synthesis; perceptual quality enhancement; convolutional neural network; temporal flicker distortion; 3D synthesized video; IMAGE; SPARSE; COMPRESSION; DIBR;
D O I
10.1109/TCSVT.2022.3147788
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to occlusion among views and temporal inconsistency in depth video, spatio-temporal distortion occurs in 3D synthesized video with depth image-based rendering. In this paper, we propose a deep Convolutional Neural Network (CNN)-based synthesized video denoising algorithm to reduce temporal flicker distortion and improve perceptual quality of 3D synthesized video. First, we analyze the spatio-temporal distortion, and model eliminating spatio-temporal distortion as a perceptual video denoising problem. Then, a deep learning-based synthesized video denoising network is proposed, in which a CNN-friendly spatio-temporal loss function is derived from a synthesized video quality metric and integrated with a single image denoising network architecture. Finally, specific schemes, i.e., specific Synthesized Video Denoising Networks (SynVD-Nets), and a general scheme, i.e., General SynVD-Net (GSynVD-Net), based on existing CNN-based denoising models, are developed to handle synthesized video with different distortion levels more effectively. Experimental results show that the proposed SynVD-Net and GSynVD-Net can outperform deep learning-based counterparts and conventional denoising methods, and significantly enhance perceptual quality of 3D synthesized video.
引用
收藏
页码:5080 / 5094
页数:15
相关论文
共 55 条
[1]  
3D-ATM, 2023, Reference software for 3D-AVC: 3DV-ATM V10.0
[2]  
3D-HTM, 2023, Anchor software for 3D-HEVC experiments: 3DV-HTM V8.0
[3]  
[Anonymous], 2019, BM3DVBM4D CODE
[4]  
[Anonymous], INT C LEARNING REPRE
[5]  
[Anonymous], VSRS 1D FAST
[6]   MPEG Immersive Video Coding Standard [J].
Boyce, Jill M. ;
Dore, Renaud ;
Dziembowski, Adrian ;
Fleureau, Julien ;
Jung, Joel ;
Kroon, Bart ;
Salahieh, Basel ;
Vadakital, Vinod Kumar Malamal ;
Yu, Lu .
PROCEEDINGS OF THE IEEE, 2021, 109 (09) :1521-1536
[7]   A non-local algorithm for image denoising [J].
Buades, A ;
Coll, B ;
Morel, JM .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65
[8]   Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation [J].
Caballero, Jose ;
Ledig, Christian ;
Aitken, Andrew ;
Acosta, Alejandro ;
Totz, Johannes ;
Wang, Zehan ;
Shi, Wenzhe .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2848-2857
[9]   Deep RNNs for Video Denoising [J].
Chen, Xinyuan ;
Song, Li ;
Yang, Xiaokang .
APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXIX, 2016, 9971
[10]   ViDeNN: Deep Blind Video Denoising [J].
Claus, Michele ;
van Gemert, Jan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1843-1852