Deep Learning-Based Perceptual Video Quality Enhancement for 3D Synthesized View

被引:15
作者
Zhang, Huan [1 ,2 ]
Zhang, Yun [2 ]
Zhu, Linwei [2 ]
Lin, Weisi [3 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Noise reduction; Three-dimensional displays; Distortion; Image denoising; Convolutional neural networks; Solid modeling; Rendering (computer graphics); View synthesis; perceptual quality enhancement; convolutional neural network; temporal flicker distortion; 3D synthesized video; IMAGE; SPARSE; COMPRESSION; DIBR;
D O I
10.1109/TCSVT.2022.3147788
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to occlusion among views and temporal inconsistency in depth video, spatio-temporal distortion occurs in 3D synthesized video with depth image-based rendering. In this paper, we propose a deep Convolutional Neural Network (CNN)-based synthesized video denoising algorithm to reduce temporal flicker distortion and improve perceptual quality of 3D synthesized video. First, we analyze the spatio-temporal distortion, and model eliminating spatio-temporal distortion as a perceptual video denoising problem. Then, a deep learning-based synthesized video denoising network is proposed, in which a CNN-friendly spatio-temporal loss function is derived from a synthesized video quality metric and integrated with a single image denoising network architecture. Finally, specific schemes, i.e., specific Synthesized Video Denoising Networks (SynVD-Nets), and a general scheme, i.e., General SynVD-Net (GSynVD-Net), based on existing CNN-based denoising models, are developed to handle synthesized video with different distortion levels more effectively. Experimental results show that the proposed SynVD-Net and GSynVD-Net can outperform deep learning-based counterparts and conventional denoising methods, and significantly enhance perceptual quality of 3D synthesized video.
引用
收藏
页码:5080 / 5094
页数:15
相关论文
共 55 条
[31]   TSAN: Synthesized View Quality Enhancement via Two-Stream Attention Network for 3D-HEVC [J].
Pan, Zhaoqing ;
Yu, Weijie ;
Lei, Jianjun ;
Ling, Nam ;
Kwong, Sam .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) :345-358
[32]   Image Sequence Denoising via Sparse and Redundant Representations [J].
Protter, Matan ;
Elad, Michael .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (01) :27-35
[33]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241
[34]   DIBR-synthesized image quality assessment based on morphological multi-scale approach [J].
Sandic-Stankovic, Dragana ;
Kukolj, Dragan ;
Le Callet, Patrick .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
[35]   Graph Laplacian Regularization With Sparse Coding for Image Restoration and Representation [J].
Sha, Lingdao ;
Schonfeld, Dan ;
Wang, Jing .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :2000-2014
[36]   Wavelet-Based Total Variation and Nonlocal Similarity Model for Image Denoising [J].
Shen, Yan ;
Liu, Qing ;
Lou, Shuqin ;
Hou, Ya-Li .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (06) :877-881
[37]   FastDVDnet: Towards Real-Time Deep Video Denoising Without Flow Estimation [J].
Tassano, Matias ;
Delon, Julie ;
Veit, Thomas .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1351-1360
[38]   New Hole-Filling Method Using Extrapolated Spatio-Temporal Background Information for a Synthesized Free-View [J].
Tien-Dat Nguyen ;
Kim, Beomsu ;
Hong, Min-Cheol .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (06) :1345-1358
[39]   Deep Image Prior [J].
Ulyanov, Dmitry ;
Vedaldi, Andrea ;
Lempitsky, Victor .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9446-9454
[40]   EDVR: Video Restoration with Enhanced Deformable Convolutional Networks [J].
Wang, Xintao ;
Chan, Kelvin C. K. ;
Yu, Ke ;
Dong, Chao ;
Loy, Chen Change .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1954-1963