Deep Learning-Based Perceptual Video Quality Enhancement for 3D Synthesized View

Cited by: 15

Authors:

Zhang, Huan [1,2]

Zhang, Yun [2]

Zhu, Linwei [2]

Lin, Weisi [3]

Affiliations:

[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022 / Vol. 32 / No. 08

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

Noise reduction; Three-dimensional displays; Distortion; Image denoising; Convolutional neural networks; Solid modeling; Rendering (computer graphics); View synthesis; perceptual quality enhancement; convolutional neural network; temporal flicker distortion; 3D synthesized video; IMAGE; SPARSE; COMPRESSION; DIBR;

DOI:

10.1109/TCSVT.2022.3147788

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Due to occlusion among views and temporal inconsistency in depth video, spatio-temporal distortion occurs in 3D synthesized video with depth image-based rendering. In this paper, we propose a deep Convolutional Neural Network (CNN)-based synthesized video denoising algorithm to reduce temporal flicker distortion and improve perceptual quality of 3D synthesized video. First, we analyze the spatio-temporal distortion, and model eliminating spatio-temporal distortion as a perceptual video denoising problem. Then, a deep learning-based synthesized video denoising network is proposed, in which a CNN-friendly spatio-temporal loss function is derived from a synthesized video quality metric and integrated with a single image denoising network architecture. Finally, specific schemes, i.e., specific Synthesized Video Denoising Networks (SynVD-Nets), and a general scheme, i.e., General SynVD-Net (GSynVD-Net), based on existing CNN-based denoising models, are developed to handle synthesized video with different distortion levels more effectively. Experimental results show that the proposed SynVD-Net and GSynVD-Net can outperform deep learning-based counterparts and conventional denoising methods, and significantly enhance perceptual quality of 3D synthesized video.


Pages: 5080-5094

Page count: 15
