Deep Learning-Based Perceptual Video Quality Enhancement for 3D Synthesized View

被引：15

作者：

Zhang, Huan ^{[1
,2
]}

Zhang, Yun ^{[2
]}

Zhu, Linwei ^{[2
]}

Lin, Weisi ^{[3
]}

机构：

[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022年 / 32卷 / 08期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Noise reduction; Three-dimensional displays; Distortion; Image denoising; Convolutional neural networks; Solid modeling; Rendering (computer graphics); View synthesis; perceptual quality enhancement; convolutional neural network; temporal flicker distortion; 3D synthesized video; IMAGE; SPARSE; COMPRESSION; DIBR;

D O I：

10.1109/TCSVT.2022.3147788

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Due to occlusion among views and temporal inconsistency in depth video, spatio-temporal distortion occurs in 3D synthesized video with depth image-based rendering. In this paper, we propose a deep Convolutional Neural Network (CNN)-based synthesized video denoising algorithm to reduce temporal flicker distortion and improve perceptual quality of 3D synthesized video. First, we analyze the spatio-temporal distortion, and model eliminating spatio-temporal distortion as a perceptual video denoising problem. Then, a deep learning-based synthesized video denoising network is proposed, in which a CNN-friendly spatio-temporal loss function is derived from a synthesized video quality metric and integrated with a single image denoising network architecture. Finally, specific schemes, i.e., specific Synthesized Video Denoising Networks (SynVD-Nets), and a general scheme, i.e., General SynVD-Net (GSynVD-Net), based on existing CNN-based denoising models, are developed to handle synthesized video with different distortion levels more effectively. Experimental results show that the proposed SynVD-Net and GSynVD-Net can outperform deep learning-based counterparts and conventional denoising methods, and significantly enhance perceptual quality of 3D synthesized video.

引用

页码：5080 / 5094

页数：15

共 55 条

[1]

3D-ATM, 2023, Reference software for 3D-AVC: 3DV-ATM V10.0

[2]

3D-HTM, 2023, Anchor software for 3D-HEVC experiments: 3DV-HTM V8.0

[3]

[Anonymous], 2019, BM3DVBM4D CODE

[4]

[Anonymous], INT C LEARNING REPRE

[5]

[Anonymous], VSRS 1D FAST

[6] MPEG Immersive Video Coding Standard [J].

Boyce, Jill M. ;

Dore, Renaud ;

Dziembowski, Adrian ;

Fleureau, Julien ;

Jung, Joel ;

Kroon, Bart ;

Salahieh, Basel ;

Vadakital, Vinod Kumar Malamal ;

Yu, Lu .

PROCEEDINGS OF THE IEEE, 2021, 109 (09) :1521-1536

[7] A non-local algorithm for image denoising [J].

Buades, A ;

Coll, B ;

Morel, JM .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65

[8] Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation [J].

Caballero, Jose ;

Ledig, Christian ;

Aitken, Andrew ;

Acosta, Alejandro ;

Totz, Johannes ;

Wang, Zehan ;

Shi, Wenzhe .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2848-2857

[9] Deep RNNs for Video Denoising [J].

Chen, Xinyuan ;

Song, Li ;

Yang, Xiaokang .

APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXIX, 2016, 9971

[10] ViDeNN: Deep Blind Video Denoising [J].

Claus, Michele ;

van Gemert, Jan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1843-1852

← 1 2 3 4 5 6 →