Adaptive Multi-Modality Residual Network for Compression Distorted Multi-View Depth Video Enhancement

被引：8

作者：

Chen, Siqi ^{[1
,2
]}

Liu, Qiong ^{[1
,2
]}

Yang, You ^{[1
,2
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

[2] Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Color; Distortion; Image coding; Correlation; Adaptive systems; Image color analysis; Three-dimensional displays; Compression distortion; depth map; quality enhancement; residual network;

D O I：

10.1109/ACCESS.2020.2996258

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Compression distorted multi-view video plus depth (MVD) should be enhanced at the receiver side without the original signals, especially the depth maps because they describe the positioning information in 3D space and they are important for subsequent virtual view synthesis. However, challenge arises from how to exploit the contribution from multi-modality priors from neighboring viewpoints, and how to handle the gradient vanishing when textureless depth maps are involved. In this paper, we propose a multi-modality residual network to enhance the quality of compressed multi-view depth video. Taking advantage from high correlation among different viewpoints, depth maps from adjacent views are exploited as guidance for the enhancement of depth video in target view. Color frames in target view are also involved to offer the information object contours, obtaining multi-modality guidance. The proposed network is organized a deep residual network to well eliminate distortion and restore details. Because above multi-modality guidance have different correlations with target depth video and not all information can contribute to the enhancement, an adaptive skip structure is designed to further exploit the contribution from different priors appropriately. Experimental results show that our scheme outperforms other benchmarks and achieves an average 1.935 dB and 0.0227 gains on PSNR and SSIM over all test sequences, respectively. All results on objective, subjective and 3D reconstruction suggest that our method is able to provide superiority performance in practical applications.

引用

页码：97072 / 97081

页数：10

共 31 条

[1]

[Anonymous], 2008, P WORKSH MULT MULT S

[2]

[Anonymous], 2018, P EUR C COMP VIS ECC

[3]

[Anonymous], 2015, Tiny ImageNet Visual Recognition Challenge., DOI DOI 10.1109/ICCV.2015.123

[4]

[Anonymous], 2014, Doc. JCT3V-G1100

[5] A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding [J].

Dai, Yuanying ;

Liu, Dong ;

Wu, Feng .

MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :28-39

[6] Compression Artifacts Reduction by a Deep Convolutional Network [J].

Dong, Chao ;

Deng, Yubin ;

Loy, Chen Change ;

Tang, Xiaoou .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :576-584

[7] Image Super-Resolution Using Deep Convolutional Networks [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307

[8] Denoising Prior Driven Deep Neural Network for Image Restoration [J].

Dong, Weisheng ;

Wang, Peiyao ;

Yin, Wotao ;

Shi, Guangming ;

Wu, Fangfang ;

Lu, Xiaotong .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (10) :2305-2318

[9]

Fehn Christoph, 2007, 3DTV Conference, 2007, P1

[10] Perceptually Driven Nonuniform Asymmetric Coding of Stereoscopic 3D Video [J].

Fezza, Sid Ahmed ;

Larabi, Mohamed-Chaker .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (10) :2231-2245

← 1 2 3 4 →