Reconstruction of Multi-view Video Based on GAN

被引：0

作者：

Li, Song ^{[1
]}

Lan, Chengdong ^{[1
]}

Zhao, Tiesong ^{[1
]}

机构：

[1] Fuzhou Univ, Sch Phys & Informat Engn, Fuzhou, Fujian, Peoples R China

来源：

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II | 2018年 / 11165卷

基金：

中国国家自然科学基金;

关键词：

Hybrid resolution; SRGAN; Virtual view reconstruction; EPI; Multi-view video;

D O I：

10.1007/978-3-030-00767-6_57

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

There is a huge amount of data in multi-view video which brings enormous challenges to the compression, storage, and transmission of video data. Transmitting part of the viewpoint information is a prior solution to reconstruct the original multi-viewpoint information. They are all based on pixel matching to obtain the correlation between adjacent viewpoint images. However, pixels cannot express the invariability of image features and are susceptible to noise. Therefore, in order to overcome the above problems, the VGG network is used to extract the high-dimensional features between the images, indicating the relevance of the adjacent images. The GAN is further used to more accurately generate virtual viewpoint images. We extract the lines at the same positions of the viewpoints as local areas for image merging and input the local images into the network. In the reconstruction viewpoint, we generate a local image of a dense viewpoint through the GAN network. Experiments on multiple test sequences show that the proposed method has a 0.2-0.8-dB PSNR and 0.15-0.61 MOS improvement over the traditional method.

引用

页码：618 / 629

页数：12

共 19 条

[11] Virtual View Synthesis for Free Viewpoint Video and Multiview Video Compression using Gaussian Mixture Modelling
Rahaman, D. M. Motiur
Paul, Manoranjan
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1190 - 1201
[12] Robust Super-Resolution for Mixed-Resolution Multiview Image Plus Depth Data
Richter, Thomas
Seiler, Juergen
Schnurrer, Wolfgang
Kaup, Andre
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (05) : 814 - 828
[13] Schreer O., 2013, P 4 ACM MULT SYST C, P249
[14] Seguin D., 2010, CANADAS BROADCAST PR, V2, P43
[15] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[16] Wu G., 2017, P IEEE COMP VIS PATT, P6317
[17] Scalable Bit Allocation Between Texture and Depth Views for 3-D Video Streaming Over Heterogeneous Networks
Xiao, Jimin
Hannuksela, Miska M.
Tillo, Tammam
Gabbouj, Moncef
Zhu, Ce
Zhao, Yao
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (01) : 139 - 152
[18] Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information
Yao, Chao
Tillo, Tammam
Zhao, Yao
Xiao, Jimin
Bai, Huihui
Lin, Chunyu
[J]. IEEE TRANSACTIONS ON BROADCASTING, 2014, 60 (02) : 394 - 404
[19] Zmigrodzka M., 2017, MARK SCI RES ORG

← 1 2 →