Reconstruction of Multi-view Video Based on GAN

被引：0

作者：

Li, Song ^{[1
]}

Lan, Chengdong ^{[1
]}

Zhao, Tiesong ^{[1
]}

机构：

[1] Fuzhou Univ, Sch Phys & Informat Engn, Fuzhou, Fujian, Peoples R China

来源：

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II | 2018年 / 11165卷

基金：

中国国家自然科学基金;

关键词：

Hybrid resolution; SRGAN; Virtual view reconstruction; EPI; Multi-view video;

D O I：

10.1007/978-3-030-00767-6_57

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

There is a huge amount of data in multi-view video which brings enormous challenges to the compression, storage, and transmission of video data. Transmitting part of the viewpoint information is a prior solution to reconstruct the original multi-viewpoint information. They are all based on pixel matching to obtain the correlation between adjacent viewpoint images. However, pixels cannot express the invariability of image features and are susceptible to noise. Therefore, in order to overcome the above problems, the VGG network is used to extract the high-dimensional features between the images, indicating the relevance of the adjacent images. The GAN is further used to more accurately generate virtual viewpoint images. We extract the lines at the same positions of the viewpoints as local areas for image merging and input the local images into the network. In the reconstruction viewpoint, we generate a local image of a dense viewpoint through the GAN network. Experiments on multiple test sequences show that the proposed method has a 0.2-0.8-dB PSNR and 0.15-0.61 MOS improvement over the traditional method.

引用

页码：618 / 629

页数：12

共 19 条

[1] SUBJECTIVE STUDY ON COMPRESSED ASYMMETRIC STEREOSCOPIC VIDEO
Aflaki, Payman
Hannuksela, Miska M.
Hakkinen, Jukka
Lindroos, Paul
Gabbouj, Moncef
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 4021 - 4024
[2] [Anonymous], P IEEE C IM VIS COMP
[3] Virtual Reality in Health System: Beyond Entertainment. A Mini-Review on the Efficacy of VR During Cancer Treatment
Chirico, Andrea
Lucidi, Fabio
De Laurentiis, Michele
Milanese, Carla
Napoli, Alessandro
Giordano, Antonio
[J]. JOURNAL OF CELLULAR PHYSIOLOGY, 2016, 231 (02) : 275 - 287
[4] Diogo C., 2012, IEEE T CIRCUITS SYST, V20, P132
[5] Do L., 2010, IS T SPIE ELECT IMAG
[6] Horng YR, 2010, IEEE INT SYMP CIRC S, P2650, DOI 10.1109/ISCAS.2010.5537052
[7] Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling
Jin, Zhi
Tillo, Tammam
Xiao, Jimin
Zhao, Yao
[J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
[8] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Ledig, Christian
Theis, Lucas
Huszar, Ferenc
Caballero, Jose
Cunningham, Andrew
Acosta, Alejandro
Aitken, Andrew
Tejani, Alykhan
Totz, Johannes
Wang, Zehan
Shi, Wenzhe
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 105 - 114
[9] Nongeometric Distortion Smoothing Approach for Depth Map Preprocessing
Lee, Pei-Jun
Effendi
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) : 246 - 254
[10] Oliveira A, 2015, INT CONF ACOUST SPEE, P1186, DOI 10.1109/ICASSP.2015.7178157

← 1 2 →