Reconstruction of Multi-view Video Based on GAN

被引:0
作者
Li, Song [1 ]
Lan, Chengdong [1 ]
Zhao, Tiesong [1 ]
机构
[1] Fuzhou Univ, Sch Phys & Informat Engn, Fuzhou, Fujian, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II | 2018年 / 11165卷
基金
中国国家自然科学基金;
关键词
Hybrid resolution; SRGAN; Virtual view reconstruction; EPI; Multi-view video;
D O I
10.1007/978-3-030-00767-6_57
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a huge amount of data in multi-view video which brings enormous challenges to the compression, storage, and transmission of video data. Transmitting part of the viewpoint information is a prior solution to reconstruct the original multi-viewpoint information. They are all based on pixel matching to obtain the correlation between adjacent viewpoint images. However, pixels cannot express the invariability of image features and are susceptible to noise. Therefore, in order to overcome the above problems, the VGG network is used to extract the high-dimensional features between the images, indicating the relevance of the adjacent images. The GAN is further used to more accurately generate virtual viewpoint images. We extract the lines at the same positions of the viewpoints as local areas for image merging and input the local images into the network. In the reconstruction viewpoint, we generate a local image of a dense viewpoint through the GAN network. Experiments on multiple test sequences show that the proposed method has a 0.2-0.8-dB PSNR and 0.15-0.61 MOS improvement over the traditional method.
引用
收藏
页码:618 / 629
页数:12
相关论文
共 19 条
  • [1] SUBJECTIVE STUDY ON COMPRESSED ASYMMETRIC STEREOSCOPIC VIDEO
    Aflaki, Payman
    Hannuksela, Miska M.
    Hakkinen, Jukka
    Lindroos, Paul
    Gabbouj, Moncef
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 4021 - 4024
  • [2] [Anonymous], P IEEE C IM VIS COMP
  • [3] Virtual Reality in Health System: Beyond Entertainment. A Mini-Review on the Efficacy of VR During Cancer Treatment
    Chirico, Andrea
    Lucidi, Fabio
    De Laurentiis, Michele
    Milanese, Carla
    Napoli, Alessandro
    Giordano, Antonio
    [J]. JOURNAL OF CELLULAR PHYSIOLOGY, 2016, 231 (02) : 275 - 287
  • [4] Diogo C., 2012, IEEE T CIRCUITS SYST, V20, P132
  • [5] Do L., 2010, IS T SPIE ELECT IMAG
  • [6] Horng YR, 2010, IEEE INT SYMP CIRC S, P2650, DOI 10.1109/ISCAS.2010.5537052
  • [7] Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling
    Jin, Zhi
    Tillo, Tammam
    Xiao, Jimin
    Zhao, Yao
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
  • [8] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
    Ledig, Christian
    Theis, Lucas
    Huszar, Ferenc
    Caballero, Jose
    Cunningham, Andrew
    Acosta, Alejandro
    Aitken, Andrew
    Tejani, Alykhan
    Totz, Johannes
    Wang, Zehan
    Shi, Wenzhe
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 105 - 114
  • [9] Nongeometric Distortion Smoothing Approach for Depth Map Preprocessing
    Lee, Pei-Jun
    Effendi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) : 246 - 254
  • [10] Oliveira A, 2015, INT CONF ACOUST SPEE, P1186, DOI 10.1109/ICASSP.2015.7178157