Reconstruction of Multi-view Video Based on GAN

被引:0
作者
Li, Song [1 ]
Lan, Chengdong [1 ]
Zhao, Tiesong [1 ]
机构
[1] Fuzhou Univ, Sch Phys & Informat Engn, Fuzhou, Fujian, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II | 2018年 / 11165卷
基金
中国国家自然科学基金;
关键词
Hybrid resolution; SRGAN; Virtual view reconstruction; EPI; Multi-view video;
D O I
10.1007/978-3-030-00767-6_57
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a huge amount of data in multi-view video which brings enormous challenges to the compression, storage, and transmission of video data. Transmitting part of the viewpoint information is a prior solution to reconstruct the original multi-viewpoint information. They are all based on pixel matching to obtain the correlation between adjacent viewpoint images. However, pixels cannot express the invariability of image features and are susceptible to noise. Therefore, in order to overcome the above problems, the VGG network is used to extract the high-dimensional features between the images, indicating the relevance of the adjacent images. The GAN is further used to more accurately generate virtual viewpoint images. We extract the lines at the same positions of the viewpoints as local areas for image merging and input the local images into the network. In the reconstruction viewpoint, we generate a local image of a dense viewpoint through the GAN network. Experiments on multiple test sequences show that the proposed method has a 0.2-0.8-dB PSNR and 0.15-0.61 MOS improvement over the traditional method.
引用
收藏
页码:618 / 629
页数:12
相关论文
共 19 条
  • [11] Virtual View Synthesis for Free Viewpoint Video and Multiview Video Compression using Gaussian Mixture Modelling
    Rahaman, D. M. Motiur
    Paul, Manoranjan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1190 - 1201
  • [12] Robust Super-Resolution for Mixed-Resolution Multiview Image Plus Depth Data
    Richter, Thomas
    Seiler, Juergen
    Schnurrer, Wolfgang
    Kaup, Andre
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (05) : 814 - 828
  • [13] Schreer O., 2013, P 4 ACM MULT SYST C, P249
  • [14] Seguin D., 2010, CANADAS BROADCAST PR, V2, P43
  • [15] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
  • [16] Wu G., 2017, P IEEE COMP VIS PATT, P6317
  • [17] Scalable Bit Allocation Between Texture and Depth Views for 3-D Video Streaming Over Heterogeneous Networks
    Xiao, Jimin
    Hannuksela, Miska M.
    Tillo, Tammam
    Gabbouj, Moncef
    Zhu, Ce
    Zhao, Yao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (01) : 139 - 152
  • [18] Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information
    Yao, Chao
    Tillo, Tammam
    Zhao, Yao
    Xiao, Jimin
    Bai, Huihui
    Lin, Chunyu
    [J]. IEEE TRANSACTIONS ON BROADCASTING, 2014, 60 (02) : 394 - 404
  • [19] Zmigrodzka M., 2017, MARK SCI RES ORG