Disparity-Aware Reference Frame Generation Network for Multiview Video Coding

Cited by: 4
Authors
Lei, Jianjun [1]
Zhang, Zongqian [1, 2]
Pan, Zhaoqing [1]
Liu, Dong [3]
Liu, Xiangrui [1]
Chen, Ying [4]
Ling, Nam [5]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Alibaba Grp, Hangzhou 310052, Peoples R China
[3] Univ Sci & Technol China, CAS Key Lab Technol Geo Spatial Informat Proc & A, Hefei 230027, Peoples R China
[4] Alibaba Grp, Hangzhou 310052, Peoples R China
[5] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA
Funding
National Natural Science Foundation of China;
Keywords
Image coding; Video coding; Deep learning; Image reconstruction; Estimation; Encoding; Task analysis; Multiview video coding; reference frame generation; disparity-aware alignment; DAG-Net; 3D-HEVC; VIEW SYNTHESIS; PREDICTION; EXTENSIONS;
DOI
10.1109/TIP.2022.3183436
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multiview video coding (MVC) aims to compress the multiview video through the elimination of video redundancies, where the quality of the reference frame directly affects the compression efficiency. In this paper, we propose a deep virtual reference frame generation method based on a disparity-aware reference frame generation network (DAG-Net) to transform the disparity relationship between different viewpoints and generate a more reliable reference frame. The proposed DAG-Net consists of a multi-level receptive field module, a disparity-aware alignment module, and a fusion reconstruction module. First, a multi-level receptive field module is designed to enlarge the receptive field, and extract the multi-scale deep features of the temporal and inter-view reference frames. Then, a disparity-aware alignment module is proposed to learn the disparity relationship, and perform disparity shift on the inter-view reference frame to align it with the temporal reference frame. Finally, a fusion reconstruction module is utilized to fuse the complementary information and generate a more reliable virtual reference frame. Experiments demonstrate that the proposed reference frame generation method achieves superior performance for multiview video coding.
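The align-then-fuse idea in the abstract can be illustrated with a minimal sketch. This is not DAG-Net: the learned disparity-aware alignment is replaced here by a hand-written horizontal shift with a known integer disparity map, and the learned fusion reconstruction by a fixed weighted average. The function names `disparity_shift` and `fuse`, the `weight` parameter, and the toy frames are all assumptions made for illustration.

```python
import numpy as np

def disparity_shift(inter_view, disparity):
    """Warp an inter-view frame horizontally (illustrative stand-in, not DAG-Net).

    Computes out[y, x] = inter_view[y, x + disparity[y, x]], clamping the
    source column at the frame borders.
    """
    h, w = inter_view.shape
    xs = np.arange(w)[None, :] + disparity   # source column for each pixel
    xs = np.clip(xs, 0, w - 1)               # clamp at the left/right borders
    ys = np.arange(h)[:, None]               # row indices, broadcast over columns
    return inter_view[ys, xs]

def fuse(temporal, aligned, weight=0.5):
    """Blend the temporal reference with the aligned inter-view reference.

    A fixed weighted average; the paper uses a learned fusion module instead.
    """
    return weight * temporal + (1.0 - weight) * aligned

# Toy data (assumed): the neighbouring view is the temporal frame
# shifted right by one pixel, so the true disparity is 1 everywhere.
temporal = np.arange(12, dtype=np.float64).reshape(3, 4)
inter_view = np.roll(temporal, 1, axis=1)
disparity = np.ones((3, 4), dtype=np.int64)

aligned = disparity_shift(inter_view, disparity)
virtual_ref = fuse(temporal, aligned)
```

In this toy case the aligned frame matches the temporal frame everywhere except the last column, where the clamped border pixel is reused. Occluded and border regions like this are one place where a learned alignment and fusion would be expected to help.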
Pages: 4515 - 4526
Number of pages: 12
Related papers
50 in total
  • [21] A Low-Power Memory Architecture with Application-Aware Power Management for Motion & Disparity Estimation in Multiview Video Coding
    Zatt, Bruno
    Shafique, Muhammad
    Bampi, Sergio
    Henkel, Joerg
    2011 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2011, : 40 - 47
  • [22] Motion-Aware Deep Video Coding Network
    Khan, Rida
    Liu, Ying
    BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [23] Stretching, Compression and Shearing Disparity Compensated Prediction techniques for Stereo and Multiview Video Coding
    Wong, Ka-Man
    Po, Lai-Man
    Cheung, Kwok-Wai
    Ng, Ka-Ho
    Xu, Xuyuan
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 841 - 844
  • [24] Multisource surveillance video coding with synthetic reference frame
    Chen, Yu
    Hu, Ruimin
    Xiao, Jing
    Wang, Zhongyuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 65
  • [25] Deep Video Prediction Network-Based Inter-Frame Coding in HEVC
    Lee, Jung-Kyung
    Kim, Nayoung
    Cho, Seunghyun
    Kang, Je-Won
    IEEE ACCESS, 2020, 8 : 95906 - 95917
  • [26] Adaptive Detachable Partition-Based Reference Frame Recompression for Video Coding
    Zhou, Jinjia
    Fu, Chen
    IEEE MULTIMEDIA, 2024, 31 (02) : 17 - 25
  • [27] An epipolar geometry-based fast disparity estimation algorithm for multiview image and video coding
    Lu, Jiangbo
    Cai, Hua
    Lou, Jian-Guang
    Li, Jiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (06) : 737 - 750
  • [28] High Performance and Hardware Efficient Multiview Video Coding Frame Scheduling Algorithms and Architectures
    Choi, Minsu
    Chang, Ik Joon
    Kim, Jinsang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (08) : 1312 - 1321
  • [29] Deep Neural Network Based Frame Reconstruction for Optimized Video Coding
    Ding, Dandan
    Liu, Peng
    Chen, Yu
    Zhu, Zheng
    Liu, Zoe
    Bankoski, James
    ARTIFICIAL INTELLIGENCE AND MOBILE SERVICES - AIMS 2018, 2018, 10970 : 235 - 242
  • [30] An Efficient Reference Frame Compression Approach for Video Coding Systems
    Povala, Guilherme
    Silveira, Dieison
    Amaral, Lívia
    Zatt, Bruno
    Porto, Marcelo
    Agostini, Luciano
    2014 IEEE 5TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS (LASCAS), 2014,