Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement

被引:2
|
作者
Jia, Jianghao [1 ]
Zhang, Yuantong [1 ]
Zhu, Han [1 ]
Chen, Zhenzhong [1 ]
Liu, Zizheng [2 ]
Xu, Xiaozhong [3 ]
Liu, Shan [3 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Peoples R China
[2] Tencent Shenzhen, Shenzhen 518000, Peoples R China
[3] Tencent Amer, Palo Alto, CA 94306 USA
关键词
Interpolation; Optical flow; Extrapolation; Bidirectional control; Kernel; Encoding; Streaming media; Neural-network-based video coding; versatile video coding (VVC); inter prediction; deep learning; NETWORK;
D O I
10.1109/TCSVT.2023.3299410
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In video coding, inter prediction aims to reduce temporal redundancy by using previously encoded frames as references. The quality of reference frames is crucial to the performance of inter prediction. This paper presents a deep reference frame generation method to optimize the inter prediction in Versatile Video Coding (VVC). Specifically, reconstructed frames are sent to a well-designed frame generation network to synthesize a picture similar to the current encoding frame. The synthesized picture serves as an additional reference frame inserted into the reference picture list (RPL) to provide a more reliable reference for subsequent motion estimation (ME) and motion compensation (MC). The frame generation network employs optical flow to predict motion precisely. Moreover, an optical flow reorganization strategy is proposed to enable bi-directional and uni-directional predictions with only a single network architecture. To reasonably apply our method to VVC, we further introduce a normative modification of the temporal motion vector prediction (TMVP). Integrated into the VVC reference software VTM-15.0, the deep reference frame generation method achieves coding efficiency improvements of 5.22%, 3.61%, and 3.83% for the Y component under random access (RA), low delay B (LDB), and low delay P (LDP) configurations, respectively. The proposed method has been discussed in Joint Video Exploration Team (JVET) meeting and is currently part of Exploration Experiments (EE) for further study.
引用
收藏
页码:3111 / 3124
页数:14
相关论文
共 50 条
  • [31] Memory-Augmented Auto-Regressive Network for Frame Recurrent Inter Prediction
    Hu, Yuzhang
    Xia, Sifeng
    Yang, Wenhan
    Liu, Jiaying
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [32] Video Frame Prediction via Deep Learning
    Yilmaz, M. Akin
    Tekalp, A. Murat
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [33] Inter-frame Enhancement of Ultrasound Images Using Optical Flow
    Achmad, Balza
    Mustafa, Mohd Marzuki
    Hussain, Aini
    VISUAL INFORMATICS: BRIDGING RESEARCH AND PRACTICE, 2009, 5857 : 191 - 201
  • [34] VRFCNN: Virtual Reference Frame Generation Network for Quality SHVC
    Ding, Qing
    Shen, Liquan
    Yang, Hao
    Dong, Xinchao
    Xu, Mai
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 2049 - 2053
  • [35] Improving Frame-Online Neural Speech Enhancement With Overlapped-Frame Prediction
    Wang, Zhong-Qiu
    Watanabe, Shinji
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1422 - 1426
  • [36] Cloud Gaming Video Coding Optimization Based on Camera Motion-Guided Reference Frame Enhancement
    Wang, Yifan
    Wang, Hao
    Wang, Kaijie
    Zhang, Wei
    APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [37] UNDERWATER POLARIZATION IMAGE ENHANCEMENT METHOD BY USING DEEP LEARNING
    Jiang, Nan
    Li, Guohui
    Lv, Dong
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2024, 25 (06) : 1413 - 1422
  • [38] Simplifications in inter-frame prediction in the H.265/HEVC encoder
    Trochimiuk, Maciej
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2015, 2015, 9662
  • [39] A Synthetic Data Generation Technique for Enhancement of Prediction Accuracy of Electric Vehicles Demand
    Chatterjee, Subhajit
    Byun, Yung-Cheol
    SENSORS, 2023, 23 (02)
  • [40] A Deep Learning Method for Automatic News Frame Identification
    Zhang, Xin
    Wei, Qiyi
    Zheng, Bin
    Zhang, Pengzhou
    27TH IEEE/ACIS INTERNATIONAL SUMMER CONFERENCE ON SOFTWARE ENGINEERING ARTIFICIAL INTELLIGENCE NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, SNPD 2024-SUMMER, 2024, : 18 - 24