Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement

被引:2
|
作者
Jia, Jianghao [1 ]
Zhang, Yuantong [1 ]
Zhu, Han [1 ]
Chen, Zhenzhong [1 ]
Liu, Zizheng [2 ]
Xu, Xiaozhong [3 ]
Liu, Shan [3 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Peoples R China
[2] Tencent Shenzhen, Shenzhen 518000, Peoples R China
[3] Tencent Amer, Palo Alto, CA 94306 USA
关键词
Interpolation; Optical flow; Extrapolation; Bidirectional control; Kernel; Encoding; Streaming media; Neural-network-based video coding; versatile video coding (VVC); inter prediction; deep learning; NETWORK;
D O I
10.1109/TCSVT.2023.3299410
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In video coding, inter prediction aims to reduce temporal redundancy by using previously encoded frames as references. The quality of reference frames is crucial to the performance of inter prediction. This paper presents a deep reference frame generation method to optimize the inter prediction in Versatile Video Coding (VVC). Specifically, reconstructed frames are sent to a well-designed frame generation network to synthesize a picture similar to the current encoding frame. The synthesized picture serves as an additional reference frame inserted into the reference picture list (RPL) to provide a more reliable reference for subsequent motion estimation (ME) and motion compensation (MC). The frame generation network employs optical flow to predict motion precisely. Moreover, an optical flow reorganization strategy is proposed to enable bi-directional and uni-directional predictions with only a single network architecture. To reasonably apply our method to VVC, we further introduce a normative modification of the temporal motion vector prediction (TMVP). Integrated into the VVC reference software VTM-15.0, the deep reference frame generation method achieves coding efficiency improvements of 5.22%, 3.61%, and 3.83% for the Y component under random access (RA), low delay B (LDB), and low delay P (LDP) configurations, respectively. The proposed method has been discussed in Joint Video Exploration Team (JVET) meeting and is currently part of Exploration Experiments (EE) for further study.
引用
收藏
页码:3111 / 3124
页数:14
相关论文
共 50 条
  • [41] PREDICTION OF PHOTOVOLTAIC GENERATION USING DEEP LEARNING
    Fraga Hurtado, Isidro
    Gomez Rodriguez, Marco Antonio
    Gomez Sarduy, Julio Rafael
    Garcia Sanchez, Zaid
    REVISTA UNIVERSIDAD Y SOCIEDAD, 2023, 15 : 266 - 275
  • [42] Grey Relational Frame Prediction Method for Anomaly Detection
    Li, Chaobo
    Li, Hongjun
    Sun, Xiaohu
    Zhang, Guoan
    JOURNAL OF GREY SYSTEM, 2022, 34 (01) : 1 - 16
  • [43] Deep Multiframe Enhancement for Motion Prediction in Video Compression
    Prette, Nicola
    Valsesia, Diego
    Bianchi, Tiziano
    2021 28TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (IEEE ICECS 2021), 2021,
  • [44] Deep Inter Prediction with Error-Corrected Auto-Regressive Network for Video Coding
    Hu, Yuzhang
    Yang, Wenhan
    Liu, Jiaying
    Guo, Zongming
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [45] Deep Reference Frame for Versatile Video Coding with Structural Re-parameterization
    Gui, Chengzhuo
    Zhang, Yuantong
    Bao, Weijie
    Chen, Zhenzhong
    Wang, Huairui
    Liu, Shan
    2024 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING, VCIP, 2024,
  • [46] Protein secondary structure prediction by using deep learning method
    Wang, Yangxu
    Mao, Hua
    Yi, Zhang
    KNOWLEDGE-BASED SYSTEMS, 2017, 118 : 115 - 123
  • [47] Gated fusion network for SAO filter and inter frame prediction in Versatile Video Coding
    Kuanar, Shiba
    Athitsos, Vassilis
    Mahapatra, Dwarikanath
    Rao, K. R.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 109
  • [48] Video Frame Prediction by Deep Multi-Branch Mask Network
    Li, Sen
    Fang, Jianwu
    Xu, Hongke
    Xue, Jianru
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1283 - 1295
  • [49] Reference frame list optimization algorithm in video coding by quality enhancement of the nearest picture
    Huo J.
    Qiu R.
    Ma Y.
    Yang F.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (11): : 136 - 147
  • [50] Content Classification based Reference Frame Reduction and Machine Learning based Non-square Block Partition Skipping for Inter Prediction of Screen Content Coding
    Wang, Yawei
    Chen, Gaoxing
    Ikenaga, Takeshi
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 240 - 244