MetaIP: Meta-Network-Based Intra Prediction With Customized Parameters for Video Coding

被引:1
作者
Man, Hengyu [1 ,2 ]
Fan, Xiaopeng [1 ,2 ]
Lu, Riyu [1 ,2 ]
Yu, Chang [1 ,2 ]
Zhao, Debin [1 ,2 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Video coding; Predictive models; Image reconstruction; Training; Task analysis; Bit rate; Quantization (signal); Intra prediction; meta-network; parameter customization; video coding;
D O I
10.1109/TCSVT.2024.3395458
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Intra prediction is a vital tool in video coding that eliminates the spatial redundancy within a frame to enhance compression efficiency. Conventional intra prediction methods employ multiple directional prediction modes to describe textures in local areas. Recently, research on neural network-based intra prediction has achieved great success. The block-context pairs are divided into multiple clusters according to a predefined relationship, and a corresponding network is trained and applied for each cluster. However, the networks in these methods adopt fixed parameters to predict diverse image blocks, making it hard to cope with various textures in natural images. Inspired by recent works on parameter prediction, in this paper, we propose a meta-network-based intra prediction method, called MetaIP, that dynamically customizes the network parameters for each block sample in a given cluster. MetaIP consists of a meta-subnetwork and a prediction subnetwork. For an image block, the meta-subnetwork takes its neighboring reference pixels and some auxiliary information (e.g., quantization parameter) as inputs to generate customized parameters first. Then, the prediction subnetwork uses the customized parameters to infer the predicted block. MetaIP can generate multiple sets of network parameters corresponding to multiple modes for an image block. The optimal mode is determined by the rate-distortion optimization. MetaIP is integrated into VVC to assist or replace the directional prediction modes to evaluate its performance. The experimental results demonstrate that MetaIP with four prediction modes achieves an average of 3.84% and 1.96% bitrate saving for the luma component over VTM-17.0 when assisting or replacing VVC intra modes, respectively.
引用
收藏
页码:9591 / 9605
页数:15
相关论文
共 45 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]  
Alshina E., 2022, JVETAA2016
[3]  
Bjontegaard G., 2008, Technical Report VCEG-AI11, ITU-T SG16 Q.6
[4]  
Bossen F, 2019, Tech. Rep. JVET-N1010
[5]   Intra-Frame Coding Using a Conditional Autoencoder [J].
Brand, Fabian ;
Seiler, Juergen ;
Kaup, Andre .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) :354-365
[6]  
Bross B., 2018, document JVET-L0283
[7]   Overview of the Versatile Video Coding (VVC) Standard and its Applications [J].
Bross, Benjamin ;
Wang, Ye-Kui ;
Ye, Yan ;
Liu, Shan ;
Chen, Jianle ;
Sullivan, Gary J. ;
Ohm, Jens-Rainer .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3736-3764
[8]  
Chang Y., 2021, JVETU0100
[9]  
Choi N., 2018, document JVET-K0529
[10]   Convolutional Neural Networks Based Intra Prediction for HEVC [J].
Cui, Wenxue ;
Zhang, Tao ;
Zhang, Shengping ;
Jiang, Feng ;
Zuo, Wangmeng ;
Wan, Zhaolin ;
Zhao, Debin .
2017 DATA COMPRESSION CONFERENCE (DCC), 2017, :436-436