Neural network-based cross-channel chroma prediction for versatile video coding

被引：0

作者：

Liang, Fang ^{[1
]}

Zhang, Jingde ^{[2
]}

机构：

[1] Huaihua Vocat & Tech Coll, Huaihua 418000, Hunan, Peoples R China

[2] Huaihua Tourism Sch, Huaihua 418000, Hunan, Peoples R China

来源：

JOURNAL OF SUPERCOMPUTING | 2024年 / 80卷 / 09期

关键词：

Versatile video coding (VVC ); Neural network; Cross-channel chroma prediction; Transform loss; INTRA-PREDICTION;

D O I：

10.1007/s11227-023-05868-y

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Despite linear models being introduced in the latest versatile video coding (VVC) standard to exploit the correlation among luma and chroma channels for removing redundancy, these models cannot take into account the nonlinearity of components, resulting in degraded intraprediction precision. In this paper, a neural network-based method is proposed for cross-channel chroma intraprediction to enhance the coding efficiency. Specifically, the neighboring reference and co-located samples are separately input into the proposed network to exploit spatial and cross-channel correlations fully. Furthermore, in order to acquire a more compact representation of residual signals, a transform-based loss is employed to enhance the effectiveness of the compression. The proposed method is integrated into VVC, competing with the intrinsic chroma prediction regarding rate-distortion optimization to enhance coding performance further. The extensive experimental results demonstrate the superiority of the proposed method over the VVC test model (VTM) 18.0, achieving average bitrate savings of 0.28%, 2.44%, and 1.89% for Y, U, and V components, respectively.

引用

页码：12166 / 12185

页数：20

共 46 条

[1] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].

Agustsson, Eirikur ;

Timofte, Radu .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131

[2]

Bjontegaard G, 2008, ITU T SG16Q6 35 VCEG

[3] Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding [J].

Blanch, Marc Gorriz ;

Blasi, Saverio ;

Smeaton, Alan F. ;

O'Connor, Noel E. ;

Mrak, Marta .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) :366-377

[4]

Blanch MG, 2020, IEEE IMAGE PROC, P783, DOI [10.1109/ICIP40778.2020.9191050, 10.1109/icip40778.2020.9191050]

[5]

Bossen F, 2019, JOINT VIDEO EXPERTS, V16, P19

[6]

Bross B, 2018, JVETJ1001

[7]

De-Luxán-Hernández S, 2019, IEEE IMAGE PROC, P1203, DOI [10.1109/icip.2019.8803777, 10.1109/ICIP.2019.8803777]

[8] High Dynamic Range and Wide Color Gamut Video Coding in HEVC: Status and Potential Future Enhancements [J].

Francois, Edouard ;

Fogg, Chad ;

He, Yuwen ;

Li, Xiang ;

Luthra, Ajay ;

Segall, Andrew .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) :63-75

[9]

Galiano V., 2016, J SUPERCOMPUT, V73, P1

[10] On the use of deep learning and parallelism techniques to significantly reduce the HEVC intra-coding time [J].

Galiano, Vicente ;

Migallon, Hector ;

Martinez-Rach, Miguel ;

Lopez-Granado, Otoniel ;

Malumbres, Manuel P. .

JOURNAL OF SUPERCOMPUTING, 2023, 79 (11) :11641-11659

← 1 2 3 4 5 →