Predicting split decisions in MPEG-2 to HEVC video transcoding

Cited by: 1

Authors:

Shanableh, Tamer [1]

Hassan, Mahitab [2]

Affiliations:

[1] Amer Univ Sharjah, Dept Comp Sci & Engn, Sharjah, U Arab Emirates

[2] IBM Cloud, Dubai, U Arab Emirates

Source:

SN APPLIED SCIENCES | 2020 / Volume 2 / Issue 06

Keywords:

Video coding; Video transcoding; HEVC; Machine learning; H.264/AVC; INTER

DOI:

10.1007/s42452-020-2909-7

Chinese Library Classification (CLC) number:

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]

Discipline classification code:

07; 0710; 09

Abstract:

This paper proposes learning-based approaches for transcoding videos compressed in the Moving Picture Experts Group 2 (MPEG-2) format into the High Efficiency Video Coding (HEVC) format. In the training mode of the transcoder, mappings between extracted features and split decisions are calculated. In the transcoding mode, the split decisions of the coding units of the HEVC video are predicted. Two formulations are proposed for predicting split decisions: a multi-model solution and a multi-tier solution. In the former, multiple models are generated based on the total number of split flags in a coding unit. In the latter, split decisions are modelled at three different coding depths. The proposed solutions are evaluated in terms of excess bitrate, drop in PSNR, classification accuracy, model generation time and transcoding speedup. It is shown that the multi-tier solution maintains the rate-distortion behaviour of full re-encoding at the expense of a lower transcoding speedup. In comparison to existing work, the proposed solutions are shown to offer a significant improvement in rate-distortion performance and classification accuracy.


Pages: 14