Machine learning-based fast CU size decision algorithm for 3D-HEVC inter-coding

Cited by: 0

Authors:

Siham Bakkouri

Abderrahmane Elyousfi

Affiliations:

[1] Computer Systems and Vision Laboratory, Faculty of Sciences, Ibn-Zohr University, Agadir, Morocco

[2] Department of Computer Science, National Engineering School of Applied Sciences, Ibn-Zohr University, Agadir, Morocco

Source:

Journal of Real-Time Image Processing | 2021 / Vol. 18

Keywords:

3D-HEVC; Machine learning; Inter-coding; CU size; Binary classification; AdaBoost

DOI:

Not available

Abstract:

3D-high efficiency video coding (3D-HEVC) is an extension of the high efficiency video coding (HEVC) standard for compressing texture videos and depth maps. In 3D-HEVC inter-coding, each coding unit (CU) is recursively partitioned into variable sizes, which correspond to depth levels. The CU size decision process evaluates all possible depth levels and selects the one with the least rate-distortion (RD) cost, computed using the Lagrange multiplier. This exhaustive search achieves the highest coding efficiency but incurs very high computational complexity. In this paper, a fast CU size decision algorithm is proposed to reduce the complexity of the CU splitting process. The proposed algorithm classifies CU homogeneity using machine learning. First, a tensor feature is extracted to characterize the homogeneity of a CU, which is strongly related to the CU size. Then, a boosted decision stump algorithm is employed to build a binary classification model from the extracted features and to find suitable thresholds for the proposed method. Finally, an efficient early termination of CU splitting is carried out based on adaptive thresholds for texture videos and depth maps. The experimental results show that the proposed algorithm significantly reduces encoding time, while the loss in coding efficiency is negligible.
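The three steps in the abstract (feature extraction, boosted decision stumps, early termination) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the tensor feature, so `gradient_energy` below is a hypothetical stand-in homogeneity measure, and the function names, the labels (+1 for "split", -1 for "do not split"), and the number of boosting rounds are all assumptions made for illustration.

```python
import math

def gradient_energy(block):
    """Hypothetical stand-in for the paper's tensor feature:
    mean squared horizontal + vertical pixel difference of a CU."""
    h, w = len(block), len(block[0])
    total = 0.0
    for y in range(h - 1):
        for x in range(w - 1):
            gx = block[y][x + 1] - block[y][x]
            gy = block[y + 1][x] - block[y][x]
            total += gx * gx + gy * gy
    return total / ((h - 1) * (w - 1))

def stump_predict(x, threshold, sign):
    """Decision stump on one scalar feature; returns +1 or -1."""
    return sign if x > threshold else -sign

def fit_stump(xs, ys, ws):
    """Pick the threshold and polarity with the lowest weighted error."""
    values = sorted(set(xs))
    # Candidate thresholds: midpoints between consecutive unique values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])] or values
    best = None
    for t in candidates:
        for sign in (1, -1):
            err = sum(w for x, y, w in zip(xs, ys, ws)
                      if stump_predict(x, t, sign) != y)
            if best is None or err < best[0]:
                best = (err, t, sign)
    return best

def adaboost(xs, ys, rounds=5):
    """Discrete AdaBoost over decision stumps (labels in {+1, -1})."""
    n = len(xs)
    ws = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, t, sign = fit_stump(xs, ys, ws)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, t, sign))
        # Increase the weight of misclassified samples, then renormalize.
        ws = [w * math.exp(-alpha * y * stump_predict(x, t, sign))
              for x, y, w in zip(xs, ys, ws)]
        total = sum(ws)
        ws = [w / total for w in ws]
    return model

def predict_split(model, x):
    """True if the weighted vote of the stumps says 'split'."""
    return sum(a * stump_predict(x, t, s) for a, t, s in model) > 0

def should_terminate_early(model, block):
    """Skip evaluating deeper CU depth levels for blocks classified as homogeneous."""
    return not predict_split(model, gradient_energy(block))
```

In an encoder, a check like `should_terminate_early` would run before the recursive RD search descends to the next depth level. The abstract mentions separate adaptive thresholds for texture videos and depth maps, which in this sketch would correspond to training one model per component type.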

Citation

Pages: 983-995

Page count: 12
