Machine Learning-Based Fast Intra Mode Decision for HEVC Screen Content Coding via Decision Trees

被引:39
作者
Kuang, Wei [1 ]
Chan, Yui-Lam [1 ]
Tsang, Sik-Ho [1 ]
Siu, Wan-Chi [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
关键词
Encoding; Copper; Decision trees; Partitioning algorithms; High efficiency video coding; Videos; Computational complexity; Screen content coding (SCC); high efficiency video coding (HEVC); fast algorithm; machine learning; decision tree;
D O I
10.1109/TCSVT.2019.2903547
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The screen content coding (SCC) extension of high efficiency video coding (HEVC) improves coding gain for screen content videos by introducing two new coding modes, namely, intra block copy (IBC) and palette (PLT) modes. However, the coding gain is achieved at the increased cost of computational complexity. In this paper, we propose a decision tree-based framework for fast intra mode decision by investigating various features in the training sets. To avoid the exhaustive mode searching process, a sequential arrangement of decision trees is proposed to check each mode separately by inserting a classifier before checking a mode. As compared with the previous approaches where both IBC and PLT modes are checked for screen content blocks (SCBs), the proposed coding framework is more flexible which facilitates either the IBC or PLT mode to be checked for SCBs such that computational complexity is further reduced. To enhance the accuracy of decision trees, dynamic features are introduced, which reveal the unique intermediate coding information of a coding unit (CU). Then, if all the modes are decided to be skipped for a CU at the last depth level, at least one possible mode is assigned by a CU-type decision tree. Furthermore, a decision tree constraint technique is developed to reduce the rate-distortion performance loss. Compared with the HEVC-SCC reference software SCM-8.3, the proposed algorithm reduces computational complexity by 47.62% on average with a negligible Bjontegaard delta bitrate (BDBR) increase of 1.42% under all-intra (AI) configurations, which outperforms all the state-of-the-art algorithms in the literature.
引用
收藏
页码:1481 / 1496
页数:16
相关论文
共 32 条
[1]  
[Anonymous], [No title captured]
[2]  
[Anonymous], 2016, JCTVCX1015, P1
[3]  
Bjontegaard G., 2001, DOCUMENT VCEG M33 IT, P1
[4]  
BURMAN P, 1989, BIOMETRIKA, V76, P503, DOI 10.2307/2336116
[5]  
Cohen R., 2013, 14 JCT VC M VIENN AU
[6]   Fast Mode and Partition Decision Using Machine Learning for Intra-Frame Coding in HEVC Screen Content Coding Extension [J].
Duanmu, Fanyi ;
Ma, Zhan ;
Wang, Yao .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2016, 6 (04) :517-531
[7]  
Duanmu F, 2015, IEEE IMAGE PROC, P4972, DOI 10.1109/ICIP.2015.7351753
[8]  
Guo J., 2016, 3 JVET M GEN SWITZ M
[9]  
Guyon I., 2003, Journal of Machine Learning Research, V3, P1157, DOI 10.1162/153244303322753616
[10]  
Hall M., 2009, ACM SIGKDD Explor. Newslett., V11, P10, DOI [DOI 10.1145/1656274.1656278, 10.1145/1656274.1656278]