Fast CU Depth Decision for HEVC Using Neural Networks

被引:55
作者
Kim, Kyungah [1 ]
Ro, Won Woo [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul 03722, South Korea
关键词
High Efficiency Video Coding (HEVC); fast coding unit (CU) depth decision; convolutional neural network (CNN); EFFICIENCY; COMPLEXITY; ALGORITHM;
D O I
10.1109/TCSVT.2018.2839113
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a coding unit (CU) depth-decision algorithm using neural networks to reduce the computational overhead of High Efficiency Video Coding (HEVC). The coding tree unit (CTU) of HEVC has a quad-tree structure, and its computational complexity is considerably high because it searches an optimal CU depth from the upper to lower depth recursively and exhaustively. In the proposed method, neural networks are used to predict the CTU depth. A database for neural networks is constructed, which considers both the image and encoding properties of the CU. It consists of the image data representing the image value of the CU, the vector data based on the encoding information of the CU, and the labels indicating whether the CU is divided. By using both properties of the CU, high test accuracy can be achieved. It is completely separated from the test sequence used for encoding and can be configured to use a sequence with various resolutions, motions, and contents for diverse CUs. We also design a neural-network architecture and perform training. The architecture consists of the convolution and pooling layers for analyzing the image property of the CU. The feature map is concatenated with the vector data and trained by fully connected layers in order to analyze the encoding property of the CU. Finally, a fast CU depth-decision algorithm is designed based on the trained neural networks. When the result of the neural network inference with the current CU depth is non-split, the operation on the lower CU depth is skipped. The experimental results show that the proposed method can reduce the computational overhead by 61.77% on average, and by a maximum of 73.45% with 3.91% Bjontegaard-Differencebitrate (BD rate) degradation.
引用
收藏
页码:1462 / 1473
页数:12
相关论文
共 28 条
  • [1] Ahmadi A, 2016, IEEE IMAGE PROC, P1629, DOI 10.1109/ICIP.2016.7532634
  • [2] [Anonymous], 2014, P 2014 IEEE INT C MU, DOI DOI 10.1109/ICMEW.2014.6890647
  • [3] [Anonymous], [No title captured]
  • [4] [Anonymous], 2001, ITU T VCEG M AUST TE
  • [5] Baroncini V., 2012, JCTVCH1004 ITUT
  • [6] HEVC Complexity and Implementation Analysis
    Bossen, Frank
    Bross, Benjamin
    Suehring, Karsten
    Flynn, David
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (12) : 1685 - 1696
  • [7] Fast CU Splitting and Pruning for Suboptimal CU Partitioning in HEVC Intra Coding
    Cho, Seunghyun
    Kim, Munchurl
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (09) : 1555 - 1564
  • [8] Choi K., 2011, JCTVCF092 ITUT
  • [9] Chu H., 2016, LIGHTNING MEMORY MAP
  • [10] Fast HEVC Encoding Decisions Using Data Mining
    Correa, Guilherme
    Assuncao, Pedro A.
    Agostini, Luciano Volcan
    da Silva Cruz, Luis A.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (04) : 660 - 673