Optimizing the Hierarchical Prediction and Coding in HEVC for Surveillance and Conference Videos With Background Modeling

Cited by: 58
Authors
Zhang, Xianguo [1]
Tian, Yonghong [1]
Huang, Tiejun [1]
Dong, Siwei [1]
Gao, Wen [1]
Affiliations
[1] Peking Univ, Natl Engn Lab Video Technol, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
HEVC; hierarchical prediction; surveillance videos; conference videos; background modeling; CU classification;
DOI
10.1109/TIP.2014.2352036
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
For real-time, low-delay video surveillance and teleconferencing applications, the new video coding standard HEVC achieves much higher coding efficiency than H.264/AVC. However, we argue that the hierarchical prediction structure in the HEVC low-delay encoder does not fully exploit the special characteristics of surveillance and conference videos, which are usually captured by stationary cameras. In this case, the background picture (G-picture), which is modeled from the original input frames, can be used to further improve HEVC low-delay coding efficiency while also reducing complexity. Therefore, we propose an optimization method for hierarchical prediction and coding in HEVC for these videos using background modeling. First, several experimental and theoretical analyses are conducted on how to use the G-picture to optimize the hierarchical prediction structure and hierarchical quantization. Based on these results, we propose encoding the G-picture as a long-term reference frame to improve background prediction, and then present a G-picture-based bit-allocation algorithm to increase coding efficiency. In addition, an adaptive speed-up algorithm classifies each coding unit (CU) into one of several categories according to its proportions of background and foreground pixels, and applies a different speed-up strategy to each category to reduce encoding complexity. To evaluate performance, extensive experiments are performed on the HEVC test model. Results show that our method saves 39.09% of bits on average and reduces encoding complexity by 43.63% on surveillance videos; the corresponding figures for conference videos are 5.27% and 43.68%.
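The CU classification step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the category names ("background", "hybrid", "foreground"), the pixel-difference threshold, and the two ratio thresholds are all hypothetical values chosen for illustration, and the abstract does not state how foreground pixels are detected. The sketch assumes a pixel counts as foreground when it differs from the co-located G-picture pixel by more than a fixed amount.

```python
def foreground_ratio(cu, background, diff_threshold=10):
    """Return the fraction of CU pixels that differ from the co-located
    background (G-picture) pixels by more than diff_threshold.

    cu and background are equally sized 2-D lists of luma values.
    diff_threshold is a hypothetical value, not taken from the paper.
    """
    total = 0
    foreground = 0
    for cu_row, bg_row in zip(cu, background):
        for pixel, bg_pixel in zip(cu_row, bg_row):
            total += 1
            if abs(pixel - bg_pixel) > diff_threshold:
                foreground += 1
    return foreground / total if total else 0.0


def classify_cu(cu, background, diff_threshold=10, low=0.05, high=0.5):
    """Assign a CU to a category by its foreground-pixel proportion.

    The thresholds low and high and the three category names are
    assumptions made for this sketch.
    """
    ratio = foreground_ratio(cu, background, diff_threshold)
    if ratio <= low:
        return "background"
    if ratio >= high:
        return "foreground"
    return "hybrid"


# A 4x4 block of the modeled background and three candidate CUs.
g_picture_block = [[100] * 4 for _ in range(4)]
static_cu = [[100] * 4 for _ in range(4)]
moving_cu = [[200] * 4 for _ in range(4)]
mixed_cu = [[200, 100, 100, 100] for _ in range(4)]

static_label = classify_cu(static_cu, g_picture_block)
moving_label = classify_cu(moving_cu, g_picture_block)
mixed_label = classify_cu(mixed_cu, g_picture_block)
```

An encoder could then pick a cheaper mode-decision path for CUs labeled as background and a fuller search for the others; which strategy the paper pairs with each category is not stated in the abstract.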
Pages: 4511-4526
Page count: 16