Universal String Prediction-Based Inter Coding Algorithm Optimization in AVS2 Mixed Content Coding

Cited by: 0
Authors
Zhao L.-P. [1, 3]
Lin T. [2]
Guo J. [2]
Zhou K.-L. [2]
Affiliations
[1] Department of Computer Science and Engineering, Shaoxing University, Shaoxing, 312000, Zhejiang
[2] Institute of VLSI, Tongji University, Shanghai
[3] College of Mathematics, Physics and Information Engineering, Jiaxing University, Jiaxing, 314000, Zhejiang
Source
Jisuanji Xuebao/Chinese Journal of Computers | 2019, Vol. 42, No. 10
Funding
National Natural Science Foundation of China; Natural Science Foundation of Shanghai
Keywords
Audio video coding standard; Fast algorithm; High efficiency video coding; Inter coding; Screen mixed content coding
DOI
10.11897/SP.J.1016.2019.02190
Abstract
With the rapid development of networking and thin-client technologies, applications such as remote desktop, video conferencing with document or slide sharing, and television programs and online course videos that mix camera-captured content (CC) with screen content (SC) have become widespread. Efficient screen content coding (SCC) has therefore become a hot topic in multimedia applications and has attracted increasing attention from both academia and industry. Two international video coding standards include efficient SCC capability: High Efficiency Video Coding (HEVC) and the second generation of the Audio Video Coding Standard (AVS2). In recent years, AVS has been developing an AVS2 Screen and Mixed Content Coding (SMCC) extension (AVS2-SMCC). In our previous work, a 4:4:4 format universal string prediction (USP) algorithm, integrated with the traditional 4:2:0 format block prediction and transform coding framework, was used to code 4:4:4 format mixed screen content. To fully exploit local and non-local, general and special, and complex and simple matching patterns of various sizes, shapes, and positions across a wide range of common screen content, a USP approach and its key technologies were proposed with three modes: general string (GS) mode, constrained string 1 (CS1) mode, and constrained string 2 (CS2) mode. The three modes are implemented with one of three types of strings (offset string, coordinate string, and unpredictable pixel) or a combination of them. When the USP algorithm codes a coding unit (CU), the mode among the three that yields the minimum rate-distortion cost is selected. Experimental results show that, for the text and graphics with motion category of the AVS2-SMCC test sequences, the USP algorithm achieves significantly improved coding efficiency in the All Intra configuration compared with the latest HEVC-SCC extension.
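The mode decision described above (pick the USP mode with the minimum rate-distortion value for a CU) can be illustrated with a minimal sketch. It assumes the usual Lagrangian cost J = D + lambda * R; the abstract does not state the exact cost function, and the names `ModeResult`, `rd_cost`, `select_usp_mode`, and all numeric values are hypothetical, not taken from the paper or from the RMD reference software.

```python
from dataclasses import dataclass


@dataclass
class ModeResult:
    name: str          # "GS", "CS1", or "CS2"
    distortion: float  # assumed distortion measure for the CU (e.g. SSE)
    bits: float        # assumed number of bits needed to code the CU


def rd_cost(result: ModeResult, lam: float) -> float:
    """Assumed Lagrangian rate-distortion cost J = D + lambda * R."""
    return result.distortion + lam * result.bits


def select_usp_mode(results: list[ModeResult], lam: float) -> ModeResult:
    """Return the candidate mode with the smallest rate-distortion cost."""
    return min(results, key=lambda r: rd_cost(r, lam))


# Illustrative numbers only; they are not measurements from the paper.
candidates = [
    ModeResult("GS", distortion=120.0, bits=40.0),
    ModeResult("CS1", distortion=150.0, bits=20.0),
    ModeResult("CS2", distortion=100.0, bits=70.0),
]
best = select_usp_mode(candidates, lam=2.0)
print(best.name)  # CS1
```

With these made-up inputs the costs are 200, 190, and 240 for GS, CS1, and CS2, so CS1 is selected even though CS2 has the lowest distortion; a larger lambda favors cheaper modes.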
To further improve coding efficiency in the Low Delay configuration and reduce complexity within this framework, an optimized inter coding algorithm is proposed, based on the inherent inter-frame characteristics of mixed content and the differing characteristics of the inter sub-coding modes. It consists of four parts: a 4:4:4 format inter coding algorithm; a content-adaptive weighted chroma distortion algorithm for the different inter sub-coding modes; an early termination algorithm for inter sub-coding mode pre-coding and coding tree unit partitioning; and a new inter constrained string 1 (CS1) mode, in which the CS1 search area extends to the first frame in the reference picture queue. Compared with RMD1.0, the AVS2-SMCC reference software, for the text and graphics with motion category of the AVS2-SMCC test sequences, the proposed algorithm achieves average BD-rate reductions of up to 14.35%, 90.73%, and 88.85%, respectively, in the lossy Low Delay configuration, with a decrease of about 50.86% in encoding runtime and an increase of 15.24% in decoding runtime. For this category in particular, the proposed algorithm significantly improves coding efficiency over the HEVC-SCC extension in both the All Intra and Low Delay configurations. © 2019, Science Press. All rights reserved.
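As a rough illustration of how a weighted chroma distortion and an early termination rule could interact during inter sub-mode pre-coding, the following sketch may help. It is not the paper's algorithm: the abstract gives no formulas, so the weighting form, the fixed threshold, the mode names, and all numbers below are assumptions made for illustration only.

```python
def weighted_distortion(d_luma: float, d_cb: float, d_cr: float,
                        chroma_weight: float) -> float:
    """Assumed form: luma distortion plus chroma distortion scaled by a weight."""
    return d_luma + chroma_weight * (d_cb + d_cr)


def choose_inter_mode(candidates, lam, chroma_weight, stop_threshold):
    """Test sub-modes in order and stop once the best cost is low enough.

    candidates: iterable of (name, d_luma, d_cb, d_cr, bits) tuples.
    Returns (best_name, best_cost, names_tested).
    """
    best_name, best_cost, tested = None, float("inf"), []
    for name, d_luma, d_cb, d_cr, bits in candidates:
        cost = weighted_distortion(d_luma, d_cb, d_cr, chroma_weight) + lam * bits
        tested.append(name)
        if cost < best_cost:
            best_name, best_cost = name, cost
        # Hypothetical early termination: skip the remaining sub-modes.
        if best_cost <= stop_threshold:
            break
    return best_name, best_cost, tested


# Hypothetical sub-mode names and values, ordered from cheapest to test.
modes = [
    ("SKIP", 30.0, 10.0, 10.0, 2.0),
    ("INTER_2Nx2N", 20.0, 5.0, 5.0, 30.0),
    ("INTER_CS1", 5.0, 1.0, 1.0, 60.0),
]
print(choose_inter_mode(modes, lam=1.0, chroma_weight=0.5, stop_threshold=50.0))
print(choose_inter_mode(modes, lam=1.0, chroma_weight=0.5, stop_threshold=10.0))
```

In the first call the SKIP cost (42.0) is already below the threshold, so the other two sub-modes are never evaluated, which is where the encoding-time saving would come from. In the second call the threshold is never reached and all three sub-modes are tested. A content-adaptive scheme would vary `chroma_weight` per block or per sub-mode rather than fixing it as done here.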
Pages: 2190-2202
Page count: 12