High Efficiency Video Coding Compliant Perceptual Video Coding Using Entropy Based Visual Saliency Model

被引:3
作者
Zeeshan, Muhammad [1 ]
Majid, Muhammad [1 ]
机构
[1] Univ Engn & Technol, Dept Comp Engn, Taxila 47050, Pakistan
关键词
entropy; information maximization; high efficiency video coding; perceptual video coding; visual saliency; HARDWARE ARCHITECTURE; ALGORITHM; INTEGRATION; ESTIMATOR; SCHEME;
D O I
10.3390/e21100964
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In past years, several visual saliency algorithms have been proposed to extract salient regions from multimedia content in view of practical applications. Entropy is one of the important measures to extract salient regions, as these regions have high randomness and attract more visual attention. In the context of perceptual video coding (PVC), computational visual saliency models that utilize the charactertistics of the human visual system to improve the compression ratio are of paramount importance. To date, only a few PVC schemes have been reported that use the visual saliency model. In this paper, we conduct the first attempt to utilize entropy based visual saliency models within the high efficiency video coding (HEVC) framework. The visual saliency map generated for each input video frame is optimally thresholded to generate a binary saliency mask. The proposed HEVC compliant PVC scheme adjusts the quantization parameter according to visual saliency relevance at the coding tree unit (CTU) level. Efficient CTU level rate control is achieved by allocating bits to salient and non-salient CTUs by adjusting the quantization parameter values according to their perceptual weighted map. The attention based on information maximization has shown the best performance on newly created ground truth dataset, which is then incorporated in a HEVC framework. An average bitrate reduction of 6.57% is achieved by the proposed HEVC compliant PVC scheme with the same perceptual quality and a nominal increase in coding complexity of 3.34% when compared with HEVC reference software. Moreover, the proposed PVC scheme performs better than other HEVC based PVC schemes when encoded at low data rates.
引用
收藏
页数:21
相关论文
共 51 条
  • [1] [Anonymous], METHODOLOGY SUBJECTI
  • [2] [Anonymous], J INF HIDING MULTIME
  • [3] HEVC-Based Perceptually Adaptive Video Coding Using a DCT-Based Local Distortion Detection Probability Model
    Bae, Sung-Ho
    Kim, Jaeil
    Kim, Munchurl
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3343 - 3357
  • [4] Bayesian Integration of Face and Low-Level Cues for Foveated Video Coding
    Boccignone, Giuseppe
    Marcelli, Angelo
    Napoletano, Paolo
    Di Fiore, Gianluca
    Iacovoni, Giovanni
    Morsa, Salvatore
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (12) : 1727 - 1740
  • [5] Bossen F., 2013, JCTVC L1100 COMMON H
  • [6] Saliency, attention, and visual search: An information theoretic approach
    Bruce, Neil D. B.
    Tsotsos, John K.
    [J]. JOURNAL OF VISION, 2009, 9 (03):
  • [7] Macroblock-level adaptive frequency weighting for perceptual video coding
    Chen, Jianwen
    Zheng, Jianhua
    He, Yun
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (02) : 775 - 781
  • [8] High-accuracy multi-camera reconstruction enhanced by adaptive point cloud correction algorithm
    Chen, Mingyou
    Tang, Yunchao
    Zou, Xiangjun
    Huang, Kuangyu
    Li, Lijuan
    He, Yuxin
    [J]. OPTICS AND LASERS IN ENGINEERING, 2019, 122 : 170 - 183
  • [9] Perceptually-Friendly H.264/AVC Video Coding Based on Foveated Just-Noticeable-Distortion Model
    Chen, Zhenzhong
    Guillemot, Christine
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (06) : 806 - 819
  • [10] Global Contrast Based Salient Region Detection
    Cheng, Ming-Ming
    Mitra, Niloy J.
    Huang, Xiaolei
    Torr, Philip H. S.
    Hu, Shi-Min
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) : 569 - 582