Saliency-Based Fidelity Adaptation Preprocessing for Video Coding

被引:15
作者
Lu, Shao-Ping [1 ]
Zhang, Song-Hai
机构
[1] Tsinghua Univ, Dept Comp Sci, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
visual saliency; bilateral filter; fidelity adjustment; region-of-interest; encoder; BIT ALLOCATION; MODEL; ENHANCEMENT; COMPRESSION;
D O I
10.1007/s11390-011-9426-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a video coding scheme which applies the technique of visual saliency computation to adjust image fidelity before compression. To extract visually salient features, we construct a spatio-temporal saliency map by analyzing the video using a combined bottom-up and top-down visual saliency model. We then use an extended bilateral filter, in which the local intensity and spatial scales are adjusted according to visual saliency, to adaptively alter the image fidelity. Our implementation is based on the H.264 video encoder JM12.0. Besides evaluating our scheme with the H.264 reference software, we also compare it to a more traditional foreground-background segmentation-based method and a foveation-based approach which employs Gaussian blurring. Our results show that the proposed algorithm can improve the compression ratio significantly while effectively preserving perceptual visual quality.
引用
收藏
页码:195 / 202
页数:8
相关论文
共 29 条
  • [1] [Anonymous], H 264 AVC REFERENCE
  • [2] [Anonymous], 1981, PRINCIPLES PSYCHOL
  • [3] Video enhancement using per-pixel virtual exposures
    Bennett, EP
    McMillan, L
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 845 - 852
  • [4] Semantic video analysis for adaptive content delivery and automatic description
    Cavallaro, A
    Steiger, O
    Ebrahimi, T
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (10) : 1200 - 1209
  • [5] Cerf Moran., 2007, Advances in Neural Information and Processing Systems, P241
  • [6] Chai D, 1997, ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV, P1448, DOI 10.1109/ISCAS.1997.622190
  • [7] ROI video coding based on H.263+with robust skin-color detection technique
    Chen, MJ
    Chi, MC
    Hsu, CT
    Chen, JW
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2003, 49 (03) : 724 - 730
  • [8] Comparison of human face matching behavior and computational image similarity measure
    Chen WenFeng
    Liu ChangHong
    Lander Karen
    Fu XiaoLan
    [J]. SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2009, 52 (02): : 316 - 321
  • [9] Flash photography enhancement via intrinsic relighting
    Eisemann, E
    Durand, F
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03): : 673 - 678
  • [10] Performance characterization of video-shot-change detection methods
    Gargi, U
    Kasturi, R
    Strayer, SH
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2000, 10 (01) : 1 - 13