A semiautomatic saliency model and its application to video compression

被引:0
作者
Lyudvichenko, Vitaliy
Erofeev, Mikhail
Gitman, Yury
Vatolin, Dmitriy
机构
来源
2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP) | 2017年
关键词
Eye-Tracking; Saliency; Video Compression; Visual Attention; x264; IMAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work aims to apply visual-attention modeling to attention-based video compression. During our comparison we found that eye-tracking data collected even from a single observer outperforms existing automatic models by a significant margin. Therefore, we offer a semiautomatic approach: using computer-vision algorithms and good initial estimation of eye-tracking data from just one observer to produce high-quality saliency maps that are similar to multi-observer eye tracking and that are appropriate for practical applications. We propose a simple algorithm that is based on temporal coherence of the visual-attention distribution and requires eye tracking of just one observer. The results are as good as an average gaze map for two observers. While preparing the saliency-model comparison, we paid special attention to the quality-measurement procedure. We observe that many modern visual-attention models can be improved by applying simple transforms such as brightness adjustment and blending with the center-prior model. The novel quality-evaluation procedure that we propose is invariant to such transforms. To show the practical use of our semiautomatic approach, we developed a saliency-aware modification of the x264 video encoder and performed subjective and objective evaluations. The modified encoder can serve with any attention model and is publicly available.
引用
收藏
页码:403 / 410
页数:8
相关论文
共 50 条
  • [1] SEMIAUTOMATIC VISUAL-ATTENTION MODELING AND ITS APPLICATION TO VIDEO COMPRESSION
    Gitman, Yury
    Erofeev, Mikhail
    Vatolin, Dmitriy
    Andrey, Bolshakov
    Alexey, Fedorov
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1105 - 1109
  • [2] Spatiotemporal cue fusion-based saliency extraction and its application in video compression
    Li K.
    Luo Z.
    Zhang T.
    Ruan Y.
    Zhou D.
    Cognitive Robotics, 2022, 2 : 177 - 185
  • [3] A Novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression
    Guo, Chenlei
    Zhang, Liming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (01) : 185 - 198
  • [4] Visual Saliency Guided Foveated Video Compression
    Zhang, Shupei
    Basu, Anup
    IEEE ACCESS, 2023, 11 : 62535 - 62548
  • [5] An Approach to Video Compression Using Saliency Based Foveation
    Polakovic, Adam
    Vargic, Radoslav
    Rozinaj, Gregor
    Muntean, Gabriel-Miro
    PROCEEDINGS OF ELMAR-2018: 60TH INTERNATIONAL SYMPOSIUM ELMAR-2018, 2018, : 169 - 172
  • [6] SALIENCY-PRESERVING VIDEO COMPRESSION
    Hadizadeh, Hadi
    Bajic, Ivan V.
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [7] Saliency-Aware Video Compression
    Hadizadeh, Hadi
    Bajic, Ivan V.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (01) : 19 - 33
  • [8] Video compression by computer and its application
    Hu Yu
    TRENDS IN BUILDING MATERIALS RESEARCH, PTS 1 AND 2, 2012, 450-451 : 1293 - 1296
  • [9] Study of Saliency in Objective Video Quality Assessment
    Zhang, Wei
    Liu, Hantao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (03) : 1275 - 1288
  • [10] Visual saliency guided video compression algorithm
    Gupta, Rupesh
    Khanna, Meera Thapar
    Chaudhury, Santanu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (09) : 1006 - 1022