A semiautomatic saliency model and its application to video compression

被引:0
作者
Lyudvichenko, Vitaliy
Erofeev, Mikhail
Gitman, Yury
Vatolin, Dmitriy
机构
来源
2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP) | 2017年
关键词
Eye-Tracking; Saliency; Video Compression; Visual Attention; x264; IMAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work aims to apply visual-attention modeling to attention-based video compression. During our comparison we found that eye-tracking data collected even from a single observer outperforms existing automatic models by a significant margin. Therefore, we offer a semiautomatic approach: using computer-vision algorithms and good initial estimation of eye-tracking data from just one observer to produce high-quality saliency maps that are similar to multi-observer eye tracking and that are appropriate for practical applications. We propose a simple algorithm that is based on temporal coherence of the visual-attention distribution and requires eye tracking of just one observer. The results are as good as an average gaze map for two observers. While preparing the saliency-model comparison, we paid special attention to the quality-measurement procedure. We observe that many modern visual-attention models can be improved by applying simple transforms such as brightness adjustment and blending with the center-prior model. The novel quality-evaluation procedure that we propose is invariant to such transforms. To show the practical use of our semiautomatic approach, we developed a saliency-aware modification of the x264 video encoder and performed subjective and objective evaluations. The modified encoder can serve with any attention model and is publicly available.
引用
收藏
页码:403 / 410
页数:8
相关论文
共 50 条
  • [41] Revisiting Video Saliency: A Large-scale Benchmark and a New Model
    Wang, Wenguan
    Shen, Jianbing
    Guo, Fang
    Cheng, Ming-Ming
    Borji, Ali
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4894 - 4903
  • [42] An Effective Video Saliency Detection Model Based on Human Visual Acuity and Spatiotemporal Cues in Cloud Systems
    Fang, Zhijun
    Zhang, Juan
    Wan, Wanggen
    Fang, Yuming
    JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (05): : 835 - 840
  • [43] Visual attention guided bit allocation in video compression
    Li, Zhicheng
    Qin, Shiyin
    Itti, Laurent
    IMAGE AND VISION COMPUTING, 2011, 29 (01) : 1 - 14
  • [44] Saliency texture structure descriptor and its application in pedestrian detection
    Xiao, D.-G. (dgxiao@hnu.edu.cn), 1600, Chinese Academy of Sciences (25): : 675 - 689
  • [45] Video saliency prediction using enhanced spatiotemporal alignment network
    Chen, Jin
    Song, Huihui
    Zhang, Kaihua
    Liu, Bo
    Liu, Qingshan
    PATTERN RECOGNITION, 2021, 109
  • [46] No Reference Quality Assessment of Stereo Video Based on Saliency and Sparsity
    Yang, Jiachen
    Ji, Chunqi
    Jiang, Bin
    Lu, Wen
    Meng, Qinggang
    IEEE TRANSACTIONS ON BROADCASTING, 2018, 64 (02) : 341 - 353
  • [47] Visual saliency in MPEG-4 AVC video stream
    Ammar, M.
    Mitrea, M.
    Hasnaoui, M.
    Le Callet, P.
    HUMAN VISION AND ELECTRONIC IMAGING XX, 2015, 9394
  • [48] Video Saliency Detection Using Deep Convolutional Neural Networks
    Zhou, Xiaofei
    Liu, Zhi
    Gong, Chen
    Li, Gongyang
    Huang, Mengke
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 308 - 319
  • [49] Video Saliency Map Detection by Dominant Camera Motion Removal
    Huang, Chun-Rong
    Chang, Yun-Jung
    Yang, Zhi-Xiang
    Lin, Yen-Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1336 - 1349
  • [50] Integrating object proposal with attention networks for video saliency detection
    Jian, Muwei
    Wang, Jiaojin
    Yu, Hui
    Wang, Gai-Ge
    INFORMATION SCIENCES, 2021, 576 : 819 - 830