A semiautomatic saliency model and its application to video compression

被引：0

作者：

Lyudvichenko, Vitaliy

Erofeev, Mikhail

Gitman, Yury

Vatolin, Dmitriy

机构：

来源：

2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP) | 2017年

关键词：

Eye-Tracking; Saliency; Video Compression; Visual Attention; x264; IMAGE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work aims to apply visual-attention modeling to attention-based video compression. During our comparison we found that eye-tracking data collected even from a single observer outperforms existing automatic models by a significant margin. Therefore, we offer a semiautomatic approach: using computer-vision algorithms and good initial estimation of eye-tracking data from just one observer to produce high-quality saliency maps that are similar to multi-observer eye tracking and that are appropriate for practical applications. We propose a simple algorithm that is based on temporal coherence of the visual-attention distribution and requires eye tracking of just one observer. The results are as good as an average gaze map for two observers. While preparing the saliency-model comparison, we paid special attention to the quality-measurement procedure. We observe that many modern visual-attention models can be improved by applying simple transforms such as brightness adjustment and blending with the center-prior model. The novel quality-evaluation procedure that we propose is invariant to such transforms. To show the practical use of our semiautomatic approach, we developed a saliency-aware modification of the x264 video encoder and performed subjective and objective evaluations. The modified encoder can serve with any attention model and is publicly available.

引用

页码：403 / 410

页数：8

共 50 条

[41] Revisiting Video Saliency: A Large-scale Benchmark and a New Model
Wang, Wenguan
Shen, Jianbing
Guo, Fang
Cheng, Ming-Ming
Borji, Ali
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4894 - 4903
[42] An Effective Video Saliency Detection Model Based on Human Visual Acuity and Spatiotemporal Cues in Cloud Systems
Fang, Zhijun
Zhang, Juan
Wan, Wanggen
Fang, Yuming
JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (05): : 835 - 840
[43] Visual attention guided bit allocation in video compression
Li, Zhicheng
Qin, Shiyin
Itti, Laurent
IMAGE AND VISION COMPUTING, 2011, 29 (01) : 1 - 14
[44] Saliency texture structure descriptor and its application in pedestrian detection
Xiao, D.-G. (dgxiao@hnu.edu.cn), 1600, Chinese Academy of Sciences (25): : 675 - 689
[45] Video saliency prediction using enhanced spatiotemporal alignment network
Chen, Jin
Song, Huihui
Zhang, Kaihua
Liu, Bo
Liu, Qingshan
PATTERN RECOGNITION, 2021, 109
[46] No Reference Quality Assessment of Stereo Video Based on Saliency and Sparsity
Yang, Jiachen
Ji, Chunqi
Jiang, Bin
Lu, Wen
Meng, Qinggang
IEEE TRANSACTIONS ON BROADCASTING, 2018, 64 (02) : 341 - 353
[47] Visual saliency in MPEG-4 AVC video stream
Ammar, M.
Mitrea, M.
Hasnaoui, M.
Le Callet, P.
HUMAN VISION AND ELECTRONIC IMAGING XX, 2015, 9394
[48] Video Saliency Detection Using Deep Convolutional Neural Networks
Zhou, Xiaofei
Liu, Zhi
Gong, Chen
Li, Gongyang
Huang, Mengke
PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 308 - 319
[49] Video Saliency Map Detection by Dominant Camera Motion Removal
Huang, Chun-Rong
Chang, Yun-Jung
Yang, Zhi-Xiang
Lin, Yen-Yu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1336 - 1349
[50] Integrating object proposal with attention networks for video saliency detection
Jian, Muwei
Wang, Jiaojin
Yu, Hui
Wang, Gai-Ge
INFORMATION SCIENCES, 2021, 576 : 819 - 830

← 1 2 3 4 5 →