A semiautomatic saliency model and its application to video compression

被引：0

作者：

Lyudvichenko, Vitaliy

Erofeev, Mikhail

Gitman, Yury

Vatolin, Dmitriy

机构：

来源：

2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP) | 2017年

关键词：

Eye-Tracking; Saliency; Video Compression; Visual Attention; x264; IMAGE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work aims to apply visual-attention modeling to attention-based video compression. During our comparison we found that eye-tracking data collected even from a single observer outperforms existing automatic models by a significant margin. Therefore, we offer a semiautomatic approach: using computer-vision algorithms and good initial estimation of eye-tracking data from just one observer to produce high-quality saliency maps that are similar to multi-observer eye tracking and that are appropriate for practical applications. We propose a simple algorithm that is based on temporal coherence of the visual-attention distribution and requires eye tracking of just one observer. The results are as good as an average gaze map for two observers. While preparing the saliency-model comparison, we paid special attention to the quality-measurement procedure. We observe that many modern visual-attention models can be improved by applying simple transforms such as brightness adjustment and blending with the center-prior model. The novel quality-evaluation procedure that we propose is invariant to such transforms. To show the practical use of our semiautomatic approach, we developed a saliency-aware modification of the x264 video encoder and performed subjective and objective evaluations. The modified encoder can serve with any attention model and is publicly available.

引用

页码：403 / 410

页数：8

共 50 条

[1] SEMIAUTOMATIC VISUAL-ATTENTION MODELING AND ITS APPLICATION TO VIDEO COMPRESSION
Gitman, Yury
Erofeev, Mikhail
Vatolin, Dmitriy
Andrey, Bolshakov
Alexey, Fedorov
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1105 - 1109
[2] Spatiotemporal cue fusion-based saliency extraction and its application in video compression
Li K.
Luo Z.
Zhang T.
Ruan Y.
Zhou D.
Cognitive Robotics, 2022, 2 : 177 - 185
[3] A Novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression
Guo, Chenlei
Zhang, Liming
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (01) : 185 - 198
[4] Visual Saliency Guided Foveated Video Compression
Zhang, Shupei
Basu, Anup
IEEE ACCESS, 2023, 11 : 62535 - 62548
[5] An Approach to Video Compression Using Saliency Based Foveation
Polakovic, Adam
Vargic, Radoslav
Rozinaj, Gregor
Muntean, Gabriel-Miro
PROCEEDINGS OF ELMAR-2018: 60TH INTERNATIONAL SYMPOSIUM ELMAR-2018, 2018, : 169 - 172
[6] SALIENCY-PRESERVING VIDEO COMPRESSION
Hadizadeh, Hadi
Bajic, Ivan V.
2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
[7] Saliency-Aware Video Compression
Hadizadeh, Hadi
Bajic, Ivan V.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (01) : 19 - 33
[8] Video compression by computer and its application
Hu Yu
TRENDS IN BUILDING MATERIALS RESEARCH, PTS 1 AND 2, 2012, 450-451 : 1293 - 1296
[9] Study of Saliency in Objective Video Quality Assessment
Zhang, Wei
Liu, Hantao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (03) : 1275 - 1288
[10] Visual saliency guided video compression algorithm
Gupta, Rupesh
Khanna, Meera Thapar
Chaudhury, Santanu
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (09) : 1006 - 1022

← 1 2 3 4 5 →