What Do Different Evaluation Metrics Tell Us About Saliency Models?

被引：463

作者：

Bylinskii, Zoya ^{[1
]}

Judd, Tilke ^{[2
]}

Oliva, Aude ^{[1
]}

Torralba, Antonio ^{[1
]}

Durand, Fredo ^{[1
]}

机构：

[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] Google, Zurich, Switzerland

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2019年 / 41卷 / 03期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Saliency models; evaluation metrics; benchmarks; fixation maps; saliency applications; VISUAL-ATTENTION; IMAGE RETRIEVAL; EYE-MOVEMENTS; LOCALIZATION; ALLOCATION; FOVEATION; SELECTION; SCENE; GAZE;

D O I：

10.1109/TPAMI.2018.2815601

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

How best to evaluate a saliency model's ability to predict where humans look in images is an open research question. The choice of evaluation metric depends on how saliency is defined and how the ground truth is represented. Metrics differ in how they rank saliency models, and this results from how false positives and false negatives are treated, whether viewing biases are accounted for, whether spatial deviations are factored in, and how the saliency maps are pre-processed. In this paper, we provide an analysis of 8 different evaluation metrics and their properties. With the help of systematic experiments and visualizations of metric computations, we add interpretability to saliency scores and more transparency to the evaluation of saliency models. Building off the differences in metric properties and behaviors, we make recommendations for metric selections under specific assumptions and for specific applications.

引用

页码：740 / 757

页数：18

共 88 条

[81] Saliency-driven scaling optimization for image retargeting [J].

Wang, Dong ;

Li, Guiqing ;

Jia, Weijia ;

Luo, Xiaonan .

VISUAL COMPUTER, 2011, 27 (09) :853-860

[82] Foveation scalable video coding with automatic fixation selection [J].

Wang, Z ;

Lu, LG ;

Bovik, AC .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2003, 12 (02) :243-254

[83] Measures and Limits of Models of Fixation Selection [J].

Wilming, Niklas ;

Betz, Torsten ;

Kietzmann, Tim C. ;

Koenig, Peter .

PLOS ONE, 2011, 6 (09)

[84] Studying Relationships Between Human Gaze, Description, and Computer Vision [J].

Yun, Kiwon ;

Peng, Yifan ;

Samaras, Dimitris ;

Zelinsky, Gregory J. ;

Berg, Tamara L. .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :739-746

[85] SUN: A Bayesian framework for saliency using natural statistics [J].

Zhang, Lingyun ;

Tong, Matthew H. ;

Marks, Tim K. ;

Shan, Honghao ;

Cottrell, Garrison W. .

JOURNAL OF VISION, 2008, 8 (07)

[86] Learning a saliency map using fixated locations in natural scenes [J].

Zhao, Qi ;

Koch, Christof .

JOURNAL OF VISION, 2011, 11 (03)

[87]

Zhou BL, 2014, ADV NEUR IN, V27

[88] Image registration methods:: a survey [J].

Zitová, B ;

Flusser, J .

IMAGE AND VISION COMPUTING, 2003, 21 (11) :977-1000

← 1 2 3 4 5 6 7 8 9 →