DeepFix: A Fully Convolutional Neural Network for Predicting Human Eye Fixations

被引:313
作者
Kruthiventi, Srinivas S. S. [1 ]
Ayush, Kumar [2 ]
Babu, R. Venkatesh [1 ]
机构
[1] Indian Inst Sci, Dept Computat & Data Sci, Video Analyt Lab, Bengaluru 560012, India
[2] Indian Inst Technol Kharagpur, Kharagpur 721302, W Bengal, India
关键词
Saliency prediction; eye fixations; convolutional neural network; deep learning; VISUAL-ATTENTION; SALIENCY DETECTION; BOTTOM-UP; OBJECT; MODEL;
D O I
10.1109/TIP.2017.2710620
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding and predicting the human visual attention mechanism is an active area of research in the fields of neuroscience and computer vision. In this paper, we propose DeepFix, a fully convolutional neural network, which models the bottom-up mechanism of visual attention via saliency prediction. Unlike classical works, which characterize the saliency map using various hand-crafted features, our model automatically learns features in a hierarchical fashion and predicts the saliency map in an end-to-end manner. DeepFix is designed to capture semantics at multiple scales while taking global context into account, by using network layers with very large receptive fields. Generally, fully convolutional nets are spatially invariant-this prevents them from modeling location-dependent patterns (e.g., centre-bias). Our network handles this by incorporating a novel location-biased convolutional layer. We evaluate our model on multiple challenging saliency data sets and show that it achieves the state-of-the-art results.
引用
收藏
页码:4446 / 4456
页数:11
相关论文
共 71 条
[1]  
[Anonymous], P IEEE C COMP VIS PA
[2]  
[Anonymous], 2014, Computer Science
[3]  
[Anonymous], 2005, P NEURIPS
[4]  
[Anonymous], Mit saliency benchmark
[5]  
[Anonymous], 2015, PROC CVPR IEEE
[6]  
[Anonymous], P IEEE INT C IM PROC
[7]  
[Anonymous], 2012, Rep. MIT-CSAIL-TR-2012-001
[8]  
[Anonymous], 2014, ACM INT C MULTIMEDIA
[9]  
[Anonymous], LARGE SCALE SCENE UN
[10]   Interactions between attention and working memory [J].
Awh, E ;
Vogel, EK ;
Oh, SH .
NEUROSCIENCE, 2006, 139 (01) :201-208