Shallow and Deep Convolutional Networks for Saliency Prediction

被引:321
作者
Pan, Junting [1 ]
Sayrol, Elisa [1 ]
Giro-I-Nieto, Xavier [1 ]
McGuinness, Kevin [2 ]
O'Connor, Noel E. [2 ]
机构
[1] Univ Politecn Cataluna, Image Proc Grp, Barcelona, Catalonia, Spain
[2] Dublin City Univ, Insight Ctr Data Analyt, Dublin, Ireland
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
基金
爱尔兰科学基金会;
关键词
D O I
10.1109/CVPR.2016.71
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prediction of salient areas in images has been traditionally addressed with hand-crafted features based on neuroscience principles. This paper, however, addresses the problem with a completely data-driven approach by training a convolutional neural network (convnet). The learning process is formulated as a minimization of a loss function that measures the Euclidean distance of the predicted saliency map with the provided ground truth. The recent publication of large datasets of saliency prediction has provided enough data to train end-to-end architectures that are both fast and accurate. Two designs are proposed: a shallow convnet trained from scratch, and a another deeper solution whose first three layers are adapted from another network trained for classification. To the authors' knowledge, these are the first end-to-end CNNs trained and tested for the purpose of saliency prediction.
引用
收藏
页码:598 / 606
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 2010, P PYTH SCI C
[2]  
[Anonymous], 2014, DEEP INSIDE CONVOLUT
[3]  
[Anonymous], 2014, P BRIT MACH VIS C 20
[4]  
[Anonymous], 2015, IEEE C COMP VIS PATT
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], Mit saliency benchmark
[7]  
[Anonymous], 2015, PROC CVPR IEEE
[8]  
[Anonymous], 2015, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
[9]  
[Anonymous], ADV NEURAL INF PROCE
[10]  
[Anonymous], 2015, J VISUAL-JAPAN, DOI DOI 10.1167/15.12.376