DeepAttent: Saliency Prediction with Deep Multi-scale Residual Network

被引:0
作者
Dwivedi, Kshitij [1 ]
Singh, Nitin [2 ]
Shanmugham, Sabari R. [3 ]
Kumar, Manoj [2 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
[2] Samsung Res Inst, Bengaluru, India
[3] DataRobot, Singapore, Singapore
来源
PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2 | 2020年 / 1024卷
关键词
Saliency; Neural networks; Multi-scale; MODEL;
D O I
10.1007/978-981-32-9291-8_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting where humans look in a given scene is a well-known problem with multiple applications in consumer cameras, human-computer interaction, robotics, and gaming. With large-scale image datasets available for human fixation, it is now possible to train deep neural networks for generating a fixationmap. Human fixations are a function of both local visual features and global context. We incorporate this in a deep neural network by using global and local features of an image to predict human fixations. We sample multi-scale features of the deep residual network and introduce a new method for incorporating these multi-scale features for the end-to-end training of our network. Our model DeepAttent obtains competitive results on SALICON and iSUN datasets and outperforms state-of-the-art methods on various metrics.
引用
收藏
页码:65 / 73
页数:9
相关论文
共 27 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]  
[Anonymous], 2016, LSUN 16 LARGE SCALE
[3]  
[Anonymous], 2014, CORR
[4]   Analysis of scores, datasets, and models in visual saliency prediction [J].
Borji, Ali ;
Tavakoli, Hamed R. ;
Sihite, Dicky N. ;
Itti, Laurent .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :921-928
[5]   Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data? [J].
Buhrmester, Michael ;
Kwang, Tracy ;
Gosling, Samuel D. .
PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2011, 6 (01) :3-5
[6]   Using semantic content as cues for better scanpath prediction [J].
Cerf, Moran ;
Frady, E. Paxon ;
Koch, Christof .
PROCEEDINGS OF THE EYE TRACKING RESEARCH AND APPLICATIONS SYMPOSIUM (ETRA 2008), 2008, :143-146
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Guo CL, 2008, PROC CVPR IEEE, P2908
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [J].
Huang, Xun ;
Shen, Chengyao ;
Boix, Xavier ;
Zhao, Qi .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :262-270