DeepAttent: Saliency Prediction with Deep Multi-scale Residual Network

被引：0

作者：

Dwivedi, Kshitij ^{[1
]}

Singh, Nitin ^{[2
]}

Shanmugham, Sabari R. ^{[3
]}

Kumar, Manoj ^{[2
]}

机构：

[1] Singapore Univ Technol & Design, Singapore, Singapore

[2] Samsung Res Inst, Bengaluru, India

[3] DataRobot, Singapore, Singapore

来源：

PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2 | 2020年 / 1024卷

关键词：

Saliency; Neural networks; Multi-scale; MODEL;

D O I：

10.1007/978-981-32-9291-8_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Predicting where humans look in a given scene is a well-known problem with multiple applications in consumer cameras, human-computer interaction, robotics, and gaming. With large-scale image datasets available for human fixation, it is now possible to train deep neural networks for generating a fixationmap. Human fixations are a function of both local visual features and global context. We incorporate this in a deep neural network by using global and local features of an image to predict human fixations. We sample multi-scale features of the deep residual network and introduce a new method for incorporating these multi-scale features for the end-to-end training of our network. Our model DeepAttent obtains competitive results on SALICON and iSUN datasets and outperforms state-of-the-art methods on various metrics.

引用

页码：65 / 73

页数：9

共 27 条

[1]

Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596

[2]

[Anonymous], 2016, LSUN 16 LARGE SCALE

[3]

[Anonymous], 2014, CORR

[4] Analysis of scores, datasets, and models in visual saliency prediction [J].

Borji, Ali ;

Tavakoli, Hamed R. ;

Sihite, Dicky N. ;

Itti, Laurent .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :921-928

[5] Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data? [J].

Buhrmester, Michael ;

Kwang, Tracy ;

Gosling, Samuel D. .

PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2011, 6 (01) :3-5

[6] Using semantic content as cues for better scanpath prediction [J].

Cerf, Moran ;

Frady, E. Paxon ;

Koch, Christof .

PROCEEDINGS OF THE EYE TRACKING RESEARCH AND APPLICATIONS SYMPOSIUM (ETRA 2008), 2008, :143-146

[7]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[8]

Guo CL, 2008, PROC CVPR IEEE, P2908

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10] SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [J].

Huang, Xun ;

Shen, Chengyao ;

Boix, Xavier ;

Zhao, Qi .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :262-270

← 1 2 3 →