VISUAL ATTENTION INSPIRED DISTANT VIEW AND CLOSE-UP VIEW CLASSIFICATION

被引:0
作者
Tong, Song [1 ]
Loh, Yuen Peng [2 ]
Liang, Xuefeng [1 ]
Kumada, Takatsune [1 ]
机构
[1] Kyoto Univ, IST, Grad Sch Informat, Kyoto, Japan
[2] Univ Malaya, Ctr Image & Signal Proc, Kuala Lumpur, Malaysia
来源
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2016年
关键词
Distant and close-up view; focus cue; scale cue; visual attention; SCENE; DEPTH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The images of distant view and close-up view indicate a photographers' attention which can be further utilized for user behavior analysis and scene evaluation. As images may compose arbitrary contexts, distant view and close-up view classification becomes non-trivial. In this work, we found two cues can represent human visual attention, i.e. focus cue and scale cue. We model the focus cue in frequency domain using the Discrete Wavelet Transform, and employ signal distribution as the focus feature. For the scale cue, we model it by defining a spatial size and a conceptual size in the image using the Edge Box and Convolution Neural Network. By integrating these two models, a robust scheme is proposed for this non-trivial task. Experiments on a newly retrieved dataset, which has 2137 natural images, show the classification accuracy achieves up to 97.3%.
引用
收藏
页码:2787 / 2791
页数:5
相关论文
共 17 条
[1]  
[Anonymous], 2013, Decaf: A deep convolutional activation feature for generic visual recognition
[2]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[3]  
Chun-Lien S., 2013, Industrial Commercial Power Systems Technical Conf (ICPS), 2013 IEEE/IAS 49th, P1
[4]   Video Saliency Incorporating Spatiotemporal Cues and Uncertainty Weighting [J].
Fang, Yuming ;
Wang, Zhou ;
Lin, Weisi ;
Fang, Zhijun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) :3910-3921
[5]   Text detection in images based on unsupervised classification of high-frequency wavelet coefficients [J].
Gllavata, J ;
Ewerth, R ;
Freisleben, B .
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, 2004, :425-428
[6]   Smoke detection in video using wavelets and support vector machines [J].
Gubbi, Jayavardhana ;
Marusic, Slaven ;
Palaniswami, Marimuthu .
FIRE SAFETY JOURNAL, 2009, 44 (08) :1110-1115
[7]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[8]   Estimating Scene-Oriented Pseudo Depth With Pictorial Depth Cues [J].
Lee, Jaeho ;
Yoo, Seungwoo ;
Kim, Changick ;
Vasudev, Bhaskaran .
IEEE TRANSACTIONS ON BROADCASTING, 2013, 59 (02) :238-250
[9]   Modeling the shape of the scene: A holistic representation of the spatial envelope [J].
Oliva, A ;
Torralba, A .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 42 (03) :145-175
[10]   A wavelet-based image fusion tutorial [J].
Pajares, G ;
de la Cruz, JM .
PATTERN RECOGNITION, 2004, 37 (09) :1855-1872