VISUAL ATTENTION INSPIRED DISTANT VIEW AND CLOSE-UP VIEW CLASSIFICATION

被引：0

作者：

Tong, Song ^{[1
]}

Loh, Yuen Peng ^{[2
]}

Liang, Xuefeng ^{[1
]}

Kumada, Takatsune ^{[1
]}

机构：

[1] Kyoto Univ, IST, Grad Sch Informat, Kyoto, Japan

[2] Univ Malaya, Ctr Image & Signal Proc, Kuala Lumpur, Malaysia

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2016年

关键词：

Distant and close-up view; focus cue; scale cue; visual attention; SCENE; DEPTH;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The images of distant view and close-up view indicate a photographers' attention which can be further utilized for user behavior analysis and scene evaluation. As images may compose arbitrary contexts, distant view and close-up view classification becomes non-trivial. In this work, we found two cues can represent human visual attention, i.e. focus cue and scale cue. We model the focus cue in frequency domain using the Discrete Wavelet Transform, and employ signal distribution as the focus feature. For the scale cue, we model it by defining a spatial size and a conceptual size in the image using the Edge Box and Convolution Neural Network. By integrating these two models, a robust scheme is proposed for this non-trivial task. Experiments on a newly retrieved dataset, which has 2137 natural images, show the classification accuracy achieves up to 97.3%.

引用

页码：2787 / 2791

页数：5

共 17 条

[1]

[Anonymous], 2013, Decaf: A deep convolutional activation feature for generic visual recognition

[2] LIBSVM: A Library for Support Vector Machines [J].