SPATIAL ENSEMBLE KERNEL LEARNING FOR SCENE CLASSIFICATION

被引：0

作者：

Zhang, Lei ^{[1
]}

Zhen, Xiantong ^{[2
]}

Zhang, Qiujing ^{[1
]}

机构：

[1] Guangdong Univ Petrochem Technol, Coll Comp & Elect Informat, Maoming, Peoples R China

[2] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

基金：

美国国家科学基金会;

关键词：

Spatial Ensemble Kernel; CNNs; Fourier Feature Embedding; Spatial Pyramid Kernel; Scene Classification;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Scene recognition is one of the most important tasks in computer vision. Apart from appearance, spatial layout carries the crucial cue for discriminative representation. In this paper, we propose spatial ensemble kernel (SEK) learning, which enables fusion of multi-scale spatial information to achieve compact while discriminative representation of scenes. Based on the spatial pyramid, SEK combines the CNN features in each level of the pyramid in an ensemble and fuse them by kernels. By kernel approximation, we achieve Fourier feature embedding of CNN features in each scale, which establishes a nonlinear layer of the neural network with a cosine activation function. The parameters of the nonlinear layer can be learned jointly in one single optimization framework by supervised learning, which enables compact and discriminative feature representations. We show the effectiveness of the proposed SEK on two recent scene benchmark datasets, i.e., MIT indoor and SUN 397. The propose SEK produces high performance on two datasets which are competitive to state-of-the-art algorithms.

引用

页码：1303 / 1307

页数：5

共 24 条

[1]

[Anonymous], 2016, EUR C MACH LEARN ECM

[2]

[Anonymous], 2013, DECAF DEEP CONVOLUTI

[3]

[Anonymous], 2015, abs/1506.03365

[4]

[Anonymous], P IEEE C COMP VIS PA

[5]

[Anonymous], 2011, FOURIER ANAL GROUPS

[6]

[Anonymous], 2014, Advances in neural information processing systems

[7]

Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]

[8] All about VLAD [J].

Arandjelovic, Relja ;

Zisserman, Andrew .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :1578-1585

[9]

Aubry Mathieu, 2014, PAINTING TO 3D MODEL

[10] Total recall: Automatic query expansion with a generative feature model for object retrieval [J].

Chum, Ondrej ;

Philbin, James ;

Sivic, Josef ;

Isard, Michael ;

Zisserman, Andrew .

2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :496-+

← 1 2 3 →