A Deep Scene Representation for Aerial Scene Classification

被引:118
作者
Zheng, Xiangtao [1 ]
Yuan, Yuan [2 ]
Lu, Xiaoqiang [1 ]
机构
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol, Xian 710119, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr Opt Imagery Anal & Learning, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2019年 / 57卷 / 07期
基金
中国国家自然科学基金;
关键词
Aerial scene classification; convolutional neural networks (CNNs); Fisher vector (FV); multiscale representation; CONVOLUTIONAL NEURAL-NETWORKS; IMAGE CLASSIFICATION; FEATURES; SCALE;
D O I
10.1109/TGRS.2019.2893115
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
As a fundamental problem in earth observation, aerial scene classification tries to assign a specific semantic label to an aerial image. In recent years, the deep convolutional neural networks (CNNs) have shown advanced performances in aerial scene classification. The successful pretrained CNNs can he transferable to aerial images. However, global CNN activations may lack geometric invariance and, therefore, limit the improvement of aerial scene classification. To address this problem, this paper proposes a deep scene representation to achieve the invariance of CNN features and further enhance the discriminative power. The proposed method: 1) extracts CNN activations from the last convolutional layer of pretrained CNN; 2) performs multiscale pooling (MSP) on these activations; and 3) builds a holistic representation by the Fisher vector method. MSP is a simple and effective multiscale strategy, which enriches multiscale spatial information in affordable computational time. The proposed representation is particularly suited at aerial scenes and consistently outperforms global CNN activations without requiring feature adaptation. Extensive experiments on five aerial scene data sets indicate that the proposed method, even with a simple linear classifier, can achieve the state-of-the-art performance.
引用
收藏
页码:4799 / 4809
页数:11
相关论文
共 53 条
[1]   Asymmetric Adaptation of Deep Features for Cross-Domain Classification in Remote Sensing Imagery [J].
Ammour, Nassim ;
Bashmal, Laila ;
Bazi, Yakoub ;
Al Rahhal, M. M. ;
Zuair, Mansour .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (04) :597-601
[2]  
[Anonymous], IEEE T PATTERN ANAL
[3]  
[Anonymous], COMMUN ACM, DOI [DOI 10.1145/3065386, 10.1145/3065386]
[4]   DeepSat - A Learning framework for Satellite Imagery [J].
Basu, Saikat ;
Ganguly, Sangram ;
Mukhopadhyay, Supratik ;
DiBiano, Robert ;
Karki, Manohar ;
Nemani, Ramakrishna .
23RD ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2015), 2015,
[5]  
Castelluccio M, 2015, ARXIV
[6]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[7]  
Chatfield K., P BRIT MACHINE VISIO
[8]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[9]   Remote Sensing Image Scene Classification Using Bag of Convolutional Features [J].
Cheng, Gong ;
Li, Zhenpeng ;
Yao, Xiwen ;
Guo, Lei ;
Wei, Zhongliang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (10) :1735-1739
[10]  
Cimpoi M, 2015, PROC CVPR IEEE, P3828, DOI 10.1109/CVPR.2015.7299007