A Lightweight and Discriminative Model for Remote Sensing Scene Classification With Multidilation Pooling Module

被引:103
作者
Zhang, Bin [1 ]
Zhang, Yongjun [1 ]
Wang, Shugen [1 ]
机构
[1] Wuhan Univ, Dept Photogrammetry, Sch Remote Sensing & Informat Engn, Wuhan 430079, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention mechanism; convolutional neural network (CNN); dilated convolution; remote sensing image; scene classification; CONVOLUTIONAL NEURAL-NETWORKS; FEATURES;
D O I
10.1109/JSTARS.2019.2919317
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the growing spatial resolution of satellite images, high spatial resolution (HSR) remote sensing imagery scene classification has become a challenging task due to the highly complex geometrical structures and spatial patterns in HSR imagery. The key issue in scene classification is how to understand the semantic content of the images effectively, and researchers have been looking for ways to improve the process. Convolutional neural networks (CNNs), which have achieved amazing results in natural image classification, were introduced for remote sensing image scene classification. Most of the researches to date have improved the final classification accuracy by merging the features of CNNs. However, the entire models become relatively complex and cannot extract more effective features. To solve this problem, in this paper, we propose a lightweight and effective CNN which is capable of maintaining high accuracy. We use MobileNet V2 as a base network and introduce the dilated convolution and channel attention to extract discriminative features. To improve the performance of the CNN further, we also propose a multidilation pooling module to extract multiscale features. Experiments are performed on six datasets, and the results verify that our method can achieve higher accuracy compared to the current state-of-the-art methods.
引用
收藏
页码:2636 / 2653
页数:18
相关论文
共 75 条
[1]  
[Anonymous], P 3 INT C LEARNING R
[2]  
[Anonymous], 2015, ARXIV PREPRINT ARXIV
[3]  
[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[4]  
[Anonymous], P 23 ACM SIGSP INT C
[5]  
[Anonymous], 2017, P IEEE, DOI DOI 10.1109/JPROC.2017.2675998
[6]  
[Anonymous], ADV NEURAL INFORM PR
[7]  
[Anonymous], 2017, Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications
[8]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85
[9]   Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32
[10]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784