Classification of Remote Sensing Images Using EfficientNet-B3 CNN Model With Attention

被引:184
作者
Alhichri, Haikel [1 ]
Alswayed, Asma S. [1 ]
Bazi, Yakoub [1 ]
Ammour, Nassim [1 ]
Alajlan, Naif A. [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Comp Engn Dept, Adv Lab Intelligent Syst Res ALISR, Riyadh 11543, Saudi Arabia
关键词
Remote sensing; scene classification; EfficientNet-B3; convolutional neural networks (CNNs); attention mechanisms; CONVOLUTIONAL NEURAL-NETWORKS; SCENE CLASSIFICATION; COLLABORATIVE REPRESENTATION; FEATURES;
D O I
10.1109/ACCESS.2021.3051085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene classification is a highly useful task in Remote Sensing (RS) applications. Many efforts have been made to improve the accuracy of RS scene classification. Scene classification is a challenging problem, especially for large datasets with tens of thousands of images with a large number of classes and taken under different circumstances. One problem that is observed in scene classification is the fact that for a given scene, only one part of it indicates which class it belongs to, whereas the other parts are either irrelevant or they actually tend to belong to another class. To address this issue, this paper proposes a deep attention Convolutional Neural Network (CNN) for scene classification in remote sensing. CNN models use successive convolutional layers to learn feature maps from larger and larger regions (or receptive fields) of the scene. The attention mechanism computes a new feature map as a weighted average of these original feature maps. In particular, we propose a solution, named EfficientNet-B3-Attn-2, based on the pre-trained EfficientNet-B3 CNN enhanced with an attention mechanism. A dedicated branch is added to layer 262 of the network, to compute the required weights. These weights are learned automatically by training the whole CNN model end-to-end using the backpropagation algorithm. In this way, the network learns to emphasize important regions of the scene and suppress the regions that are irrelevant to the classification. We tested the proposed EfficientNet-B3-Attn-2 on six popular remote sensing datasets, namely UC Merced, KSA, OPTIMAL-31, RSSCN7, WHU-RS19, and AID datasets, showing its strong capabilities in classifying RS scenes.
引用
收藏
页码:14078 / 14094
页数:17
相关论文
共 86 条
[1]  
Alhichri H, REMOTE SENSING DATAS
[2]  
Alhichri H., KSA REMOTE SENSING D
[3]  
Alhichri H. S, 2019, P INT C COMP INF SCI, P1
[4]  
Alswayed A.S., 2020, 2020 3 INT C COMP AP, P1, DOI DOI 10.1109/ICCAIS48893.2020.9096876
[5]  
[Anonymous], 2016, ARXIV161100471
[6]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85
[7]   Fusing Local and Global Features for High-Resolution Scene Classification [J].
Bian, Xiaoyong ;
Chen, Chen ;
Tian, Long ;
Du, Qian .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (06) :2889-2901
[8]  
Castelluccio M., 2015, Acta Ecol. Sin
[9]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[10]   Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities [J].
Cheng, Gong ;
Xie, Xingxing ;
Han, Junwei ;
Guo, Lei ;
Xia, Gui-Song .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :3735-3756