An Attention Cascade Global-Local Network for Remote Sensing Scene Classification

被引:27
作者
Shen, Junge [1 ]
Yu, Tianwei [1 ]
Yang, Haopeng [1 ]
Wang, Ruxin [2 ]
Wang, Qi [1 ,3 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Yunnan Univ, Sch Software, Engn Res Ctr Cyberspace, Kunming 650106, Yunnan, Peoples R China
[3] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
remote sensing scene classification; convolutional neural network; neural architecture search; feature fusion; CONVOLUTIONAL NEURAL-NETWORKS; FUSION; MECHANISM; FEATURES;
D O I
10.3390/rs14092042
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Remote sensing image scene classification is an important task of remote sensing image interpretation, which has recently been well addressed by the convolutional neural network owing to its powerful learning ability. However, due to the multiple types of geographical information and redundant background information of the remote sensing images, most of the CNN-based methods, especially those based on a single CNN model and those ignoring the combination of global and local features, exhibit limited performance on accurate classification. To compensate for such insufficiency, we propose a new dual-model deep feature fusion method based on an attention cascade global-local network (ACGLNet). Specifically, we use two popular CNNs as the feature extractors to extract complementary multiscale features from the input image. Considering the characteristics of the global and local features, the proposed ACGLNet filters the redundant background information from the low-level features through the spatial attention mechanism, followed by which the locally attended features are fused with the high-level features. Then, bilinear fusion is employed to produce the fused representation of the dual model, which is finally fed to the classifier. Through extensive experiments on four public remote sensing scene datasets, including UCM, AID, PatternNet, and OPTIMAL-31, we demonstrate the feasibility of the proposed method and its superiority over the state-of-the-art scene classification methods.
引用
收藏
页数:20
相关论文
共 60 条
[1]  
[Anonymous], 2018, INT J ADV RES COMPUT, DOI DOI 10.26483/IJARCS.V9I2.5897
[2]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85
[3]  
Castelluccio M., 2015, ARXIV
[4]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[5]   Land-use scene classification using multi-scale completed local binary patterns [J].
Chen, Chen ;
Zhang, Baochang ;
Su, Hongjun ;
Li, Wei ;
Wang, Lu .
SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (04) :745-752
[6]   Review of Image Classification Algorithms Based on Convolutional Neural Networks [J].
Chen, Leiyu ;
Li, Shaobo ;
Bai, Qiang ;
Yang, Jing ;
Jiang, Sanlong ;
Miao, Yanming .
REMOTE SENSING, 2021, 13 (22)
[7]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[8]   Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei ;
Guo, Lei ;
Han, Jungong .
IET COMPUTER VISION, 2015, 9 (05) :639-647
[9]  
Fan RQ, 2019, ASIA PACIF MICROWAVE, P1349, DOI [10.1109/APMC46564.2019.9038422, 10.1109/apmc46564.2019.9038422]
[10]   Compact Bilinear Pooling [J].
Gao, Yang ;
Beijbom, Oscar ;
Zhang, Ning ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326