Global-Local Attention Network for Aerial Scene Classification

被引:64
作者
Guo, Yiyou [1 ]
Ji, Jinsheng [1 ]
Lu, Xiankai [1 ]
Huo, Hong [1 ]
Fang, Tao [1 ]
Li, Deren [2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Automat, Shanghai 200240, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Scene classification; global-local attention network; deep learning; remote sensing; CONVOLUTIONAL NEURAL-NETWORKS; FUSION; IMAGES; FRAMEWORK; FEATURES; MODEL; BAG;
D O I
10.1109/ACCESS.2019.2918732
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The classification performance of aerial scenes relies heavily on the discriminative power of feature representation from high-spatial resolution remotely sensed imagery. The convolutional neural networks (CNNs) have recently been applied to adaptively learn image features at different levels of abstraction rather than requiring handcrafted features and achieved state-of-the-art performance. However, most of these networks focus on multi-stage global feature learning yet neglect the local information, which plays an important role in scene recognition. To address this issue, a novel end-to-end global-local attention network (GLANet) is proposed to capture both global and local information for aerial scene classification. FC layers in the VGGNet are replaced by the global attention (GA) branch and local attention (LA) branch, one of which learns the global information while the other learns the local semantic information via attention mechanisms. During each training, the labels of input images can be predicted by the local, global, and their concatenated features using softmax. According to different predicted labels, two auxiliary loss functions are further computed and imposed on the proposed network to enhance the supervision for network learning. The experimental results on three challenging large-scale scene datasets demonstrate the effectiveness of the proposed global-local attention network.
引用
收藏
页码:67200 / 67212
页数:13
相关论文
共 55 条
[1]  
[Anonymous], P 3 INT C LEARNING R
[2]  
[Anonymous], 2018, INT J ADV RES COMPUT, DOI DOI 10.26483/IJARCS.V9I2.5897
[3]  
[Anonymous], P IEEE C COMP VIS PA
[4]  
[Anonymous], 2017, P IEEE, DOI DOI 10.1109/JPROC.2017.2675998
[5]  
[Anonymous], 2018, REMOTE SENS
[6]  
[Anonymous], IEEE ACCESS
[7]  
[Anonymous], IEEE T GEOSCI REMOTE
[8]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85
[9]   Fusing Local and Global Features for High-Resolution Scene Classification [J].
Bian, Xiaoyong ;
Chen, Chen ;
Tian, Long ;
Du, Qian .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (06) :2889-2901
[10]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784