Semantic interleaving global channel attention for multilabel remote sensing image classification

被引:6
作者
Liu, Yongkun [1 ]
Ni, Kesong [3 ]
Zhang, Yuhan [4 ]
Zhou, Lijian [2 ]
Zhao, Kun [2 ,5 ]
机构
[1] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China
[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
[3] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[4] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[5] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; multilabel classification; gnn; channel attention; label relation; LEARNING APPROACH; NEURAL-NETWORK; LAND-COVER; RETRIEVAL; FRAMEWORK;
D O I
10.1080/01431161.2023.2297175
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Multilabel remote sensing image classification (MLRSIC) has received increasing research interest. Taking the co-occurrence relationship of multiple labels as additional information helps to improve the overall performance. However, current methods only focus on using it to constrain the final feature which is output from a convolutional neural network (CNN). On the one hand, these methods need to exploit the potential of label correlation in feature representation fully. On the other hand, they increase the label noise sensitivity of the system, resulting in poor robustness. In this paper, a novel method called 'Semantic Interleaving Global chaNnel Attention' (SIGNA) is proposed for MLRSIC. First, the label co-occurrence graph is obtained according to the statistical information of the training set and fed into a graph neural network (GNN) to generate optimal semantic feature representations of each label. Next, the semantic features are interleaved with visual features which are extracted by CNNs to guide the overall features of the input image transform from the original feature space to the semantic feature space with embedded label relations. Then, global attention triggered by semantic interleaving is used to emphasize visual features in important channels. Finally, to make SIGNA easier to use and more optimized, multihead SIGNA-based feature adaptive weighting networks are proposed as plug-in blocks to plug into any layers of a CNN. For remote sensing images, better classification performance can be achieved by inserting the plug-in blocks into the shallow layers of CNNs. We conducted extensive experimental comparisons on three data sets: UCM, AID and DFC15. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. Notes that the codes of this paper will be open to the community for reproducibility research.
引用
收藏
页码:393 / 419
页数:27
相关论文
共 65 条
[1]   Deep Attention Neural Network for Multi-Label Classification in Unmanned Aerial Vehicle Imagery [J].
Alshehri, Aaliyah ;
Bazi, Yakoub ;
Ammour, Nassim ;
Almubarak, Haidar ;
Alajlan, Naif .
IEEE ACCESS, 2019, 7 :119873-119880
[2]  
[Anonymous], 2017, Multi-label Classification of Satellite Images with Deep Learning
[3]  
Bazi Y, 2019, INT GEOSCI REMOTE SE, P2443, DOI [10.1109/igarss.2019.8898895, 10.1109/IGARSS.2019.8898895]
[4]  
Chaudhuri B, 2018, IEEE T GEOSCI REMOTE, V56, P1144, DOI [10.1109/tgrs.2017.2760909, 10.1109/TGRS.2017.2760909]
[5]   Multi-Label Image Recognition with Graph Convolutional Networks [J].
Chen, Zhao-Min ;
Wei, Xiu-Shen ;
Wang, Peng ;
Guo, Yanwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181
[6]   A Novel System for Content-Based Retrieval of Single and Multi-Label High-Dimensional Remote Sensing Images [J].
Dai, Osman Emre ;
Demir, Begum ;
Sankur, Bulent ;
Bruzzone, Lorenzo .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (07) :2473-2490
[7]   Endmember Extraction From Hyperspectral Imagery Based on Probabilistic Tensor Moments [J].
Fernandez-Beltran, Ruben ;
Pla, Filiberto ;
Plaza, Antonio .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (12) :2120-2124
[8]   Hyperspectral Unmixing Based on Dual-Depth Sparse Probabilistic Latent Semantic Analysis [J].
Fernandez-Beltran, Ruben ;
Plaza, Antonio ;
Plaza, Javier ;
Pla, Filiberto .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (11) :6344-6360
[9]   Optimized Kernel Minimum Noise Fraction Transformation for Hyperspectral Image Classification [J].
Gao, Lianru ;
Zhao, Bin ;
Jia, Xiuping ;
Liao, Wenzhi ;
Zhang, Bing .
REMOTE SENSING, 2017, 9 (06)
[10]   High-Resolution SAR Image Classification via Deep Convolutional Autoencoders [J].
Geng, Jie ;
Fan, Jianchao ;
Wang, Hongyu ;
Ma, Xiaorui ;
Li, Baoming ;
Chen, Fuliang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (11) :2351-2355