Semantic interleaving global channel attention for multilabel remote sensing image classification

Cited by: 9
Authors
Liu, Yongkun [1]
Ni, Kesong [3]
Zhang, Yuhan [4]
Zhou, Lijian [2]
Zhao, Kun [2, 5]
Affiliations
[1] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China
[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
[3] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[4] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[5] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Remote sensing; multilabel classification; gnn; channel attention; label relation; LEARNING APPROACH; NEURAL-NETWORK; LAND-COVER; RETRIEVAL; FRAMEWORK;
DOI
10.1080/01431161.2023.2297175
Chinese Library Classification
TP7 [Remote sensing technology];
Subject classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Multilabel remote sensing image classification (MLRSIC) has received increasing research interest. Taking the co-occurrence relationship of multiple labels as additional information helps to improve the overall performance. However, current methods use it only to constrain the final feature output by a convolutional neural network (CNN). On the one hand, these methods have yet to fully exploit the potential of label correlation in feature representation. On the other hand, they increase the label noise sensitivity of the system, resulting in poor robustness. In this paper, a novel method called 'Semantic Interleaving Global chaNnel Attention' (SIGNA) is proposed for MLRSIC. First, the label co-occurrence graph is obtained according to the statistical information of the training set and fed into a graph neural network (GNN) to generate optimal semantic feature representations of each label. Next, the semantic features are interleaved with visual features extracted by CNNs to guide the transformation of the overall features of the input image from the original feature space to the semantic feature space with embedded label relations. Then, global attention triggered by semantic interleaving is used to emphasize visual features in important channels. Finally, to make SIGNA easier to use and more optimized, multihead SIGNA-based feature adaptive weighting networks are proposed as plug-in blocks that can be inserted into any layer of a CNN. For remote sensing images, better classification performance can be achieved by inserting the plug-in blocks into the shallow layers of CNNs. We conducted extensive experimental comparisons on three data sets: UCM, AID and DFC15. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. Note that the code for this paper will be made available to the community for reproducibility research.
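The first step in the abstract, deriving a label co-occurrence graph from training-set statistics, can be illustrated with a minimal sketch. The abstract does not say which statistic is used, so this example assumes the edge weight from label i to label j is the empirical conditional frequency P(j | i), a common choice in label-graph methods. The function name `cooccurrence_graph` and the toy labels are hypothetical and are not taken from the paper or its code.

```python
from typing import List


def cooccurrence_graph(labels: List[List[int]]) -> List[List[float]]:
    """Estimate P(label j | label i) from multi-hot training labels.

    Assumption: conditional frequency is used as the edge weight; the
    abstract only says the graph comes from training-set statistics.
    """
    num_labels = len(labels[0])
    counts = [[0] * num_labels for _ in range(num_labels)]
    for row in labels:
        present = [k for k, v in enumerate(row) if v]
        # Count every ordered pair of labels that appear together;
        # the diagonal counts[i][i] is the number of images with label i.
        for i in present:
            for j in present:
                counts[i][j] += 1
    graph = []
    for i in range(num_labels):
        n_i = counts[i][i]
        # Rows for labels that never occur are left as all zeros.
        graph.append(
            [counts[i][j] / n_i if n_i else 0.0 for j in range(num_labels)]
        )
    return graph


# Hypothetical toy training set: 4 images, 3 labels (multi-hot rows).
toy_labels = [
    [1, 1, 0],
    [1, 0, 0],
    [0, 1, 1],
    [1, 1, 0],
]
graph = cooccurrence_graph(toy_labels)
```

The resulting matrix is generally asymmetric: in the toy data, label 2 always appears together with label 1, while label 1 appears with label 2 in only one of its three images. A matrix of this kind could serve as the adjacency input to a GNN, though how SIGNA normalizes or thresholds it is not stated in this record.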
Pages: 393-419
Number of pages: 27