MSE-Net: A novel master-slave encoding network for remote sensing scene classification

被引:4
作者
Yue, Hongguang [1 ]
Qing, Linbo [1 ]
Zhang, Zhixuan [2 ]
Wang, Zhengyong [1 ]
Guo, Li [3 ]
Peng, Yonghong [3 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu 610065, Peoples R China
[3] Manchester Metropolitan Univ, Dept Comp & Math, Manchester M1 5GD, England
关键词
Remote sensing scene classification; Convolutional neural networks; Visual transformers; Feature fusion; CONVOLUTIONAL NEURAL-NETWORKS; FEATURE FUSION; RECOGNITION; SCALE;
D O I
10.1016/j.engappai.2024.107909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Remote sensing scene (RSS) image classification plays a vital role in various fields such as urban planning and environmental protection. However, due to higher inter-class similarity and intra-class variability, achieving accurate classification for RSS images poses a considerable challenge for current convolutional neural networks (CNNs)-based and visual transformer (ViT)-based methods. To address these issues, this paper proposes a novel dual-encoding method named master-slave encoding network (MSE-Net) from two perspectives of feature extraction and fusion. The master encoder, based on ViT, extracts higher-level semantic features, while the slave encoder, based on CNN, captures relative lower-level spatial structure information. Secondly, to integrate feature information from the two encoders effectively, this paper further develop two fusion strategies. The first strategy involves the auxiliary enhancement units (AEUs), which eliminates semantic divergence between the two encoders, enhances spatial context awareness of the slave encoder and promotes effective feature learning. The interactive perception unit (IPU), as the second strategy, facilitates interaction and integration of the two encoders' representations to extract more discriminative feature information. In addition, we conducted comparative experiments on four widely-used RSS datasets, including RSSCN7, SIRI-WHU, the aerial image dataset (AID) and NWPU-RESISC45 (NWPU45), to verify the effectiveness of MSE-Net. The experimental results demonstrate that MSE-Net achieved state -of -the -art (SOTA) performance across all the datasets.
引用
收藏
页数:16
相关论文
共 74 条
[1]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85
[2]   Vision Transformers for Remote Sensing Image Classification [J].
Bazi, Yakoub ;
Bashmal, Laila ;
Rahhal, Mohamad M. Al ;
Dayil, Reham Al ;
Ajlan, Naif Al .
REMOTE SENSING, 2021, 13 (03) :1-20
[3]   Remote Sensing Scene Classification Using Convolutional Features and Deep Forest Classifier [J].
Boualleg, Yaakoub ;
Farah, Mohamed ;
Farah, Imed Riadh .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (12) :1944-1948
[4]  
Cao Y, 2019, IEEE ICC
[5]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[6]   GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification [J].
Chen, Weitao ;
Ouyang, Shubing ;
Tong, Wei ;
Li, Xianju ;
Zheng, Xiongwei ;
Wang, Lizhe .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :1150-1162
[7]   Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities [J].
Cheng, Gong ;
Xie, Xingxing ;
Han, Junwei ;
Guo, Lei ;
Xia, Gui-Song .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :3735-3756
[8]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[9]   Remote Sensing Image Scene Classification Using Bag of Convolutional Features [J].
Cheng, Gong ;
Li, Zhenpeng ;
Yao, Xiwen ;
Guo, Lei ;
Wei, Zhongliang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (10) :1735-1739
[10]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883