FCIHMRT: Feature Cross-Layer Interaction Hybrid Method Based on Res2Net and Transformer for Remote Sensing Scene Classification

Cited by: 52
Authors
Huo, Yan [1 ,2 ,3 ]
Gang, Shuang [1 ,2 ,3 ]
Guan, Chao [1 ,2 ,3 ]
Affiliations
[1] Shenyang Univ, Inst Carbon Neutral Technol & Policy, Shenyang 110044, Peoples R China
[2] China Geol Survey, Northeast Geol S&T Innovat Ctr, Shenyang 110034, Peoples R China
[3] Minist Nat Resources, Key Lab Black Soil Evolut & Ecol Effect, Shenyang 110034, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
vision transformer; remote sensing image; Res2Net; scene classification; CONVOLUTIONAL NEURAL-NETWORK; ATTENTION; MODEL;
DOI
10.3390/electronics12204362
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Scene classification is an area of remote sensing image processing that is attracting growing attention. The precision of optical scene classification is limited by complex spatial patterns, high similarity between classes, and high diversity of classes. To address this problem, this paper proposes a feature cross-layer interaction hybrid algorithm for optical remote sensing scene classification. First, features are extracted by two branches, a vision transformer branch and a Res2Net branch, to strengthen the feature extraction capability of the method. A novel interactive attention technique is then proposed that focuses on the strong correlation between the features of the two branches, so that their complementary information is fully used. The extracted features are further refined and merged, and the combined features are used for classification. Experiments on three open-source remote sensing datasets validated the feasibility of the proposed method, which performed better in scene classification tasks than other methods.
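The two-branch pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no architectural details, so the sketch assumes plain scaled dot-product cross-attention without learned projections, a residual connection, mean pooling, and a linear classifier. All function names, the toy feature vectors, and the weights are hypothetical, and the two token lists merely stand in for the outputs of a vision transformer branch and a Res2Net branch.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross_attend(queries, keys_values):
    # Assumption: each feature vector of one branch attends over the
    # feature vectors of the other branch (scaled dot-product attention).
    d = len(queries[0])
    out = []
    for q in queries:
        weights = softmax([dot(q, k) / math.sqrt(d) for k in keys_values])
        attended = [sum(w * v[i] for w, v in zip(weights, keys_values))
                    for i in range(d)]
        # Residual connection keeps the branch's own information.
        out.append([qi + ai for qi, ai in zip(q, attended)])
    return out

def mean_pool(vectors):
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def fuse(vit_tokens, cnn_tokens):
    # Interaction in both directions, then pooling and concatenation.
    vit_refined = cross_attend(vit_tokens, cnn_tokens)
    cnn_refined = cross_attend(cnn_tokens, vit_tokens)
    return mean_pool(vit_refined) + mean_pool(cnn_refined)

def classify(fused, weights, biases):
    # Linear classifier followed by softmax over the scene classes.
    logits = [dot(w, fused) + b for w, b in zip(weights, biases)]
    return softmax(logits)

# Toy stand-ins for the two branches' outputs (hypothetical values).
vit_tokens = [[0.2, 0.1, 0.4], [0.5, 0.3, 0.0]]
cnn_tokens = [[0.1, 0.6, 0.2], [0.3, 0.3, 0.3], [0.0, 0.2, 0.9]]

fused = fuse(vit_tokens, cnn_tokens)
weights = [[0.1] * 6, [-0.1] * 6, [0.05] * 6]  # three hypothetical classes
biases = [0.0, 0.0, 0.0]
probs = classify(fused, weights, biases)
print(probs)
```

In this sketch the fused vector has twice the per-branch feature dimension because the pooled outputs of both branches are concatenated. A real model would learn query, key, and value projections and would typically use a deep learning framework.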
Pages: 15