LGLFormer: Local-Global Lifting Transformer for Remote Sensing Scene Parsing

被引：5

作者：

Yang, Yuting ^{[1
]}

Jiao, Licheng ^{[1
]}

Li, Lingling ^{[1
]}

Liu, Xu ^{[1
]}

Liu, Fang ^{[1
]}

Chen, Puhua ^{[1
]}

Yang, Shuyuan ^{[1
]}

机构：

[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding,, Xian 710071, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Convolutional neural network (CNN); lifting scheme; local-global (LG); remote sensing; transformer; wavelet; REPRESENTATION; NETWORK;

D O I：

10.1109/TGRS.2023.3344116

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

In deep learning, convolutional neural networks (CNNs) and transformers have gained excellent achievements in remote sensing scene parsing. Strong feature representation ability is still a challenge for them. Besides, the complex scenes are still essential challenges for deep learning in remote sensing scene parsing. In this article, an efficient local-global lifting transformer (LGLFormer) framework is proposed to ease the challenges above. It effectively combines CNNs, transformer, and wavelet transform to build a strong local-global (LG) feature representation network. Besides, global feature learning driven by LG adaptive features is proposed based on the 2-D LG adaptive feature extractor (LGAFE) and refined global feature attention module. The 2-D LG lifting feature extractor is inspired by the lifting scheme, which introduces local and global dependency. Furthermore, two LG lifting schemes are proposed, including the series and parallel modes, which can effectively learn LG relations between pixels. Finally, experiments are validated on three remote sensing benchmark datasets. The proposed LGLFormer achieves the state-of-the-art with 99.02%, 99.2%, and 99.48% overall accuracy (OA) on AID, WHU-RS19, and UCM datasets, respectively. In addition, LGLFormer shows good convergence with competitive parameters. The experimental code will be available at https://github.com/yutinyang/LGLFormer.

引用

页码：1 / 13

页数：13

共 51 条

[1] Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification
Anwer, Rao Muhammad
Khan, Fahad Shahbaz
van de Weijer, Joost
Molinier, Matthieu
Laaksonen, Jorma
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 : 74 - 85
[2] Remote Sensing Image Scene Classification Using Multiscale Feature Fusion Covariance Network With Octave Convolution
Bai, Lin
Liu, Qingxin
Li, Cuiling
Ye, Zhen
Hui, Meng
Jia, Xiuping
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[3] Rodriguez MXB, 2020, IEEE WINT CONF APPL, P3100, DOI [10.1109/wacv45572.2020.9093580, 10.1109/WACV45572.2020.9093580]
[4] Multi-features extraction based on deep learning for skin lesion classification
Benyahia, Samia
Meftah, Boudjelal
Lezoray, Olivier
[J]. TISSUE & CELL, 2022, 74
[5] Multi-scale stacking attention pooling for remote sensing scene classification
Bi, Qi
Zhang, Han
Qin, Kun
[J]. NEUROCOMPUTING, 2021, 436 : 147 - 161
[6] APDC-Net: Attention Pooling-Based Convolutional Network for Aerial Scene Classification
Bi, Qi
Qin, Kun
Zhang, Han
Xie, Jiafen
Li, Zhili
Xu, Kai
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) : 1603 - 1607
[7] Deep Feature Fusion for VHR Remote Sensing Scene Classification
Chaib, Souleyman
Liu, Huan
Gu, Yanfeng
Yao, Hongxun
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08): : 4775 - 4784
[8] Remote Sensing Image Scene Classification: Benchmark and State of the Art
Cheng, Gong
Han, Junwei
Lu, Xiaoqiang
[J]. PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1865 - 1883
[9] Chu X., 2022, PROC 11 INT C LEARN
[10] Dai Z, 2021, ADV NEUR IN, V34

← 1 2 3 4 5 6 →