LGLFormer: Local-Global Lifting Transformer for Remote Sensing Scene Parsing

被引:5
作者
Yang, Yuting [1 ]
Jiao, Licheng [1 ]
Li, Lingling [1 ]
Liu, Xu [1 ]
Liu, Fang [1 ]
Chen, Puhua [1 ]
Yang, Shuyuan [1 ]
机构
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding,, Xian 710071, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Convolutional neural network (CNN); lifting scheme; local-global (LG); remote sensing; transformer; wavelet; REPRESENTATION; NETWORK;
D O I
10.1109/TGRS.2023.3344116
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In deep learning, convolutional neural networks (CNNs) and transformers have gained excellent achievements in remote sensing scene parsing. Strong feature representation ability is still a challenge for them. Besides, the complex scenes are still essential challenges for deep learning in remote sensing scene parsing. In this article, an efficient local-global lifting transformer (LGLFormer) framework is proposed to ease the challenges above. It effectively combines CNNs, transformer, and wavelet transform to build a strong local-global (LG) feature representation network. Besides, global feature learning driven by LG adaptive features is proposed based on the 2-D LG adaptive feature extractor (LGAFE) and refined global feature attention module. The 2-D LG lifting feature extractor is inspired by the lifting scheme, which introduces local and global dependency. Furthermore, two LG lifting schemes are proposed, including the series and parallel modes, which can effectively learn LG relations between pixels. Finally, experiments are validated on three remote sensing benchmark datasets. The proposed LGLFormer achieves the state-of-the-art with 99.02%, 99.2%, and 99.48% overall accuracy (OA) on AID, WHU-RS19, and UCM datasets, respectively. In addition, LGLFormer shows good convergence with competitive parameters. The experimental code will be available at https://github.com/yutinyang/LGLFormer.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 51 条
  • [1] Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    van de Weijer, Joost
    Molinier, Matthieu
    Laaksonen, Jorma
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 : 74 - 85
  • [2] Remote Sensing Image Scene Classification Using Multiscale Feature Fusion Covariance Network With Octave Convolution
    Bai, Lin
    Liu, Qingxin
    Li, Cuiling
    Ye, Zhen
    Hui, Meng
    Jia, Xiuping
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Rodriguez MXB, 2020, IEEE WINT CONF APPL, P3100, DOI [10.1109/wacv45572.2020.9093580, 10.1109/WACV45572.2020.9093580]
  • [4] Multi-features extraction based on deep learning for skin lesion classification
    Benyahia, Samia
    Meftah, Boudjelal
    Lezoray, Olivier
    [J]. TISSUE & CELL, 2022, 74
  • [5] Multi-scale stacking attention pooling for remote sensing scene classification
    Bi, Qi
    Zhang, Han
    Qin, Kun
    [J]. NEUROCOMPUTING, 2021, 436 : 147 - 161
  • [6] APDC-Net: Attention Pooling-Based Convolutional Network for Aerial Scene Classification
    Bi, Qi
    Qin, Kun
    Zhang, Han
    Xie, Jiafen
    Li, Zhili
    Xu, Kai
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) : 1603 - 1607
  • [7] Deep Feature Fusion for VHR Remote Sensing Scene Classification
    Chaib, Souleyman
    Liu, Huan
    Gu, Yanfeng
    Yao, Hongxun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08): : 4775 - 4784
  • [8] Remote Sensing Image Scene Classification: Benchmark and State of the Art
    Cheng, Gong
    Han, Junwei
    Lu, Xiaoqiang
    [J]. PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1865 - 1883
  • [9] Chu X., 2022, PROC 11 INT C LEARN
  • [10] Dai Z, 2021, ADV NEUR IN, V34