Co-Enhanced Global-Part Integration for Remote-Sensing Scene Classification

被引:1
作者
Zhao, Yichen [1 ,2 ,3 ,4 ]
Chen, Yaxiong [1 ,2 ,3 ,4 ]
Xiong, Shengwu [1 ,2 ,3 ,4 ]
Lu, Xiaoqiang [5 ]
Zhu, Xiao Xiang [6 ]
Mou, Lichao [6 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya 572000, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[4] Wuhan Univ Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China
[5] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China
[6] Tech Univ Munich, Chair Data Sci Earth Observat, D-80333 Munich, Germany
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Feature extraction; Semantics; Context modeling; Training; Technological innovation; Remote sensing; Convolutional neural networks; Attention; convolutional neural networks (CNNs); discriminative part discovery; remote-sensing (RS); scene classification; ATTENTION;
D O I
10.1109/TGRS.2024.3367877
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Remote-sensing (RS) scene classification aims to classify RS images with similar scene characteristics into one category. Plenty of RS images are complex in background, rich in content, and multiscale in target, exhibiting the characteristics of both intraclass separation and interclass convergence. Therefore, discriminative feature representations designed to highlight the differences between classes are the key to RS scene classification. Existing methods represent scene images by extracting either global context or discriminative part features from RS images. However, global-based methods often lack salient details in similar RS scenes, while part-based methods tend to ignore the relationships between local ground objects, thus weakening the discriminative feature representation. In this article, we propose to combine global context and part-level discriminative features within a unified framework called CGINet for accurate RS scene classification. To be specific, we develop a light context-aware attention block (LCAB) to explicitly model the global context to obtain larger receptive fields and contextual information. A co-enhanced loss module (CELM) is also devised to encourage the model to actively locate discriminative parts for feature enhancement. In particular, CELM is only used during training and not activated during inference, which introduces less computational cost. Benefiting from LCAB and CELM, our proposed CGINet improves the discriminability of features, thereby improving classification performance. Comprehensive experiments over four benchmark datasets show that the proposed method achieves consistent performance gains over state-of-the-art (SOTA) RS scene classification methods.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 51 条
  • [1] Remote Sensing Image Scene Classification Using Multiscale Feature Fusion Covariance Network With Octave Convolution
    Bai, Lin
    Liu, Qingxin
    Li, Cuiling
    Ye, Zhen
    Hui, Meng
    Jia, Xiuping
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification
    Chang, Dongliang
    Ding, Yifeng
    Xie, Jiyang
    Bhunia, Ayan Kumar
    Li, Xiaoxu
    Ma, Zhanyu
    Wu, Ming
    Guo, Jun
    Song, Yi-Zhe
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 4683 - 4695
  • [3] Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks
    Chattopadhay, Aditya
    Sarkar, Anirban
    Howlader, Prantik
    Balasubramanian, Vineeth N.
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 839 - 847
  • [4] Chen S., 2022, PROC 10 INT C LEARN, P1
  • [5] Remote Sensing Scene Classification via Multi-Branch Local Attention Network
    Chen, Si-Bao
    Wei, Qing-Song
    Wang, Wen-Zhong
    Tang, Jin
    Luo, Bin
    Wang, Zu-Yuan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 99 - 109
  • [6] Remote Sensing Scene Classification by Local-Global Mutual Learning
    Chen, Xiumei
    Zheng, Xiangtao
    Zhang, Yue
    Lu, Xiaoqiang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [7] Remote Sensing Image Scene Classification: Benchmark and State of the Art
    Cheng, Gong
    Han, Junwei
    Lu, Xiaoqiang
    [J]. PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1865 - 1883
  • [8] Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    Guo, Lei
    Han, Jungong
    [J]. IET COMPUTER VISION, 2015, 9 (05) : 639 - 647
  • [9] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [10] ArcFace: Additive Angular Margin Loss for Deep Face Recognition
    Deng, Jiankang
    Guo, Jia
    Xue, Niannan
    Zafeiriou, Stefanos
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4685 - 4694