Co-Enhanced Global-Part Integration for Remote-Sensing Scene Classification

Cited by: 1

Authors:

Zhao, Yichen ^{[1,2,3,4]}

Chen, Yaxiong ^{[1,2,3,4]}

Xiong, Shengwu ^{[1,2,3,4]}

Lu, Xiaoqiang ^{[5]}

Zhu, Xiao Xiang ^{[6]}

Mou, Lichao ^{[6]}

Affiliations:

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya 572000, Peoples R China

[3] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China

[4] Wuhan Univ Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China

[5] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China

[6] Tech Univ Munich, Chair Data Sci Earth Observat, D-80333 Munich, Germany

Source:

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62

Keywords:

Feature extraction; Semantics; Context modeling; Training; Technological innovation; Remote sensing; Convolutional neural networks; Attention; convolutional neural networks (CNNs); discriminative part discovery; remote-sensing (RS); scene classification

DOI:

10.1109/TGRS.2024.3367877

Chinese Library Classification (CLC) number:

P3 [Geophysics]; P59 [Geochemistry];

Discipline classification codes:

0708 ; 070902 ;

Abstract:

Remote-sensing (RS) scene classification aims to group RS images with similar scene characteristics into one category. Many RS images have complex backgrounds, rich content, and multiscale targets, so they exhibit both intraclass separation and interclass convergence. Discriminative feature representations that highlight the differences between classes are therefore the key to RS scene classification. Existing methods represent scene images by extracting either global context or discriminative part features. However, global-based methods often miss salient details that distinguish similar RS scenes, while part-based methods tend to ignore the relationships between local ground objects, which weakens the discriminative feature representation. In this article, we propose to combine global context and part-level discriminative features within a unified framework called CGINet for accurate RS scene classification. Specifically, we develop a light context-aware attention block (LCAB) that explicitly models the global context to obtain larger receptive fields and contextual information. A co-enhanced loss module (CELM) is also devised to encourage the model to actively locate discriminative parts for feature enhancement. CELM is used only during training and is not activated during inference, which limits the computational cost it introduces. Benefiting from LCAB and CELM, the proposed CGINet improves the discriminability of features and thereby classification performance. Comprehensive experiments on four benchmark datasets show that the proposed method achieves consistent performance gains over state-of-the-art (SOTA) RS scene classification methods.

Citation

Pages: 1 / 14

Page count: 14
