A Grad-CAM and capsule network hybrid method for remote sensing image scene classification

被引:1
作者
He, Zhan [1 ,4 ]
Zhang, Chunju [2 ]
Wang, Shu [3 ]
Huang, Jianwei [4 ]
Zheng, Xiaoyun [1 ]
Jiang, Weijie [4 ]
Bo, Jiachen [4 ]
Yang, Yucheng [4 ]
机构
[1] Minist Nat Resources, Shenzhen Data Management Ctr Planning & Nat Resour, Key Lab Urban Land Resources Monitoring & Simulat, Shenzhen 518000, Peoples R China
[2] Minist Nat Resources, Key Lab Jianghuai Arable Land Resources Protect &, Hefei 230088, Peoples R China
[3] Chinese Acad Sci, State Key Lab Resources & Environm Informat Syst, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
[4] Hefei Univ Technol, Sch Civil Engn, Hefei 230009, Peoples R China
关键词
image scene classification; CNN; Gaad-CAM; CapsNet; DenseNet; FUSION FRAMEWORK; ATTENTION;
D O I
10.1007/s11707-022-1079-x
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Remote sensing image scene classification and remote sensing technology applications are hot research topics. Although CNN-based models have reached high average accuracy, some classes are still misclassified, such as "freeway," "spare residential," and "commercial_area." These classes contain typical decisive features, spatial-relation features, and mixed decisive and spatial-relation features, which limit high-quality image scene classification. To address this issue, this paper proposes a Grad-CAM and capsule network hybrid method for image scene classification. The Grad-CAM and capsule network structures have the potential to recognize decisive features and spatial-relation features, respectively. By using a pre-trained model, hybrid structure, and structure adjustment, the proposed model can recognize both decisive and spatial-relation features. A group of experiments is designed on three popular data sets with increasing classification difficulties. In the most advanced experiment, 92.67% average accuracy is achieved. Specifically, 83%, 75%, and 86% accuracies are obtained in the classes of "church," "palace," and "commercial_area," respectively. This research demonstrates that the hybrid structure can effectively improve performance by considering both decisive and spatial-relation features. Therefore, Grad-CAM-CapsNet is a promising and powerful structure for image scene classification.
引用
收藏
页码:538 / 553
页数:16
相关论文
共 47 条
[1]  
Abai Z., 2019, ARXIV
[2]   A Novel Statistical Method for Scene Classification Based on Multi-Object Categorization and Logistic Regression [J].
Ahmed, Abrar ;
Jalal, Ahmad ;
Kim, Kibum .
SENSORS, 2020, 20 (14) :1-20
[3]  
[Anonymous], 2015, arXiv
[4]  
[Anonymous], 2017, ADV NEURAL INF PROCE
[5]   Growing random forest on deep convolutional neural networks for scene categorization [J].
Bai, Shuang .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 71 :279-287
[6]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[7]   Remote Sensing Scene Classification Based on Convolutional Neural Networks Pre-Trained Using Attention-Guided Sparse Filters [J].
Chen, Jingbo ;
Wang, Chengyi ;
Ma, Zhong ;
Chen, Jiansheng ;
He, Dongxu ;
Ackland, Stephen .
REMOTE SENSING, 2018, 10 (02)
[8]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[9]   Remote Sensing Image Scene Classification Using Bag of Convolutional Features [J].
Cheng, Gong ;
Li, Zhenpeng ;
Yao, Xiwen ;
Guo, Lei ;
Wei, Zhongliang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (10) :1735-1739
[10]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883