A High Spatial Resolution Aerial Image Dataset and an Efficient Scene Classification Model

被引:3
作者
Lin, Yi [1 ]
He, Lin [1 ]
Zhong, Daiqi [1 ]
Song, Yufei [1 ]
Wei, Lujia [1 ]
Xin, Liang [1 ,2 ]
机构
[1] Tongji Univ, Coll Surveying & Geoinformat, Shanghai 200092, Peoples R China
[2] Shanghai Surveying & Mapping Inst, Coll Surveying & Geoinformat, Shanghai, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Remote sensing; Scene classification; Data models; Task analysis; Spatial resolution; Semantics; Transformers; Aerial image dataset; scene classification; space complexity; transformer; ATTENTION;
D O I
10.1109/TGRS.2024.3382159
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In the past decade, deep neural networks have been widely introduced into remote sensing image interpretation tasks. Exploring how to improve the model at some critical points, such as making it more suitable for remote sensing data, is still lacking, which makes more sense than developing stacked and transformed network structures, as is the expertise of scholars in the field of deep learning. To tackle this issue, this article proposes a fixed-length sequence-vision reformer (FLS-ViR) model. Relying on an advanced attention encoder with a reversible residual network structure, the presented model drastically decreases the spatial complexity. In contrast to most models that take input image sizes with 200+ pixels as edges, it will help us allow the input of large-size images under limited equipment conditions, thereby making fuller use of high spatial resolution data information. The introduction of a self-supervised method further improves the detection accuracy. Meanwhile, we built a high spatial resolution aerial image dataset for scene classification, which is superior in terms of high interclass similarity and diversity of image variations. To validate and evaluate the method and dataset, we performed a comparison of accuracy and memory consumption on a similar aerial dataset and attention-base models. The effectiveness of our model and its fitness on remote sensing datasets with a limited sample size is proved. Meanwhile, the investigation of the heat map demonstrates that increasing the size of the input image has a positive effect on optimizing the features captured by the model for remote sensing image scenes.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 59 条
[1]   Vision Transformers for Remote Sensing Image Classification [J].
Bazi, Yakoub ;
Bashmal, Laila ;
Rahhal, Mohamad M. Al ;
Dayil, Reham Al ;
Ajlan, Naif Al .
REMOTE SENSING, 2021, 13 (03) :1-20
[2]   Bridging the Semantic Gap for Satellite Image Annotation and Automatic Mapping Applications [J].
Bratasanu, Dragos ;
Nedelcu, Ion ;
Datcu, Mihai .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2011, 4 (01) :193-204
[3]   Classification of Urban Functional Areas From Remote Sensing Images and Time-Series User Behavior Data [J].
Chen, Chen ;
Yan, Jining ;
Wang, Lizhe ;
Liang, Dong ;
Zhang, Wanfeng .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :1207-1221
[4]   Complex Scene Classification of High Resolution Remote Sensing Images Based on DCNN Model [J].
Chen, Dexi ;
Hu, Peng ;
Duan, Xuelin .
2019 10TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2019,
[5]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[6]   Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA [J].
Cheng, Gong ;
Guo, Lei ;
Zhao, Tianyun ;
Han, Junwei ;
Li, Huihui ;
Fang, Jun .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (01) :45-59
[7]   Unsupervised Feature Learning for Aerial Scene Classification [J].
Cheriyadat, Anil M. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01) :439-451
[8]   Convolutional Neural Network for Remote-Sensing Scene Classification: Transfer Learning Analysis [J].
de Lima, Rafael Pires ;
Marfurt, Kurt .
REMOTE SENSING, 2020, 12 (01)
[9]   When CNNs Meet Vision Transformer: A Joint Framework for Remote Sensing Scene Classification [J].
Deng, Peifang ;
Xu, Kejie ;
Huang, Hong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[10]   Very High Resolution Remote Sensing Imagery Classification Using a Fusion of Random Forest and Deep Learning Technique-Subtropical Area for Example [J].
Dong, Luofan ;
Du, Huaqiang ;
Mao, Fangjie ;
Han, Ning ;
Li, Xuejian ;
Zhou, Guomo ;
Zhu, Di'en ;
Zheng, Junlong ;
Zhang, Meng ;
Xing, Luqi ;
Liu, Tengyan .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :113-128