Remote Sensing Image Scene Classification: Benchmark and State of the Art

被引:1827
作者
Cheng, Gong [1 ]
Han, Junwei [1 ]
Lu, Xiaoqiang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Ctr OPT IMagery Anal & Learning OPTIMAL, State Key Lab Transient Opt & Photon, Xian 710119, Shaanxi, Peoples R China
基金
美国国家科学基金会;
关键词
Benchmark data set; deep learning; handcrafted features; remote sensing image; scene classification; unsupervised feature learning; GEOSPATIAL OBJECT DETECTION; TARGET DETECTION; FEATURE-SELECTION; SATELLITE IMAGES; VISUAL SALIENCY; DEEP; FEATURES; TEXTURE; REPRESENTATION; MULTISCALE;
D O I
10.1109/JPROC.2017.2675998
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Remote sensing image scene classification plays an important role in a wide range of applications and hence has been receiving remarkable attention. During the past years, significant efforts have been made to develop various data sets or present a variety of approaches for scene classification from remote sensing images. However, a systematic review of the literature concerning data sets and methods for scene classification is still lacking. In addition, almost all existing data sets have a number of limitations, including the small scale of scene classes and the image numbers, the lack of image variations and diversity, and the saturation of accuracy. These limitations severely limit the development of new approaches especially deep learning-based methods. This paper first provides a comprehensive review of the recent progress. Then, we propose a large-scale data set, termed "NWPU-RESISC45," which is a publicly available benchmark for REmote Sensing Image Scene Classification (RESISC), created by Northwestern Polytechnical University (NWPU). This data set contains 31 500 images, covering 45 scene classes with 700 images in each class. The proposed NWPU-RESISC45 1) is large-scale on the scene classes and the total image number; 2) holds big variations in translation, spatial resolution, viewpoint, object pose, illumination, background, and occlusion; and 3) has high within-class diversity and between-class similarity. The creation of this data set will enable the community to develop and evaluate various data-driven algorithms. Finally, several representative methods are evaluated using the proposed data set, and the results are reported as a useful baseline for future research.
引用
收藏
页码:1865 / 1883
页数:19
相关论文
共 176 条
  • [1] [Anonymous], IMAGE SEGMENTATION M
  • [2] [Anonymous], 2002, Principal components analysis
  • [3] [Anonymous], PROC CVPR IEEE
  • [4] [Anonymous], 2006, NIPS
  • [5] [Anonymous], 2011, GEOMORPHOMETRY
  • [6] [Anonymous], 2001, Zeitschrift fur Geoinformationssysteme
  • [7] [Anonymous], 2008, OBJECT BASED IMAGE A, DOI DOI 10.1007/978
  • [8] [Anonymous], IEEE SENSORS J
  • [9] [Anonymous], ADV NEURAL INF PROCE
  • [10] [Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199