Remote Sensing Scene Classification Based on Convolutional Neural Networks Pre-Trained Using Attention-Guided Sparse Filters

被引:47
作者
Chen, Jingbo [1 ]
Wang, Chengyi [1 ]
Ma, Zhong [2 ]
Chen, Jiansheng [1 ]
He, Dongxu [1 ]
Ackland, Stephen [3 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100094, Peoples R China
[2] Xian Microelect Technol Inst, Xian 710065, Shaanxi, Peoples R China
[3] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England
基金
中国国家自然科学基金;
关键词
scene classification; visual attention mechanism; unsupervised learning; sparse filters; convolutional neural networks;
D O I
10.3390/rs10020290
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic-level land-use scene classification is a challenging problem, in which deep learning methods, e.g., convolutional neural networks (CNNs), have shown remarkable capacity. However, a lack of sufficient labeled images has proved a hindrance to increasing the land-use scene classification accuracy of CNNs. Aiming at this problem, this paper proposes a CNN pre-training method under the guidance of a human visual attention mechanism. Specifically, a computational visual attention model is used to automatically extract salient regions in unlabeled images. Then, sparse filters are adopted to learn features from these salient regions, with the learnt parameters used to initialize the convolutional layers of the CNN. Finally, the CNN is further fine-tuned on labeled images. Experiments are performed on the UCMerced and AID datasets, which show that when combined with a demonstrative CNN, our method can achieve 2.24% higher accuracy than a plain CNN and can obtain an overall accuracy of 92.43% when combined with AlexNet. The results indicate that the proposed method can effectively improve CNN performance using easy-to-access unlabeled images and thus will enhance the performance of land-use scene classification especially when a large-scale labeled dataset is unavailable.
引用
收藏
页数:16
相关论文
共 25 条
[1]  
[Anonymous], P 18 ACM SIGSPATIAL
[2]  
[Anonymous], P 2007 IEEE C COMP V
[3]  
[Anonymous], P 23 ACM INT C MULT
[4]  
[Anonymous], P INT C NEUR INF PRO
[5]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[6]   Effective and Efficient Midlevel Visual Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images [J].
Cheng, Gong ;
Han, Junwei ;
Guo, Lei ;
Liu, Zhenbao ;
Bu, Shuhui ;
Ren, Jinchang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (08) :4238-4249
[7]   Unsupervised Feature Learning for Aerial Scene Classification [J].
Cheriyadat, Anil M. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01) :439-451
[8]  
Fan JY, 2017, IEEE T GEOSCI REMOTE, V55, P2250, DOI [10.1109/tgrs.2016.2640186, 10.1109/TGRS.2016.2640186]
[9]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[10]   Unsupervised Feature Learning Via Spectral Clustering of Multidimensional Patches for Remotely Sensed Scene Classification [J].
Hu, Fan ;
Xia, Gui-Song ;
Wang, Zifeng ;
Huang, Xin ;
Zhang, Liangpei ;
Sun, Hong .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2015, 8 (05) :2015-2030