NATURAL LANGUAGE DESCRIPTION OF REMOTE SENSING IMAGES BASED ON DEEP LEARNING

被引:0
作者
Zhang, Xiangrong [1 ]
Li, Xiang [1 ]
An, Jinliang [1 ]
Gao, Li [2 ]
Hou, Biao [1 ]
Li, Chen [3 ]
机构
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Shaanxi, Peoples R China
[2] Xian Res Inst Surveying & Mapping, Xian 710000, Shaanxi, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Shaanxi, Peoples R China
来源
2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS) | 2017年
基金
中国国家自然科学基金;
关键词
remote sensing image; natural language description; convolutional neural network; recurrent neural network;
D O I
暂无
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
The semantic description of remote sensing image is a useful and meaningful task, which can help us to get a better understanding of the scene depicted in the remote sensing images and make better use of the remote sensing images. Nature language provides good solution for describing the semantic information of remote sensing images. Nature language description of a remote sensing image is to generate a meaningful sentence given a remote sensing image. This paper presents a novel method based on deep learning. First, a convolutional neural network is utilized to detect the main objects of the remote sensing images. Then a recurrent neural network language model is utilized to generate the natural language descriptions of the objects which are detected in the first step. Experimental results on a set of remote sensing images demonstrate that the proposed method is able to generate desirable description of the scene.
引用
收藏
页码:4798 / 4801
页数:4
相关论文
共 15 条
[1]  
[Anonymous], 2017, INT J PROD RES, DOI DOI 10.1080/00207543.2016.1154209
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], 2016, IEEE C COMP VIS PATT
[4]  
[Anonymous], 2013, Caffe: An Open Source Convolutional Architecture for Fast Feature Embedding
[5]  
[Anonymous], NIPS
[6]  
[Anonymous], 2010, INTERSPEECH
[7]   Land-use scene classification using multi-scale completed local binary patterns [J].
Chen, Chen ;
Zhang, Baochang ;
Su, Hongjun ;
Li, Wei ;
Wang, Lu .
SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (04) :745-752
[8]  
Karpathy A, 2015, PROC CVPR IEEE, P3128, DOI 10.1109/CVPR.2015.7298932
[9]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[10]   Semantic Labeling of Aerial and Satellite Imagery [J].
Paisitkriangkrai, Sakrapee ;
Sherrah, Jamie ;
Janney, Pranam ;
van den Hengel, Anton .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (07) :2868-2881