Zero-Shot Scene Classification for High Spatial Resolution Remote Sensing Images

被引:77
作者
Li, Aoxue [1 ]
Lu, Zhiwu [2 ]
Wang, Liwei [1 ]
Xiang, Tao [3 ]
Wen, Ji-Rong [2 ]
机构
[1] Peking Univ, Key Lab Machine Percept, Sch Elect Engn & Comp Sci, MOE, Beijing 100871, Peoples R China
[2] Renmin Univ China, Beijing Key Lab Big Data Management & Anal Method, Sch Informat, Beijing 100872, Peoples R China
[3] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2017年 / 55卷 / 07期
基金
中国国家自然科学基金; 欧洲研究理事会;
关键词
High spatial resolution (HSR) remote sensing images; scene classification; zero-shot learning; SPARSE REPRESENTATION;
D O I
10.1109/TGRS.2017.2689071
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Due to the rapid technological development of various sensors, a huge volume of high spatial resolution (HSR) image data can now be acquired. How to efficiently recognize the scenes from such HSR image data has become a critical task. Conventional approaches to remote sensing scene classification only utilize information from HSR images. Therefore, they always need a large amount of labeled data and cannot recognize the images from an unseen scene class without any visual sample in the labeled data. To overcome this drawback, we propose a novel approach for recognizing images from unseen scene classes, i.e., zero-shot scene classification (ZSSC). In this approach, we first use the well-known natural language process model, word2vec, to map names of seen/unseen scene classes to semantic vectors. A semantic-directed graph is then constructed over the semantic vectors for describing the relationships between unseen classes and seen classes. To transfer knowledge from the images in seen classes to those in unseen classes, we make an initial label prediction on test images by an unsupervised domain adaptation model. With the semantic-directed graph and initial prediction, a label-propagation algorithm is then developed for ZSSC. By leveraging the visual similarity among images from the same scene class, a label refinement approach based on sparse learning is used to suppress the noise in the zero-shot classification results. Experimental results show that the proposed approach significantly outperforms the state-of-the-art approaches in ZSSC.
引用
收藏
页码:4157 / 4167
页数:11
相关论文
共 40 条
[1]   Multi-Cue Zero-Shot Learning with Strong Supervision [J].
Akata, Zeynep ;
Malinowski, Mateusz ;
Fritz, Mario ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :59-68
[2]   Label-Embedding for Image Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) :1425-1438
[3]  
[Anonymous], 2014, P INT C INT C MACH L
[4]  
[Anonymous], CNST2011001
[5]  
[Anonymous], 2006, Advances in Neural Information Processing Systems
[6]  
[Anonymous], 2015, PROC CVPR IEEE
[7]  
Antol S, 2014, LECT NOTES COMPUT SC, V8692, P401, DOI 10.1007/978-3-319-10593-2_27
[8]   Robust Image Analysis With Sparse Representation on Quantized Visual Features [J].
Bao, Bing-Kun ;
Zhu, Guangyu ;
Shen, Jialie ;
Yan, Shuicheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (03) :860-871
[9]   Pyramid of Spatial Relatons for Scene-Level Land Use Classification [J].
Chen, Shizhi ;
Tian, YingLi .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (04) :1947-1957
[10]  
Chen X, 2011, COMM COM INF SC, V225, P105