Geographical Knowledge-Driven Representation Learning for Remote Sensing Images

被引:72
作者
Li, Wenyuan [1 ,2 ,3 ]
Chen, Keyan [1 ,2 ,3 ]
Chen, Hao [1 ,2 ,3 ]
Shi, Zhenwei [1 ,2 ,3 ]
机构
[1] Beihang Univ, Image Proc Ctr, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, Beijing Key Lab Digital Media, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Astronaut, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Remote sensing; Task analysis; Satellites; Sensors; Semantics; Annotations; Training; Cloud; snow detection; object detection; remote sensing images; representation learning; scene classification; semantic segmentation; SEMANTIC SEGMENTATION; OBJECT DETECTION; CLOUD DETECTION; NEURAL-NETWORK; ACCESS;
D O I
10.1109/TGRS.2021.3115569
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The proliferation of remote sensing satellites has resulted in a massive amount of remote sensing images. However, due to human and material resource constraints, the vast majority of remote sensing images remain unlabeled. As a result, it cannot be applied to currently available deep learning methods. To fully utilize the remaining unlabeled images, we propose a Geographical Knowledge-driven Representation (GeoKR) learning method for remote sensing images, improving network performance and reduce the demand for annotated data. The global land cover products and geographical location associated with each remote sensing image are regarded as geographical knowledge to provide supervision for representation learning and network pretraining. An efficient pretraining framework is proposed to eliminate the supervision noises caused by imaging times and resolutions difference between remote sensing images and geographical knowledge. A large-scale pretraining dataset Levir-KR is constructed to support network pretraining. It contains 1,431,950 remote sensing images from Gaofen series satellites with various resolutions. Experimental results demonstrate that our proposed method outperforms ImageNet pretraining and self-supervised representation learning methods and significantly reduces the burden of data annotation on downstream tasks, such as scene classification, semantic segmentation, object detection, and cloud/snow detection. It demonstrates that our proposed method can be used as a novel paradigm for pretraining neural networks. Codes will be available on https://github.com/flyakon/Geographical-Knowledge-driven-Representaion-Learning.
引用
收藏
页数:16
相关论文
共 70 条
[1]   Geography-Aware Self-Supervised Learning [J].
Ayush, Kumar ;
Uzkent, Burak ;
Meng, Chenlin ;
Tanmay, Kumar ;
Burke, Marshall ;
Lobell, David ;
Ermon, Stefano .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :10161-10170
[2]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[3]   Preliminary analysis of spatiotemporal pattern of global land surface water [J].
Cao Xin ;
Chen Jun ;
Chen LiJun ;
Liao AnPing ;
Sun FangDi ;
Li Yang ;
Li Lei ;
Lin ZhongHui ;
Pang ZhiGuo ;
Chen Jin ;
He ChaoYing ;
Peng Shu .
SCIENCE CHINA-EARTH SCIENCES, 2014, 57 (10) :2330-2339
[4]  
Caron M, 2020, ADV NEUR IN, V33
[5]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[6]  
[陈军 Chen Jun], 2015, [测绘学报, Acta Geodetica et Cartographica Sinica], V44, P1181
[7]  
Chen T, 2020, PR MACH LEARN RES, V119
[8]   Hyperspectral Image Super-Resolution with Self-Supervised Spectral-Spatial Residual Network [J].
Chen, Wenjing ;
Zheng, Xiangtao ;
Lu, Xiaoqiang .
REMOTE SENSING, 2021, 13 (07)
[9]  
Chen Xinyun, 2020, arXiv preprint arXiv:2003.04297
[10]   DeepPermNet: Visual Permutation Learning [J].
Cruz, Rodrigo Santa ;
Fernando, Basura ;
Cherian, Anoop ;
Gould, Stephen .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6044-6052