NATURAL LANGUAGE AIDED REMOTE SENSING IMAGE FEW-SHOT CLASSIFICATION

被引:0
作者
Chen, Deliang [1 ,2 ]
Xiao, Jianbo [1 ,2 ]
Gao, Kyle [4 ]
Lu, Yanyan [3 ]
Fatholahi, Sarah [4 ]
Li, Jonathan [4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Geog & Biol Informat, Nanjing 210023, JS, Peoples R China
[2] Nanjing Univ, Sch Geog & Oceanog Sci, Nanjing 210023, JS, Peoples R China
[3] Nanjing Audit Univ, Inst Nat Resources & Environm Audit, Nanjing 211815, JS, Peoples R China
[4] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada
来源
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM | 2023年
关键词
remote-sensing image classification; few-shot learning; cross-modal;
D O I
10.1109/IGARSS52108.2023.10282538
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
We aim to improve the efficiency of traditional deep learning methods for remote sensing by reducing the reliance on annotated data and minimizing training time. Instead of using large-scale unimodal remote sensing image datasets for pre-training, we propose the use of multimodal data (text-image pairs), which we believe to be more effective. To enhance the model's generalization performance in the remote sensing domain and achieve accurate remote sensing image scene classification, we employ the Feature Adaptive Embedding Module. For this purpose, we introduce a cross-modal comparison learning network that is based on openly accessible generalized datasets. This network is capable of recognizing specific photo scenarios from remote sensing photographs, maximizing the accuracy of classification.
引用
收藏
页码:6298 / 6301
页数:4
相关论文
共 9 条
[1]  
Cheng G., 2017, P IEEE 2017, V105, P1865
[2]  
Dosovitskiy A., 2020, PREPRINT
[3]   PRF-RW: a progressive random forest-based random walk approach for interactive semi-automated pulmonary lobes segmentation [J].
Li, Qiang ;
Chen, Lei ;
Li, Xiangju ;
Lv, Xiaofeng ;
Xia, Shuyue ;
Kang, Yan .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (10) :2221-2235
[4]  
Radford A, 2021, PR MACH LEARN RES, V139
[5]   WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning [J].
Srinivasan, Krishna ;
Raman, Karthik ;
Chen, Jiecao ;
Bendersky, Michael ;
Najork, Marc .
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, :2443-2449
[6]  
Vaswani A, 2017, P INT C NEUR INF PRO, P6000
[7]   Scene Classification With Recurrent Attention of VHR Remote Sensing Images [J].
Wang, Qi ;
Liu, Shaoteng ;
Chanussot, Jocelyn ;
Li, Xuelong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (02) :1155-1167
[8]   Fast RobustSTL: Efficient and Robust Seasonal-Trend Decomposition for Time Series with Complex Patterns [J].
Wen, Qingsong ;
Zhang, Zhe ;
Li, Yan ;
Sun, Liang .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :2203-2213
[9]   A META-LEARNING FRAMEWORK FOR FEW-SHOT CLASSIFICATION OF REMOTE SENSING SCENE [J].
Zhang, Pei ;
Bai, Yunpeng ;
Wang, Dong ;
Bai, Bendu ;
Li, Ying .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :4590-4594