Text guided zero-shot scene classification of high spatial resolution remote sensing images

Authors
Liu, Bing [1]
Chen, Xiaohui [1]
Zhou, Dewei [1]
Wang, Peng [2]
Wang, Ruirui [3]
Affiliations
[1] Strateg Support Force Informat Engn Univ, Zhengzhou, Peoples R China
[2] North China Univ Water Resources & Elect Power, Zhengzhou, Peoples R China
[3] Henan Geol & Mineral Explorat & Dev Bur, Surveying & Mapping Geog Informat Inst, Zhengzhou, Peoples R China
Keywords
high resolution remote sensing images; scene classification; zero-shot classification; contrast learning; text guidance; NETWORKS;
DOI
10.1117/1.JRS.18.014525
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
High spatial resolution remote sensing image scene classification has a wide range of applications and has recently become a hotspot in remote sensing research. Because the scenes in remote sensing images are complex, it is impossible to annotate all ground object classes at once. To adapt to different application scenarios, scene classification models therefore need zero-shot generalization ability for unseen classes. Existing methods for improving zero-shot generalization usually start from image features and thus ignore the high-order semantic information in the scene. In fact, the associations among high-order semantic information in a scene are very important for the generalization ability of a classification model, and people often combine image information with its corresponding high-order semantic information to understand remote sensing scenes. This work therefore proposes a text-guided remote sensing image pre-training model for zero-shot classification of high spatial resolution remote sensing image scenes. First, a transformer model extracts embedded features from the text and the remote sensing images. Then, based on aligned text and remote sensing image data, a contrastive learning method trains the model to learn the correspondence between text and image features. After training, the nearest neighbor method is used to perform zero-shot classification on the target data. The effectiveness of the proposed method was verified on three remote sensing image scene classification benchmark datasets.
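The two steps named in the abstract, contrastive alignment of text and image embeddings followed by nearest-neighbor zero-shot classification, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the loss, so a CLIP-style symmetric cross-entropy over cosine similarities is assumed here, and the function names, the temperature value, and the use of cosine similarity for the nearest-neighbor step are all hypothetical choices made for illustration. The transformer encoders are omitted; the functions take already-computed embeddings.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products become cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=axis, keepdims=True), eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def symmetric_contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Assumed CLIP-style loss: row i of image_emb is paired with row i of text_emb.

    Matching pairs sit on the diagonal of the similarity matrix; the loss
    averages the image-to-text and text-to-image cross-entropies.
    """
    img = l2_normalize(image_emb)
    txt = l2_normalize(text_emb)
    logits = img @ txt.T / temperature          # shape (batch, batch)
    idx = np.arange(logits.shape[0])
    loss_i2t = -log_softmax(logits, axis=1)[idx, idx].mean()
    loss_t2i = -log_softmax(logits, axis=0)[idx, idx].mean()
    return 0.5 * (loss_i2t + loss_t2i)


def zero_shot_classify(image_emb, class_text_emb):
    """Nearest-neighbor step: assign each image to the most similar class text."""
    sims = l2_normalize(image_emb) @ l2_normalize(class_text_emb).T
    return sims.argmax(axis=1)                  # index of the predicted class
```

In this sketch, unseen classes are handled by embedding one text description per class and passing those embeddings as `class_text_emb`; no image labels for those classes are needed at inference time.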
Pages: 13