VISION-LANGUAGE JOINT LEARNING FOR BOX-SUPERVISED CHANGE DETECTION IN REMOTE SENSING

Authors
Yin, Kanghua [1]
Liu, Fang [1]
Liu, Jia [1]
Xiao, Liang [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Jiangsu Prov Engn Res Ctr Airborne Detecting & In, Sch Comp Sci & Engn, Nanjing, Peoples R China
Keywords
Change detection; remote sensing; vision-language; box-supervised
DOI
10.1109/IGARSS53475.2024.10641329
Chinese Library Classification (CLC) number
P9 [Physical Geography]
Discipline classification code
0705; 070501
Abstract
Change detection (CD) in remote sensing aims to reveal land-cover changes according to the categories of ground objects. However, current popular vision-based CD methods typically lack this category information. Because language analysis is well suited to distinguishing categories, this paper proposes a vision-language joint learning method consisting of two vision-language joint representation (VLJR) modules and a changed instance segmentation (CIS) module. The VLJR modules combine image features and language features using a text encoder and a Transformer. The CIS module generates the final pixel-level CD result from only box-level labeled samples through level-set evolution and box matching supervision, which substantially reduces manual labeling effort. Tested on the representative WHU datasets, the proposed method achieves results comparable to fully supervised CD methods and outperforms the other weakly supervised methods.
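The abstract does not say how box matching supervision is formulated. As a rough illustration only, the sketch below uses a projection-style loss that is common in box-supervised segmentation: the predicted change-probability map is projected onto the x and y axes and compared with the projections of the labeled box. The function names (`box_to_mask`, `dice_1d`, `box_projection_loss`), the Dice form of the comparison, and the end-exclusive box convention are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np

def box_to_mask(box, shape):
    """Rasterize a box (x0, y0, x1, y1), end-exclusive, into a binary mask.

    Hypothetical helper: the box convention is an assumption for this sketch.
    """
    x0, y0, x1, y1 = box
    mask = np.zeros(shape, dtype=np.float64)
    mask[y0:y1, x0:x1] = 1.0
    return mask

def dice_1d(p, t, eps=1e-6):
    """Dice loss between two 1-D profiles with values in [0, 1]."""
    inter = np.sum(p * t)
    return 1.0 - (2.0 * inter + eps) / (np.sum(p * p) + np.sum(t * t) + eps)

def box_projection_loss(prob, box):
    """Compare axis projections of a predicted map with those of a box.

    prob: 2-D array of change probabilities, shape (height, width).
    box:  (x0, y0, x1, y1) box-level label, end-exclusive.
    Assumed formulation; the paper's actual loss may differ.
    """
    target = box_to_mask(box, prob.shape)
    # Max over rows gives the profile along x; max over columns, along y.
    loss_x = dice_1d(prob.max(axis=0), target.max(axis=0))
    loss_y = dice_1d(prob.max(axis=1), target.max(axis=1))
    return loss_x + loss_y

# Usage: a prediction that exactly fills the box versus an empty prediction.
box = (2, 3, 6, 7)
perfect = box_to_mask(box, (10, 10))
empty = np.zeros((10, 10))
loss_perfect = box_projection_loss(perfect, box)
loss_empty = box_projection_loss(empty, box)
```

A loss of this kind constrains only the extent of the predicted region, not its shape inside the box, which is presumably why the method pairs box-level supervision with level-set evolution to recover pixel-level boundaries.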
Pages: 10254-10258
Number of pages: 5