Dynamic and Adaptive Self-Training for Semi-Supervised Remote Sensing Image Semantic Segmentation

被引:4
作者
Jin, Jidong [1 ,2 ,3 ,4 ]
Lu, Wanxuan [1 ,2 ]
Yu, Hongfeng [1 ,2 ]
Rong, Xuee [1 ,2 ,3 ,4 ]
Sun, Xian [1 ,2 ,3 ,4 ]
Wu, Yirong [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Inst Elect, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Elect, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Remote sensing; Semantic segmentation; Transformers; Data models; Training; Semantics; Predictive models; Consistency regularization (CR); remote sensing (RS) image; self-training; semantic segmentation; semisupervised learning (SSL);
D O I
10.1109/TGRS.2024.3407142
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Remote sensing (RS) technology has made remarkable progress, providing a wealth of data for various applications, such as ecological conservation and urban planning. However, the meticulous annotation of this data is labor-intensive, leading to a shortage of labeled data, particularly in tasks like semantic segmentation. Semi-supervised methods, combining consistency regularization (CR) with self-training, offer a solution to efficiently utilize labeled and unlabeled data. However, these methods encounter challenges due to imbalanced data ratios. To tackle these challenges, we introduce a self-training approach named dynamic and adaptive self-training (DAST), which is combined with dynamic pseudo-label sampling (DPS), distribution matching (DM), and adaptive threshold updating (ATU). DPS is tailored to address the issue of class distribution imbalance by giving priority to classes with fewer samples. Meanwhile, DM and ATU aim to reduce distribution disparities by adjusting model predictions across augmented images within the framework of CR, ensuring they align with the actual data distribution. Experimental results on the Potsdam and iSAID datasets demonstrate that DAST effectively balances class distribution, aligns model predictions with data distribution, and stabilizes pseudo-labels, leading to state-of-the-art performance on both datasets. These findings highlight the potential of DAST in overcoming the challenges associated with significant disparities in labeled-to-unlabeled data ratios.
引用
收藏
页数:14
相关论文
共 50 条
[1]  
Berthelot David., 2019, arXiv
[2]  
Chen H, 2023, Arxiv, DOI arXiv:2301.10921
[3]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[4]   What is a good evaluation measure for semantic segmentation? [J].
Csurka, Gabriela ;
Larlus, Diane ;
Perronnin, Florent .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[5]   DenseU-Net-Based Semantic Segmentation of Objects in Urban Remote Sensing Images [J].
Dong, Rongsheng ;
Pan, Xiaoquan ;
Li, Fengying .
IEEE ACCESS, 2019, 7 :65347-65356
[6]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, 10.48550/arXiv.2010.11929, DOI 10.48550/ARXIV.2010.11929]
[7]   DMT: Dynamic mutual training for semi-supervised learning [J].
Feng, Zhengyang ;
Zhou, Qianyu ;
Gu, Qiqi ;
Tan, Xin ;
Cheng, Guangliang ;
Lu, Xuequan ;
Shi, Jianping ;
Ma, Lizhuang .
PATTERN RECOGNITION, 2022, 130
[8]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[9]   A Survey on Vision Transformer [J].
Han, Kai ;
Wang, Yunhe ;
Chen, Hanting ;
Chen, Xinghao ;
Guo, Jianyuan ;
Liu, Zhenhua ;
Tang, Yehui ;
Xiao, An ;
Xu, Chunjing ;
Xu, Yixing ;
Yang, Zhaohui ;
Zhang, Yiman ;
Tao, Dacheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110
[10]   Hybrid first and second order attention Unet for building segmentation in remote sensing images [J].
He, Nanjun ;
Fang, Leyuan ;
Plaza, Antonio .
SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (04)