Coarse-to-fine semantic segmentation of satellite images

被引:3
作者
Chen, Hao [1 ]
Yang, Wen [2 ]
Liu, Li [3 ]
Xia, Gui-Song [1 ,4 ,5 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Sch Elect Informat, Wuhan 430079, Peoples R China
[3] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[4] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China
[5] Wuhan Univ, Inst Artificial Intelligence, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Local-softmax; Multi-prototype learning; Semantic segmentation; COVER; AREA;
D O I
10.1016/j.isprsjprs.2024.07.028
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Training deep neural networks for semantic segmentation of aerial images relies heavily on obtaining a large number of precise pixel-level annotations, which can cause significant annotation expenses. Given the fact that acquiring fine-class annotations is considerably more challenging than obtaining coarse-class annotations, we present a novel semi-supervised learning framework, which utilizes high spatial resolution images annotated with coarse-class labels alongside a very small set of fine-grained annotated images as the training set, thereby achieving classification results that are refined in both spatial resolution and categorical granularity. Specifically, this framework adopts Mix Transformer (MiT) as the backbone architecture to accommodate both local feature extraction and long-range dependency modeling capabilities and utilizes multi-prototype learning to model each class as multiple sub-prototypes, preserving the intrinsic variance characteristics within classes. We propose a dedicated co-training approach tailored for extracting fine-grained pseudo-labels from coarse- grained samples. In this approach, a local-softmax pseudo-labeling strategy is developed to ensure a harmonious balance between the efficiency and accuracy of the pseudo-labeling, and four losses are formulated for both single-level class and cross-category granularity supervised learning. We evaluate the proposed framework on the Gaofen Image Dataset (GID) and Five-Billion-Pixels (FBP) dataset, confirming its feasibility and superior results. In particular, based on coarse-class annotations, the performance achieved using only 5% of fineclass labels, in terms of the four metrics, namely mIoU, mean UA, mean F1-score, and OA, reached 91%, 96%, 89%, and 93% of the fully-supervised baseline performance respectively. The code is available at https://github.com/chenhaocs/C2F.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [1] Fine-grained Angular Contrastive Learning with Coarse Labels
    Bukchin, Guy
    Schwartz, Eli
    Saenko, Kate
    Shahar, Ori
    Feris, Rogerio
    Giryes, Raja
    Karlinsky, Leonid
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8726 - 8736
  • [2] Albumentations: Fast and Flexible Image Augmentations
    Buslaev, Alexander
    Iglovikov, Vladimir I.
    Khvedchenya, Eugene
    Parinov, Alex
    Druzhinin, Mikhail
    Kalinin, Alexandr A.
    [J]. INFORMATION, 2020, 11 (02)
  • [3] Learning spectral-spatial representations from VHR images for fine-scale crop type mapping: A case study of rice-crayfish field extraction in South China
    Cai, Zhiwen
    Wei, Haodong
    Hu, Qiong
    Zhou, Wei
    Zhang, Xinyu
    Jin, Wenjie
    Wang, Ling
    Yu, Shuxia
    Wang, Zhen
    Xu, Baodong
    Shi, Zhihua
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 199 : 28 - 39
  • [4] Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
    Chen, Xiaokang
    Yuan, Yuhui
    Zeng, Gang
    Wang, Jingdong
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2613 - 2622
  • [5] Impacts of spatial heterogeneity on crop area mapping in Canada using MODIS data
    Chen, Yaoliang
    Song, Xiaodong
    Wang, Shusen
    Huang, Jingfeng
    Mansaray, Lamin R.
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 119 : 451 - 461
  • [6] A novel weakly supervised semantic segmentation framework to improve the resolution of land cover product
    Chen, Yujia
    Zhang, Guo
    Cui, Hao
    Li, Xue
    Hou, Shasha
    Ma, Jinhao
    Li, Zhijiang
    Li, Haifeng
    Wang, Huabin
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 196 : 73 - 92
  • [7] Cheng Ouyang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12374), P762, DOI 10.1007/978-3-030-58526-6_45
  • [8] Contributors M., 2018, MMCV: openmmlab computer vision foundation
  • [9] Efficient Subclass Segmentation in Medical Images
    Dai, Linrui
    Lei, Wenhui
    Zhang, Xiaofan
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 266 - 275
  • [10] Dewitz J., 2021, NATL LAND COVER DATA