Coarse-to-fine semantic segmentation of satellite images

被引：6

作者：

Chen, Hao ^{[1
]}

Yang, Wen ^{[2
]}

Liu, Li ^{[3
]}

Xia, Gui-Song ^{[1
,4
,5
]}

机构：

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China

[2] Wuhan Univ, Sch Elect Informat, Wuhan 430079, Peoples R China

[3] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China

[4] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China

[5] Wuhan Univ, Inst Artificial Intelligence, Wuhan 430072, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2024年 / 217卷

基金：

中国国家自然科学基金;

关键词：

Local-softmax; Multi-prototype learning; Semantic segmentation; COVER; AREA;

D O I：

10.1016/j.isprsjprs.2024.07.028

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Training deep neural networks for semantic segmentation of aerial images relies heavily on obtaining a large number of precise pixel-level annotations, which can cause significant annotation expenses. Given the fact that acquiring fine-class annotations is considerably more challenging than obtaining coarse-class annotations, we present a novel semi-supervised learning framework, which utilizes high spatial resolution images annotated with coarse-class labels alongside a very small set of fine-grained annotated images as the training set, thereby achieving classification results that are refined in both spatial resolution and categorical granularity. Specifically, this framework adopts Mix Transformer (MiT) as the backbone architecture to accommodate both local feature extraction and long-range dependency modeling capabilities and utilizes multi-prototype learning to model each class as multiple sub-prototypes, preserving the intrinsic variance characteristics within classes. We propose a dedicated co-training approach tailored for extracting fine-grained pseudo-labels from coarse- grained samples. In this approach, a local-softmax pseudo-labeling strategy is developed to ensure a harmonious balance between the efficiency and accuracy of the pseudo-labeling, and four losses are formulated for both single-level class and cross-category granularity supervised learning. We evaluate the proposed framework on the Gaofen Image Dataset (GID) and Five-Billion-Pixels (FBP) dataset, confirming its feasibility and superior results. In particular, based on coarse-class annotations, the performance achieved using only 5% of fineclass labels, in terms of the four metrics, namely mIoU, mean UA, mean F1-score, and OA, reached 91%, 96%, 89%, and 93% of the fully-supervised baseline performance respectively. The code is available at https://github.com/chenhaocs/C2F.

引用

页码：1 / 17

页数：17

共 50 条

[1] Fine-grained Angular Contrastive Learning with Coarse Labels [J].

Bukchin, Guy ;

Schwartz, Eli ;

Saenko, Kate ;

Shahar, Ori ;

Feris, Rogerio ;

Giryes, Raja ;

Karlinsky, Leonid .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8726-8736

[2] Albumentations: Fast and Flexible Image Augmentations [J].

Buslaev, Alexander ;

Iglovikov, Vladimir I. ;

Khvedchenya, Eugene ;

Parinov, Alex ;

Druzhinin, Mikhail ;

Kalinin, Alexandr A. .

INFORMATION, 2020, 11 (02)

[3] Learning spectral-spatial representations from VHR images for fine-scale crop type mapping: A case study of rice-crayfish field extraction in South China [J].

Cai, Zhiwen ;

Wei, Haodong ;

Hu, Qiong ;

Zhou, Wei ;

Zhang, Xinyu ;

Jin, Wenjie ;

Wang, Ling ;

Yu, Shuxia ;

Wang, Zhen ;

Xu, Baodong ;

Shi, Zhihua .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 199 :28-39

[4] Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision [J].

Chen, Xiaokang ;

Yuan, Yuhui ;

Zeng, Gang ;

Wang, Jingdong .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2613-2622

[5] Impacts of spatial heterogeneity on crop area mapping in Canada using MODIS data [J].

Chen, Yaoliang ;

Song, Xiaodong ;

Wang, Shusen ;

Huang, Jingfeng ;

Mansaray, Lamin R. .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 119 :451-461

[6] A novel weakly supervised semantic segmentation framework to improve the resolution of land cover product [J].

Chen, Yujia ;

Zhang, Guo ;

Cui, Hao ;

Li, Xue ;

Hou, Shasha ;

Ma, Jinhao ;

Li, Zhijiang ;

Li, Haifeng ;

Wang, Huabin .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 196 :73-92

[7]

Contributors M., 2020, Mmsegmentation: openmmlab semantic segmentation toolbox and benchmark

[8] Efficient Subclass Segmentation in Medical Images [J].

Dai, Linrui ;

Lei, Wenhui ;

Zhang, Xiaofan .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 :266-275

[9]

Dewitz J., 2021, NATL LAND COVER DATA

[10] UCC: Uncertainty guided Cross-head Co-training for Semi-Supervised Semantic Segmentation [J].

Fan, Jiashuo ;

Gao, Bin ;

Jin, Huan ;

Jiang, Lihui .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :9937-9946

← 1 2 3 4 5 →