Boosting Semantic Segmentation of Aerial Images via Decoupled and Multilevel Compaction and Dispersion

被引：1

作者：

Shan, Lianlei ^{[1
]}

Wang, Weiqiang ^{[1
]}

Lv, Ke ^{[2
]}

Luo, Bin ^{[3
]}

机构：

[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China

[2] Univ Chinese Acad Sci, Sch Engn Sci, Beijing 100049, Peoples R China

[3] Anhui Univ, Sch Comp Sci & Technol, Key Lab Signal Proc & Intelligent Comp, MOE, Hefei 230601, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷

关键词：

Aerial image; multilevel feature interclass dispersion; multilevel feature intraclass compaction; semantic segmentation; CLASSIFICATION;

D O I：

10.1109/TGRS.2023.3297092

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Semantic segmentation is a valuable task in practical applications for aerial images. Nevertheless, the segmentation performance is unsatisfactory due to aerial images' huge intraclass variance and interclass similarity. To solve this problem, we propose an approach to increase the distinction between classes and compact the features of the same class. Specifically, since a single aerial image contains only a small number of categories, which is fatal for previous contrastive learning, we discard InfoNCE loss in contrastive learning and use the simple mean square error (mse) loss that does not require negative samples to decouple the dispersion and compaction operations. Besides, we set up more representative prototypes for classes and extend the prototypes to the whole dataset level, which we call image- and dataset-level prototypes. Based on the calculated prototypes, we propose multilevel intraclass feature compaction (MFC) and multilevel interclass feature dispersion (MFD) to compact the features of the same class and disperse the features of different classes in the latent feature space. More importantly, some measures are proposed to ensure the two do not conflict. MFC and MFD can be applied to any existing segmentation network to improve performance significantly without increasing computational complexity during inference. Moreover, we feed the calculated multilevel prototypes directly into the classifier, thus keeping the feature extraction and classifier consistent. Results on four challenging datasets, Deepglobe, iSAID, Potsdam, and Vaihingen, demonstrate the significant effect of our method, and sufficient ablation studies verify the role of each module.

引用

页数：16

共 53 条

[1] SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].

Achanta, Radhakrishna ;

Shaji, Appu ;

Smith, Kevin ;

Lucchi, Aurelien ;

Fua, Pascal ;

Suesstrunk, Sabine .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281

[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[3]

Chen LC, 2017, Arxiv, DOI arXiv:1706.05587

[4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[5]

Chen TW, 2008, 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, P324, DOI 10.1109/MMSP.2008.4665097

[6] Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images [J].

Chen, Wuyang ;

Jiang, Ziyu ;

Wang, Zhangyang ;

Cui, Kexin ;

Qian, Xiaoning .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8916-8925

[7] Memory Enhanced Global-Local Aggregation for Video Object Detection [J].

Chen, Yihong ;

Cao, Yue ;

Hu, Han ;

Wang, Liwei .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10334-10343

[8] DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images [J].

Demir, Ilke ;

Koperski, Krzysztof ;

Lindenbaum, David ;

Pang, Guan ;

Huang, Jing ;

Bast, Saikat ;

Hughes, Forest ;

Tuia, Devis ;

Raskar, Ramesh .

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :172-181

[9] CCANet: Class-Constraint Coarse-to-Fine Attentional Deep Network for Subdecimeter Aerial Image Semantic Segmentation [J].

Deng, Guohui ;

Wu, Zhaocong ;

Wang, Chengjun ;

Xu, Miaozhong ;

Zhong, Yanfei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[10]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 →