Find it if You Can: End-to-End Adversarial Erasing for Weakly-Supervised Semantic Segmentation

被引：3

作者：

Stammes, Erik ^{[1
,2
]}

Runia, Tom F. H. ^{[1
]}

Hofmann, Michael ^{[2
]}

Ghafoorian, Mohsen ^{[2
]}

机构：

[1] Univ Amsterdam, Amsterdam, Netherlands

[2] TomTom, Amsterdam, Netherlands

来源：

THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021) | 2021年 / 11878卷

关键词：

Semantic segmentation; adversarial learning; weak supervision;

D O I：

10.1117/12.2599432

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

Semantic segmentation is a task that traditionally requires a large dataset of pixel-level ground truth labels, which is time-consuming and expensive to obtain. Recent advancements in the weakly-supervised setting show that reasonable performance can be obtained by using only image-level labels. Classification is often used as a proxy task to train a deep neural network from which attention maps are extracted. However, the classification task needs only the minimum evidence to make predictions, hence it focuses on the most discriminative object regions. To overcome this problem, we propose a novel formulation of adversarial erasing of the attention maps. In contrast to previous adversarial erasing methods, we optimize two networks with opposing loss functions, which eliminates the requirement of certain suboptimal strategies; for instance, having multiple training steps that complicate the training process or a weight sharing policy between networks operating on different distributions that might be suboptimal for performance. The proposed solution does not require saliency masks, instead it uses a regularization loss to prevent the attention maps from spreading to less discriminative object regions. Our experiments on the Pascal VOC dataset demonstrate that our adversarial approach increases segmentation performance by 2.1 mIoU compared to our baseline and by 1.0 mIoU compared to previous adversarial erasing approaches.

引用

页数：10

共 44 条

[41]

Zhang BF, 2020, AAAI CONF ARTIF INTE, V34, P12765

[42] Adversarial Complementary Learning for Weakly Supervised Object Localization [J].

Zhang, Xiaolin ;

Wei, Yunchao ;

Feng, Jiashi ;

Yang, Yi ;

Huang, Thomas .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1325-1334

[43] Pyramid Scene Parsing Network [J].

Zhao, Hengshuang ;

Shi, Jianping ;

Qi, Xiaojuan ;

Wang, Xiaogang ;

Jia, Jiaya .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6230-6239

[44]

ZHOU B, 2016, PROC CVPR IEEE, P2921, DOI DOI 10.1109/CVPR.2016.319

← 1 2 3 4 5 →