Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation

被引：10

作者：

Zhai, Wei ^{[1
]}

Wu, Pingyu ^{[1
]}

Zhu, Kai ^{[1
]}

Cao, Yang ^{[1
,2
]}

Wu, Feng ^{[1
,2
]}

Zha, Zheng-Jun ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei, Peoples R China

[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2024年 / 132卷 / 03期

关键词：

Weakly supervised; Object localization; Background activation suppression; Semantic segmentation;

D O I：

10.1007/s11263-023-01919-2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels. Recently, a new paradigm has emerged by generating a foreground prediction map (FPM) to achieve pixel-level localization. While existing FPM-based methods use cross-entropy to evaluate the foreground prediction map and to guide the learning of the generator, this paper presents two astonishing experimental observations on the object localization learning process: For a trained network, as the foreground mask expands, (1) the cross-entropy converges to zero when the foreground mask covers only part of the object region. (2) The activation value continuously increases until the foreground mask expands to the object boundary. Therefore, to achieve a more effective localization performance, we argue for the usage of activation value to learn more object regions. In this paper, we propose a background activation suppression (BAS) method. Specifically, an activation map constraint module is designed to facilitate the learning of generator by suppressing the background activation value. Meanwhile, by using foreground region guidance and area constraint, BAS can learn the whole region of the object. In the inference phase, we consider the prediction maps of different categories together to obtain the final localization results. Extensive experiments show that BAS achieves significant and consistent improvement over the baseline methods on the CUB-200-2011 and ILSVRC datasets. In addition, our method also achieves state-of-the-art weakly supervised semantic segmentation performance on the PASCAL VOC 2012 and MS COCO 2014 datasets. Code and models are available at https://github.com/wpy1999/BAS-Extension.

引用

页码：750 / 775

页数：26

共 80 条

[1] Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations
Ahn, Jiwoon
Cho, Sunghyun
Kwak, Suha
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2204 - 2213
[2] Learning Pixel-level Semantic Affinity with Image-level Supervision forWeakly Supervised Semantic Segmentation
Ahn, Jiwoon
Kwak, Suha
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4981 - 4990
[3] A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains
Chan, Lyndon
Hosseini, Mahdi S.
Plataniotis, Konstantinos N.
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 361 - 384
[4] Chang YT, 2020, PROC CVPR IEEE, P8988, DOI 10.1109/CVPR42600.2020.00901
[5] Chen LC, 2016, Arxiv, DOI [arXiv:1412.7062, 10.1080/17476938708814211]
[6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[7] Chen Liyi, 2020, EUR C COMP VIS ECCV, P347
[8] Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation
Chen, Qi
Yang, Lingxiao
Lai, Jianhuang
Xie, Xiaohua
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4278 - 4288
[9] Chen Z., 2022, IEEECVF C COMPUT VIS, P969
[10] Evaluating Weakly Supervised Object Localization Methods Right
Choe, Junsuk
Oh, Seong Joon
Lee, Seungho
Chun, Sanghyuk
Akata, Zeynep
Shim, Hyunjung
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3130 - 3139

← 1 2 3 4 5 6 7 8 →