Atrous convolutional feature network for weakly supervised semantic segmentation

被引：0

作者：

Xu, Lian ^{[1
]}

Xue, Hao ^{[1
]}

Bennamoun, Mohammed ^{[1
]}

Boussaid, Farid ^{[2
]}

Sohel, Ferdous ^{[3
]}

机构：

[1] Univ Western Australia, Dept Comp Sci & Software Engn, 35 Stirling Hwy, Perth, WA 6009, Australia

[2] Univ Western Australia, Sch Elect Elect & Comp Engn, 35 Stirling Hwy, Perth, WA 6009, Australia

[3] Murdoch Univ, Discipline Informat Technol Math & Stat, Murdoch, WA 6150, Australia

来源：

NEUROCOMPUTING | 2021年 / 421卷

基金：

澳大利亚研究理事会;

关键词：

Weakly supervised semantic segmentation; Multi-label image classification; Atrous convolution; Multi-scale features; Attention mechanism;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised semantic segmentation has been attracting increasing attention as it can alleviate the need for expensive pixel-level annotations through the use of image-level labels. Relevant methods mainly rely on the implicit object localization ability of convolutional neural networks (CNNs). However, generated object attention maps remain mostly small and incomplete. In this paper, we propose an Atrous Convolutional Feature Network (ACFN) to generate dense object attention maps. This is achieved by enhancing the context representation of image classification CNNs. More specifically, cascaded atrous convolutions are used in the middle layers to retain sufficient spatial details, and pyramidal atrous convolutions are used in the last convolutional layers to provide multi-scale context information for the extraction of object attention maps. Moreover, we propose an attentive fusion strategy to adaptively fuse the multi-scale features. Our method shows improvements over existing methods on both the PASCAL VOC 2012 and MS COCO datasets, achieving state-of-the-art performance. (c) 2020 Elsevier B.V. All rights reserved.

引用

页码：115 / 126

页数：12

共 58 条

[1] Learning Pixel-level Semantic Affinity with Image-level Supervision forWeakly Supervised Semantic Segmentation [J].

Ahn, Jiwoon ;

Kwak, Suha .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4981-4990

[2]

[Anonymous], 2017, P IEEE C COMP VIS PA

[3]

[Anonymous], 2018, Lect. Comput. Vis., DOI [DOI 10.2200/S00822ED1V01Y201712COV015, 10.2200/S00822ED1V01Y201712COV015]

[4]

[Anonymous], 2017, P CVPR

[5]

[Anonymous], 2020, Tensors and dynamic neural networks in python with strong GPU acceleration

[6]

[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.563

[7]

[Anonymous], 2019, INVERSE PROBL SCI EN

[8] What's the Point: Semantic Segmentation with Point Supervision [J].

Bearman, Amy ;

Russakovsky, Olga ;

Ferrari, Vittorio ;

Fei-Fei, Li .

COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :549-565

[9] Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].

Chattopadhay, Aditya ;

Sarkar, Anirban ;

Howlader, Prantik ;

Balasubramanian, Vineeth N. .

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847

[10]

Chaudhry A., BMVC

← 1 2 3 4 5 6 →