Adversarial Learning of Object-Aware Activation Map for Weakly-Supervised Semantic Segmentation

被引:22
作者
Chen, Junliang [1 ,2 ]
Lu, Weizeng [1 ,2 ]
Li, Yuexiang [3 ]
Shen, Linlin [1 ,2 ]
Duan, Jinming [4 ]
机构
[1] Shenzhen Univ, Comp Vis Inst, Sch Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Shenzhen Inst Artificial Intelligence & Robot Soc, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
[3] Tencent Jarvis Lab, Shenzhen 518057, Peoples R China
[4] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
基金
中国国家自然科学基金;
关键词
Weakly-supervised semantic segmentation; class activation map; object-aware activation map; IMAGE;
D O I
10.1109/TCSVT.2023.3236432
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent years have witnessed impressive advances in the area of weakly-supervised semantic segmentation (WSSS). However, most of existing approaches are based on class activation maps (CAMs), which suffer from the under-segmentation problem (i.e., objects of interest are segmented partially). Although a number of literature works have been proposed to tackle this under-segmentation problem, we argue that these solutions built on CAMs may not be optimal for the WSSS task. Instead, in this paper we propose a network based on the object-aware activation map (OAM). The proposed network, termed OAM-Net, consists of four loss functions (foreground loss, background loss, average pixel and consistency loss) which ensure exactness, completeness, compactness and consistency of segmented objects via adversarial training. Compared to conventional CAM-based methods, our OAM-Net overcomes the under-segmentation drawback and significantly improves segmentation accuracy with negligible computational cost. A thorough comparison between OAM-Net and CAM-based approaches is carried out on the PASCAL VOC2012 dataset, and experimental results show that our network outperforms state-of-the-art approaches by a large margin. The code will be available soon.
引用
收藏
页码:3935 / 3946
页数:12
相关论文
共 51 条
[1]   Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations [J].
Ahn, Jiwoon ;
Cho, Sunghyun ;
Kwak, Suha .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2204-2213
[2]   Learning Pixel-level Semantic Affinity with Image-level Supervision forWeakly Supervised Semantic Segmentation [J].
Ahn, Jiwoon ;
Kwak, Suha .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4981-4990
[3]   Video Foreground Extraction Using Multi-View Receptive Field and EncoderDecoder DCNN for Traffic and Surveillance Applications [J].
Akilan, Thangarajah ;
Wu, Q. M. Jonathan ;
Zhang, Wandong .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (10) :9478-9493
[4]  
Chang YT, 2020, PROC CVPR IEEE, P8988, DOI 10.1109/CVPR42600.2020.00901
[5]  
Chen L. -C., 2014, Semantic image segmentation with deep convolutional nets and fully connected crfs
[6]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]  
Chen LY, 2020, Img Proc Comp Vis Re, V12371, P347, DOI 10.1007/978-3-030-58574-7_21
[9]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[10]   Semi-Supervised Video Object Segmentation via Learning Object-Aware Global-Local Correspondence [J].
Fan, Jiaqing ;
Liu, Bo ;
Zhang, Kaihua ;
Liu, Qingshan .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) :8153-8164