Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation

被引:37
作者
Chen, Tao [1 ]
Yao, Yazhou [1 ]
Zhang, Lei [2 ]
Wang, Qiong [1 ]
Xie, Guo-Sen [1 ]
Shen, Fumin [3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Semantic segmentation; weak supervision; saliency guidance; relation constraint;
D O I
10.1109/TMM.2022.3157481
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Weakly supervised semantic segmentation with only image-level labels aims to reduce annotation costs for the segmentation task. Existing approaches generally leverage class activation maps (CAMs) to locate the object regions for pseudo label generation. However, CAMs can only discover the most discriminative parts of objects, thus leading to inferior pixel-level pseudo labels. To address this issue, we propose a saliency guided Inter- and Intra-Class Relation Constrained ((ICRC)-C-2) framework to assist the expansion of the activated object regions in CAMs. Specifically, we propose a saliency guided class-agnostic distance module to pull the intra-category features closer by aligning features to their class prototypes. Further, we propose a class-specific distance module to push the inter-class features apart and encourage the object region to have a higher activation than the background. Besides strengthening the capability of the classification network to activate more integral object regions in CAMs, we also introduce an object guided label refinement module to take a full use of both the segmentation prediction and the initial labels for obtaining superior pseudo-labels. Extensive experiments on PASCAL VOC 2012 and COCO datasets demonstrate well the effectiveness of I-2 CRC over other state-of-the-art counterparts.
引用
收藏
页码:1727 / 1737
页数:11
相关论文
共 67 条
[1]   Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations [J].
Ahn, Jiwoon ;
Cho, Sunghyun ;
Kwak, Suha .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2204-2213
[2]   Learning Pixel-level Semantic Affinity with Image-level Supervision forWeakly Supervised Semantic Segmentation [J].
Ahn, Jiwoon ;
Kwak, Suha .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4981-4990
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   What's the Point: Semantic Segmentation with Point Supervision [J].
Bearman, Amy ;
Russakovsky, Olga ;
Ferrari, Vittorio ;
Fei-Fei, Li .
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :549-565
[5]   Webly Supervised Semantic Segmentation [J].
Bin Jin ;
Segovia, Maria V. Ortiz ;
Susstrunk, Sabine .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1705-1714
[6]   Weakly-Supervised Semantic Segmentation via Sub-category Exploration [J].
Chang, Yu-Ting ;
Wang, Qiaosong ;
Hung, Wei-Chih ;
Piramuthu, Robinson ;
Tsai, Yi-Hsuan ;
Yang, Ming-Hsuan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8988-8997
[7]  
Chaudhry A., 2017, PROC BRIT MACH VIS C
[8]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[9]   Weakly Supervised Semantic Segmentation with Boundary Exploration [J].
Chen, Liyi ;
Wu, Weiwei ;
Fu, Chenchen ;
Han, Xiao ;
Zhang, Yuntao .
COMPUTER VISION - ECCV 2020, PT XXVI, 2020, 12371 :347-362
[10]   Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation [J].
Chen, Tao ;
Wang, Shui-Hua ;
Wang, Qiong ;
Zhang, Zheng ;
Xie, Guo-Sen ;
Tang, Zhenmin .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1042-1054