Multi-layered self-attention mechanism for weakly supervised semantic segmentation

被引:4
作者
Yaganapu, Avinash [1 ]
Kang, Mingon [1 ]
机构
[1] Univ Nevada Las Vegas, Dept Comp Sci, 4505 S Maryland Pkwy, Las Vegas, NV 89154 USA
基金
美国国家科学基金会;
关键词
Weakly supervised semantic segmentation; Segmentation; Self-attention; Image-level labels;
D O I
10.1016/j.cviu.2023.103886
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly Supervised Semantic Segmentation (WSSS) provides efficient solutions for semantic image segmen-tation using image-level annotations. WSSS requires no pixel-level labeling that Fully Supervised Semantic Segmentation (FSSS) does, which is time-consuming and label-intensive. Most WSSS approaches have leveraged Class Activation Maps (CAM) or Self-Attention (SA) to generate pseudo pixel-level annotations to perform semantic segmentation tasks coupled with fully supervised approaches (e.g., Fully Convolutional Network). However, those approaches often provides incomplete supervision that mainly includes discriminative regions from the last convolutional layer. They may fail to capture regions of low-or intermediate-level features that may not be present in the last convolutional layer. To address the issue, we proposed a novel Multilayered Self-Attention (Multi-SA) method that applies a self-attention module to multiple convolutional layers, and then stack feature maps from the self-attention layers to generate pseudo pixel-level annotations. We demonstrated that integrated feature maps from multiple self-attention layers produce higher coverage in semantic segmentation than using only the last convolutional layer through intensive experiments using standard benchmark datasets.
引用
收藏
页数:9
相关论文
共 42 条
[1]   SEEDED REGION GROWING [J].
ADAMS, R ;
BISCHOF, L .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (06) :641-647
[2]   Learning Pixel-level Semantic Affinity with Image-level Supervision forWeakly Supervised Semantic Segmentation [J].
Ahn, Jiwoon ;
Kwak, Suha .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4981-4990
[3]   MiniNet: An Efficient Semantic Segmentation ConvNet for Real-Time Robotic Applications [J].
Alonso, Inigo ;
Riazuelo, Luis ;
Murillo, Ana C. .
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (04) :1340-1347
[4]   Deep semantic segmentation of natural and medical images: a review [J].
Asgari Taghanaki, Saeid ;
Abhishek, Kumar ;
Cohen, Joseph Paul ;
Cohen-Adad, Julien ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178
[5]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[6]   A survey on deep learning techniques for image and video semantic segmentation [J].
Garcia-Garcia, Alberto ;
Orts-Escolano, Sergio ;
Oprea, Sergiu ;
Villena-Martinez, Victor ;
Martinez-Gonzalez, Pablo ;
Garcia-Rodriguez, Jose .
APPLIED SOFT COMPUTING, 2018, 70 :41-65
[7]  
Hariharan B, 2011, IEEE I CONF COMP VIS, P991, DOI 10.1109/ICCV.2011.6126343
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]   Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing [J].
Huang, Zilong ;
Wang, Xinggang ;
Wang, Jiasi ;
Liu, Wenyu ;
Wang, Jingdong .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7014-7023
[10]   LayerCAM: Exploring Hierarchical Class Activation Maps for Localization [J].
Jiang, Peng-Tao ;
Zhang, Chang-Bin ;
Hou, Qibin ;
Cheng, Ming-Ming ;
Wei, Yunchao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :5875-5888