Learning graph structures with transformer for weakly supervised semantic segmentation

被引：1

作者：

Sun, Wanchun ^{[1
]}

Feng, Xin ^{[1
,2
]}

Ma, Hui ^{[3
]}

Liu, Jingyao ^{[1
,4
]}

机构：

[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China

[2] Changchun Univ Sci & Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China

[3] Anhui Vocat Coll Police Officers, Comp Basic Teaching & Res Dept, Hefei 232001, Peoples R China

[4] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Peoples R China

来源：

COMPLEX & INTELLIGENT SYSTEMS | 2023年 / 9卷 / 06期

关键词：

Weakly supervised; Transformer; Graph convolutional network; Semantic segmentation;

D O I：

10.1007/s40747-023-01152-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised semantic segmentation (WSSS) is a challenging task of computer vision. The state-of-the-art semantic segmentation methods are usually based on the convolutional neural network (CNN), which mainly have the drawbacks of inability to explore the global information correctly and failure to activate potential object regions. To avoid such drawbacks, the transformer approach is explored in the WSSS task, but no effective semantic association between different patch tokens can be determined in the transformer. To address this issue, inspired by the graph convolutional network (GCN), this paper proposes a graph structure to learn the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method in this paper, a large number of experiments were conducted on the publicly available PASCAL VOC2012 dataset. The experimental results show that our proposed method achieves significant performance improvement in the WSSS task and outperforms other state-of-the-art transformer-based methods.

引用

页码：7511 / 7521

页数：11

共 39 条

[21] Segmenter: Transformer for Semantic Segmentation
Strudel, Robin
Garcia, Ricardo
Laptev, Ivan
Schmid, Cordelia
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7242 - 7252
[22] Touvron H, 2021, PR MACH LEARN RES, V139, P7358
[23] Velickovic P, 2017, ARXIV
[24] Co-attention dictionary network for weakly-supervised semantic segmentation
Wan, Weitao
Chen, Jiansheng
Yang, Ming-Hsuan
Ma, Huimin
[J]. NEUROCOMPUTING, 2022, 486 : 272 - 285
[25] HCP: A Flexible CNN Framework for Multi-Label Image Classification
Wei, Yunchao
Xia, Wei
Lin, Min
Huang, Junshi
Ni, Bingbing
Dong, Jian
Zhao, Yao
Yan, Shuicheng
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (09) : 1901 - 1907
[26] Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation
Wu, Tong
Huang, Junshi
Gao, Guangyu
Wei, Xiaoming
Wei, Xiaolin
Luo, Xuan
Liu, Chi Harold
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16760 - 16769
[27] Xu W., 2021, Proceedings of the IEEE/CVF International Conference on Computer Vision, P6984
[28] Weakly-supervised semantic segmentation with superpixel guided local and global consistency
Yi, Sheng
Ma, Huimin
Wang, Xiang
Hu, Tianyu
Li, Xi
Wang, Yu
[J]. PATTERN RECOGNITION, 2022, 124
[29] Yu-Ting Chang, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P8988, DOI 10.1109/CVPR42600.2020.00901
[30] Yude Wang, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P12272, DOI 10.1109/CVPR42600.2020.01229

← 1 2 3 4 →