Learning graph structures with transformer for weakly supervised semantic segmentation

被引:1
作者
Sun, Wanchun [1 ]
Feng, Xin [1 ,2 ]
Ma, Hui [3 ]
Liu, Jingyao [1 ,4 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
[2] Changchun Univ Sci & Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China
[3] Anhui Vocat Coll Police Officers, Comp Basic Teaching & Res Dept, Hefei 232001, Peoples R China
[4] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Peoples R China
关键词
Weakly supervised; Transformer; Graph convolutional network; Semantic segmentation;
D O I
10.1007/s40747-023-01152-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised semantic segmentation (WSSS) is a challenging task of computer vision. The state-of-the-art semantic segmentation methods are usually based on the convolutional neural network (CNN), which mainly have the drawbacks of inability to explore the global information correctly and failure to activate potential object regions. To avoid such drawbacks, the transformer approach is explored in the WSSS task, but no effective semantic association between different patch tokens can be determined in the transformer. To address this issue, inspired by the graph convolutional network (GCN), this paper proposes a graph structure to learn the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method in this paper, a large number of experiments were conducted on the publicly available PASCAL VOC2012 dataset. The experimental results show that our proposed method achieves significant performance improvement in the WSSS task and outperforms other state-of-the-art transformer-based methods.
引用
收藏
页码:7511 / 7521
页数:11
相关论文
共 39 条
  • [21] Segmenter: Transformer for Semantic Segmentation
    Strudel, Robin
    Garcia, Ricardo
    Laptev, Ivan
    Schmid, Cordelia
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7242 - 7252
  • [22] Touvron H, 2021, PR MACH LEARN RES, V139, P7358
  • [23] Velickovic P, 2017, ARXIV
  • [24] Co-attention dictionary network for weakly-supervised semantic segmentation
    Wan, Weitao
    Chen, Jiansheng
    Yang, Ming-Hsuan
    Ma, Huimin
    [J]. NEUROCOMPUTING, 2022, 486 : 272 - 285
  • [25] HCP: A Flexible CNN Framework for Multi-Label Image Classification
    Wei, Yunchao
    Xia, Wei
    Lin, Min
    Huang, Junshi
    Ni, Bingbing
    Dong, Jian
    Zhao, Yao
    Yan, Shuicheng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (09) : 1901 - 1907
  • [26] Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation
    Wu, Tong
    Huang, Junshi
    Gao, Guangyu
    Wei, Xiaoming
    Wei, Xiaolin
    Luo, Xuan
    Liu, Chi Harold
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16760 - 16769
  • [27] Xu W., 2021, Proceedings of the IEEE/CVF International Conference on Computer Vision, P6984
  • [28] Weakly-supervised semantic segmentation with superpixel guided local and global consistency
    Yi, Sheng
    Ma, Huimin
    Wang, Xiang
    Hu, Tianyu
    Li, Xi
    Wang, Yu
    [J]. PATTERN RECOGNITION, 2022, 124
  • [29] Yu-Ting Chang, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P8988, DOI 10.1109/CVPR42600.2020.00901
  • [30] Yude Wang, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P12272, DOI 10.1109/CVPR42600.2020.01229