Learning graph structures with transformer for weakly supervised semantic segmentation

Cited by: 1
Authors
Sun, Wanchun [1]
Feng, Xin [1, 2]
Ma, Hui [3]
Liu, Jingyao [1, 4]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
[2] Changchun Univ Sci & Technol, Chongqing Res Inst, Chongqing 401122, Peoples R China
[3] Anhui Vocat Coll Police Officers, Comp Basic Teaching & Res Dept, Hefei 232001, Peoples R China
[4] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Peoples R China
Keywords
Weakly supervised; Transformer; Graph convolutional network; Semantic segmentation
DOI
10.1007/s40747-023-01152-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Weakly supervised semantic segmentation (WSSS) is a challenging computer vision task. State-of-the-art semantic segmentation methods are usually based on convolutional neural networks (CNNs), which have two main drawbacks: they cannot properly capture global information, and they fail to activate potential object regions. To avoid these drawbacks, transformers have been explored for the WSSS task, but a transformer by itself does not establish effective semantic associations between different patch tokens. To address this issue, and inspired by the graph convolutional network (GCN), this paper proposes a graph structure that learns the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method, extensive experiments were conducted on the publicly available PASCAL VOC 2012 dataset. The experimental results show that the proposed method achieves a significant performance improvement on the WSSS task and outperforms other state-of-the-art transformer-based methods.
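The abstract does not say how the graph over patch tokens is built or propagated, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each patch token is a graph node, that two nodes are linked when their cosine similarity exceeds a threshold (an illustrative choice), and that features are mixed with the standard GCN propagation rule ReLU(D^{-1/2} A D^{-1/2} X W). The function names `cosine_adjacency` and `gcn_layer` and the `threshold` parameter are hypothetical.

```python
import math


def cosine_adjacency(tokens, threshold=0.5):
    """Build a 0/1 adjacency matrix with self-loops from token cosine similarity.

    Assumption: thresholded cosine similarity is an illustrative way to
    connect patch tokens; the paper may construct its graph differently.
    """
    n = len(tokens)
    # Guard against zero-length vectors by falling back to a norm of 1.0.
    norms = [math.sqrt(sum(x * x for x in t)) or 1.0 for t in tokens]
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                adj[i][j] = 1.0  # self-loop keeps each token's own feature
                continue
            dot = sum(a * b for a, b in zip(tokens[i], tokens[j]))
            sim = dot / (norms[i] * norms[j])
            adj[i][j] = 1.0 if sim > threshold else 0.0
    return adj


def gcn_layer(tokens, adj, weight):
    """One GCN propagation step: ReLU(D^{-1/2} A D^{-1/2} X W)."""
    n = len(tokens)
    d_in = len(tokens[0])
    d_out = len(weight[0])
    # Node degrees; self-loops guarantee every degree is at least 1.
    deg = [sum(row) for row in adj]
    # Symmetric normalisation of the adjacency matrix.
    norm_adj = [
        [adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
        for i in range(n)
    ]
    # Aggregate neighbour features: A_hat @ X.
    agg = [
        [sum(norm_adj[i][k] * tokens[k][c] for k in range(n)) for c in range(d_in)]
        for i in range(n)
    ]
    # Linear projection followed by ReLU.
    return [
        [max(0.0, sum(agg[i][c] * weight[c][o] for c in range(d_in)))
         for o in range(d_out)]
        for i in range(n)
    ]
```

Calling `gcn_layer(tokens, cosine_adjacency(tokens), weight)` on a list of token vectors returns one updated vector per token, where similar tokens have exchanged information and dissimilar ones are left isolated. In a real model the weight matrix would be learned and the tokens would come from a vision transformer.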
Pages: 7511-7521
Number of pages: 11
Related papers
39 in total
  • [1] Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations
    Ahn, Jiwoon
    Cho, Sunghyun
    Kwak, Suha
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2204 - 2213
  • [2] Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation
    Ahn, Jiwoon
    Kwak, Suha
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4981 - 4990
  • [3] Single-Stage Semantic Segmentation from Image Labels
    Araslanov, Nikita
    Roth, Stefan
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4252 - 4261
  • [4] Twin-field quantum key distribution over a 511km optical fibre linking two distant metropolitan areas
    Chen, Jiu-Peng
    Zhang, Chi
    Liu, Yang
    Jiang, Cong
    Zhang, Wei-Jun
    Han, Zhi-Yong
    Ma, Shi-Zhao
    Hu, Xiao-Long
    Li, Yu-Huai
    Liu, Hui
    Zhou, Fei
    Jiang, Hai-Feng
    Chen, Teng-Yun
    Li, Hao
    You, Li-Xing
    Wang, Zhen
    Wang, Xiang-Bin
    Zhang, Qiang
    Pan, Jian-Wei
    [J]. NATURE PHOTONICS, 2021, 15 (08) : 570 - 575
  • [5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [6] Dosovitskiy Alexey, 2020, INT C LEARN REPR ICL, DOI 10.48550/ARXIV.2010.11929
  • [7] Deep graph cut network for weakly-supervised semantic segmentation
    Feng, Jiapei
    Wang, Xinggang
    Liu, Wenyu
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (03)
  • [8] TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization
    Gao, Wei
    Wan, Fang
    Pan, Xingjia
    Peng, Zhiliang
    Tian, Qi
    Han, Zhenjun
    Zhou, Bolei
    Ye, Qixiang
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2866 - 2875
  • [9] Guolei Sun, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12347), P347, DOI 10.1007/978-3-030-58536-5_21
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778