Full-Scale Selective Transformer for Semantic Segmentation

Cited by: 0
Authors
Lin, Fangjian [1 ,2 ,3 ]
Wu, Sitong [2 ]
Ma, Yizhe [1 ]
Tian, Shengwei [1 ]
Affiliations
[1] Xinjiang Univ, Sch Software, Urumqi, Peoples R China
[2] Baidu VIS, Beijing, Peoples R China
[3] Baidu Res, Inst Deep Learning, Beijing, Peoples R China
Source
COMPUTER VISION - ACCV 2022, PT VII | 2023 / Vol. 13847
Keywords
Semantic segmentation; Transformer; Full-scale feature fusion;
DOI
10.1007/978-3-031-26293-7_19
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we rethink multi-scale feature fusion from two perspectives (scale-level and spatial-level) and propose a full-scale selective fusion strategy for semantic segmentation. Based on this strategy, we design a novel segmentation network, named Full-scale Selective Transformer (FSFormer). Specifically, our FSFormer adaptively selects partial tokens from all tokens at all scales to construct a token subset of interest for each scale. Therefore, each token only interacts with the tokens within its corresponding token subset of interest. The proposed full-scale selective fusion strategy can not only filter out the noisy information propagation but also reduce the computational costs to some extent. We evaluate our FSFormer on four challenging semantic segmentation benchmarks, including PASCAL Context, ADE20K, COCO-Stuff 10K, and Cityscapes, outperforming the state-of-the-art methods.
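The selection idea described in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' FSFormer implementation: the scoring rule (dot product with a scale's mean token), the fixed subset size `k`, and the function names `select_subset` and `selective_fusion` are all assumptions made purely for illustration. The sketch only shows the general pattern of picking a token subset per scale from the tokens of all scales and then restricting attention to that subset.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]


def select_subset(scale_tokens, all_tokens, k):
    """Return sorted indices of the k tokens (from all scales) kept for one scale.

    ASSUMPTION: relevance is the dot product with the scale's mean token.
    The abstract does not specify the selection rule; this one is hypothetical.
    """
    dim = len(scale_tokens[0])
    mean = [sum(t[d] for t in scale_tokens) / len(scale_tokens) for d in range(dim)]
    scores = [dot(mean, t) for t in all_tokens]
    ranked = sorted(range(len(all_tokens)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def selective_fusion(scales, k):
    """For each scale, attend only to that scale's selected token subset."""
    all_tokens = [t for s in scales for t in s]  # tokens of every scale, flattened
    fused = []
    for scale_tokens in scales:
        subset = [all_tokens[i] for i in select_subset(scale_tokens, all_tokens, k)]
        out = []
        for q in scale_tokens:
            # Scaled dot-product attention restricted to the subset.
            weights = softmax([dot(q, t) / math.sqrt(len(q)) for t in subset])
            out.append([sum(w * t[d] for w, t in zip(weights, subset))
                        for d in range(len(q))])
        fused.append(out)
    return fused


# Toy input: two "scales" with 2-dimensional tokens.
scales = [[[1.0, 0.0], [0.9, 0.1]], [[0.0, 1.0]]]
fused = selective_fusion(scales, k=2)
```

With `k` smaller than the total number of tokens, each query attends to `k` tokens instead of all of them, which is the sense in which a selective scheme of this kind can reduce attention cost.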
Pages: 310-326
Page count: 17