Full-Scale Selective Transformer for Semantic Segmentation

Cited by: 0

Authors:

Lin, Fangjian [1, 2, 3]

Wu, Sitong [2]

Ma, Yizhe [1]

Tian, Shengwei [1]

Affiliations:

[1] Xinjiang Univ, Sch Software, Urumqi, Peoples R China

[2] Baidu VIS, Beijing, Peoples R China

[3] Baidu Res, Inst Deep Learning, Beijing, Peoples R China

Source:

COMPUTER VISION - ACCV 2022, PT VII | 2023 / Vol. 13847

Keywords:

Semantic segmentation; Transformer; Full-scale feature fusion

DOI:

10.1007/978-3-031-26293-7_19

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we rethink multi-scale feature fusion from two perspectives (scale-level and spatial-level) and propose a full-scale selective fusion strategy for semantic segmentation. Based on this strategy, we design a novel segmentation network, named Full-scale Selective Transformer (FSFormer). Specifically, FSFormer adaptively selects a subset of the tokens from all scales to construct a token subset of interest for each scale. Each token therefore interacts only with the tokens in its corresponding subset of interest. The proposed full-scale selective fusion strategy not only filters out the propagation of noisy information but also reduces the computational cost to some extent. We evaluate FSFormer on four challenging semantic segmentation benchmarks (PASCAL Context, ADE20K, COCO-Stuff 10K, and Cityscapes), where it outperforms state-of-the-art methods.
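The selection idea described in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the abstract does not say how tokens are scored or selected, so the scoring rule here (similarity to the mean token of a scale), the top-k cutoff `k`, and all function names are assumptions made purely for illustration.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def mean_vector(vectors):
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def select_subset(scale_tokens, all_tokens, k):
    # ASSUMPTION: rank every token (from every scale) by its similarity to
    # this scale's mean token and keep the top k. The paper's actual
    # selection rule is not given in the abstract.
    query = mean_vector(scale_tokens)
    ranked = sorted(all_tokens, key=lambda t: dot(query, t), reverse=True)
    return ranked[:k]

def attend(token, subset):
    # Softmax dot-product attention of one token over its selected subset.
    d = len(token)
    logits = [dot(token, s) / math.sqrt(d) for s in subset]
    m = max(logits)  # subtract the max for numerical stability
    weights = [math.exp(l - m) for l in logits]
    z = sum(weights)
    return [sum(w * s[i] for w, s in zip(weights, subset)) / z for i in range(d)]

def full_scale_selective_fusion(scales, k):
    # scales: list of scales, each a list of token vectors of equal length.
    all_tokens = [t for tokens in scales for t in tokens]
    fused = []
    for tokens in scales:
        subset = select_subset(tokens, all_tokens, k)  # one subset per scale
        fused.append([attend(t, subset) for t in tokens])
    return fused

# Two scales: a finer one with two tokens and a coarser one with one token.
scales = [[[1.0, 0.0], [0.9, 0.1]], [[0.0, 1.0]]]
fused = full_scale_selective_fusion(scales, k=2)
```

Because each token attends to only `k` selected tokens rather than to every token at every scale, the per-token attention cost in this sketch scales with `k` instead of the total token count, which mirrors the cost reduction the abstract mentions.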


Pages: 310-326

Number of pages: 17

