ETFT: Equiangular Tight Frame Transformer for Imbalanced Semantic Segmentation

Authors
Jeong, Seonggyun [1]
Heo, Yong Seok [1, 2]
Affiliations
[1] Ajou Univ, Dept Artificial Intelligence, Suwon 16499, South Korea
[2] Ajou Univ, Dept Elect & Comp Engn, Suwon 16499, South Korea
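Funding
National Research Foundation, Singapore
Keywords
semantic segmentation; neural collapse; class imbalance; transformer
DOI
10.3390/s24216913
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract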
Semantic segmentation often suffers from class imbalance, where the label ratio for each class in the dataset is not uniform. Recent studies have addressed class imbalance in semantic segmentation by leveraging the neural collapse phenomenon in conjunction with an Equiangular Tight Frame (ETF). While the ETF helps enhance the discriminability of minority classes, class correlation is another crucial factor that must be taken into account. However, balancing class correlation against discrimination through neural collapse remains challenging, as these properties inherently conflict with one another. Moreover, this balance is set during the training stage, resulting in a fixed classifier, and there is no guarantee that this classifier will perform consistently well on different input images. To address this problem, we propose the Equiangular Tight Frame Transformer (ETFT), a transformer-based model that jointly processes the features and the classifier using the ETF structure and dynamically generates the classifier as a function of the input for imbalanced semantic segmentation. Specifically, the classifier, initialized with the ETF structure, is jointly processed with the input patch tokens during the attention process. As a result, the transformed patch tokens, aided by the ETF structure, achieve discriminability between classes while preserving contextual correlation. The classifier, initially structured as an ETF, is adjusted through the attention mechanism to incorporate the correlation information. Furthermore, the learned classifier is combined with the fixed ETF classifier, leveraging the advantages of both. Extensive experiments demonstrate that the proposed method outperforms state-of-the-art methods for imbalanced semantic segmentation on both the ADE20K and Cityscapes datasets.
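To make the ETF structure mentioned in the abstract concrete, the following is a minimal sketch of a simplex ETF, the standard construction in the neural collapse literature. It is not the authors' implementation. It assumes the feature dimension equals the number of classes K, so the orthonormal basis is simply the identity, and the helper names (simplex_etf, dot, classify) are hypothetical. Each of the K class vectors has unit norm and every pair has the same inner product, -1/(K-1), which is the largest equal pairwise separation K unit vectors can have.

```python
import math


def simplex_etf(num_classes):
    """Return num_classes unit vectors forming a simplex ETF.

    Assumption for this sketch: feature dimension == num_classes, so the
    frame is sqrt(K / (K - 1)) * (I - (1/K) * ones), taken row by row.
    """
    if num_classes < 2:
        raise ValueError("a simplex ETF needs at least two classes")
    k = num_classes
    scale = math.sqrt(k / (k - 1))
    return [
        [scale * ((1.0 if i == j else 0.0) - 1.0 / k) for j in range(k)]
        for i in range(k)
    ]


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def classify(feature, prototypes):
    """Index of the class vector with the highest inner product."""
    scores = [dot(feature, p) for p in prototypes]
    return max(range(len(scores)), key=scores.__getitem__)


etf = simplex_etf(4)
norms = [math.sqrt(dot(v, v)) for v in etf]
pairwise = [dot(etf[i], etf[j]) for i in range(4) for j in range(i + 1, 4)]
```

Here norms are all 1 and every entry of pairwise equals -1/3 up to floating-point error. A fixed classifier of this kind gives every class, frequent or rare, the same angular margin; the abstract's point is that ETFT additionally adapts the classifier to each input through attention rather than relying on the fixed frame alone.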
Pages: 21