Transformer-Based Cascade U-shaped Network for Action Segmentation

被引:0
|
作者
Bao, Wenxia [1 ]
Lin, An [1 ]
Huang, Hua [2 ]
Yang, Xianjun [3 ]
Chen, Hemu [4 ]
机构
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei, Peoples R China
[2] China Tobacco Zhejiang Ind Co Ltd, Hangzhou, Zhejiang, Peoples R China
[3] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei, Peoples R China
[4] Anhui Med Univ, Affiliated Hosp 1, Hefei, Peoples R China
关键词
Action Segmentation; Transformer; U-net;
D O I
10.1109/ICIPMC62364.2024.10586708
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Action segmentation requires predicting the action that occurs in each frame of the original video, and existing methods tend to focus on the global relationship of the sequence, ignoring the contextual information at different granularities. To address this problem, this paper proposes a Transformer-based cascaded U-network for action segmentation. The proposed method adopts a cascaded transformer structure, where the feature sequences between the encoder-decoder are connected in a U-shape, which fully combines the global context information as well as the local context information between neighboring frames. The extended temporal convolution as well as the local window attention mechanism are used to enhance the model's ability to perceive long-range action interactions. The proposed method outperforms the current mainstream action segmentation methods on two challenging datasets 50 salads and GTEA.
引用
收藏
页码:157 / 161
页数:5
相关论文
共 50 条
  • [21] Improved Colonic Polyp Segmentation Method Based on Double U-Shaped Network
    Liu Jiawei
    Liu Qiaohong
    Xiaoou, Li
    Ling Chen
    Liu Cunjue
    ACTA OPTICA SINICA, 2021, 41 (18)
  • [22] Research on Retinal Vessel Segmentation Algorithm Based on a Modified U-Shaped Network
    He, Xialan
    Wang, Ting
    Yang, Wankou
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [23] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    JunBo Qiao
    Xing Wang
    Ji Chen
    MingTao Liu
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 1895 - 1902
  • [24] MBUTransNet: multi-branch U-shaped network fusion transformer architecture for medical image segmentation
    Qiao, JunBo
    Wang, Xing
    Chen, Ji
    Liu, MingTao
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (10) : 1895 - 1902
  • [25] MAS-UNet: a U-shaped network for prostate segmentation
    Hong, YuQi
    Qiu, Zhao
    Chen, Huajing
    Zhu, Bing
    Lei, Haodong
    FRONTIERS IN MEDICINE, 2023, 10
  • [26] Video summarization with u-shaped transformer
    Yaosen Chen
    Bing Guo
    Yan Shen
    Renshuang Zhou
    Weichen Lu
    Wei Wang
    Xuming Wen
    Xinhua Suo
    Applied Intelligence, 2022, 52 : 17864 - 17880
  • [27] CrowdUNet: Segmentation assisted U-shaped crowd counting network
    Cao, Zhou
    Lyu, Lei
    Qi, Ran
    Wang, Jihua
    NEUROCOMPUTING, 2024, 601
  • [28] Video summarization with u-shaped transformer
    Chen, Yaosen
    Guo, Bing
    Shen, Yan
    Zhou, Renshuang
    Lu, Weichen
    Wang, Wei
    Wen, Xuming
    Suo, Xinhua
    APPLIED INTELLIGENCE, 2022, 52 (15) : 17864 - 17880
  • [29] BTSwin-Unet: 3D U-shaped Symmetrical Swin Transformer-based Network for Brain Tumor Segmentation with Self-supervised Pre-training
    Junjie Liang
    Cihui Yang
    Jingting Zhong
    Xiaoli Ye
    Neural Processing Letters, 2023, 55 : 3695 - 3713
  • [30] BTSwin-Unet: 3D U-shaped Symmetrical Swin Transformer-based Network for Brain Tumor Segmentation with Self-supervised Pre-training
    Liang, Junjie
    Yang, Cihui
    Zhong, Jingting
    Ye, Xiaoli
    NEURAL PROCESSING LETTERS, 2023, 55 (04) : 3695 - 3713