Transformer-Based Cascade U-shaped Network for Action Segmentation

被引:0
|
作者
Bao, Wenxia [1 ]
Lin, An [1 ]
Huang, Hua [2 ]
Yang, Xianjun [3 ]
Chen, Hemu [4 ]
机构
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei, Peoples R China
[2] China Tobacco Zhejiang Ind Co Ltd, Hangzhou, Zhejiang, Peoples R China
[3] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei, Peoples R China
[4] Anhui Med Univ, Affiliated Hosp 1, Hefei, Peoples R China
关键词
Action Segmentation; Transformer; U-net;
D O I
10.1109/ICIPMC62364.2024.10586708
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Action segmentation requires predicting the action that occurs in each frame of the original video, and existing methods tend to focus on the global relationship of the sequence, ignoring the contextual information at different granularities. To address this problem, this paper proposes a Transformer-based cascaded U-network for action segmentation. The proposed method adopts a cascaded transformer structure, where the feature sequences between the encoder-decoder are connected in a U-shape, which fully combines the global context information as well as the local context information between neighboring frames. The extended temporal convolution as well as the local window attention mechanism are used to enhance the model's ability to perceive long-range action interactions. The proposed method outperforms the current mainstream action segmentation methods on two challenging datasets 50 salads and GTEA.
引用
收藏
页码:157 / 161
页数:5
相关论文
共 50 条
  • [31] CCT-Unet: A U-Shaped Network Based on Convolution Coupled Transformer for Segmentation of Peripheral and Transition Zones in Prostate MRI
    Yan, Yifei
    Liu, Rongzong
    Chen, Haobo
    Zhang, Limin
    Zhang, Qi
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (09) : 4341 - 4351
  • [32] Research on Thyroid CT Image Segmentation Based on U-Shaped Convolutional Neural Network
    Zeng, Yunzhi
    Zhang, Yanfen
    Gong, Ning
    Li, Mei
    Wang, Meili
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [33] Skin Lesion Segmentation Based on U-Shaped Structure Context Encoding and Decoding Network
    Jiang Xinhui
    Li Zhe
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)
  • [34] Small-Body Segmentation Based on Morphological Features with a U-Shaped Network Architecture
    Pugliatti, Mattia
    Maestrini, Michele
    JOURNAL OF SPACECRAFT AND ROCKETS, 2022, 59 (06) : 1821 - 1835
  • [35] Remote Sensing Image Segmentation Based on U-shaped Network with Atrous Spatial Pyramid
    Hou, Yunlong
    Zhu, Lei
    Chen, Qin
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7335 - 7339
  • [36] A Dual-Decoding branch U-shaped semantic segmentation network combining Transformer attention with Decoder: DBUNet
    Wang, Yuefei
    Yu, Xi
    Guo, Xiaoyan
    Wang, Xilei
    Wei, Yuanhong
    Zeng, Shijie
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [37] CSwinDoubleU-Net: A double U-shaped network combined with convolution and Swin Transformer for colorectal polyp segmentation
    Lin, Yuanjie
    Han, Xiaoxiang
    Chen, Keyan
    Zhang, Weikun
    Liu, Qiaohong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [38] A deep supervised transformer U-shaped full-resolution residual network for the segmentation of breast ultrasound image
    Zhou, Jiale
    Hou, Zuoxun
    Lu, Hongyan
    Wang, Wenhan
    Zhao, Wanchen
    Wang, Zenan
    Zheng, Dezhi
    Wang, Shuai
    Tang, Wenzhong
    Qu, Xiaolei
    MEDICAL PHYSICS, 2023, 50 (12) : 7513 - 7524
  • [39] MAUNet: Polyp segmentation network based on multiscale feature fusion of attention U-shaped network structure
    Long, Jianwu
    Lin, Jian
    Liu, Jiayin
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (03)
  • [40] An Improved U-Shaped Network for ROI Segmentation of Skin Lesion images
    Gupta, Rasika
    Kumar, Rajeev
    2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,