Transformer-Based Cascade U-shaped Network for Action Segmentation

Cited by: 0
Authors
Bao, Wenxia [1]
Lin, An [1]
Huang, Hua [2]
Yang, Xianjun [3]
Chen, Hemu [4]
Affiliations
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei, Peoples R China
[2] China Tobacco Zhejiang Ind Co Ltd, Hangzhou, Zhejiang, Peoples R China
[3] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei, Peoples R China
[4] Anhui Med Univ, Affiliated Hosp 1, Hefei, Peoples R China
Keywords
Action Segmentation; Transformer; U-net
DOI
10.1109/ICIPMC62364.2024.10586708
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Action segmentation requires predicting the action that occurs in each frame of the original video. Existing methods tend to focus on the global relationships within the sequence and ignore contextual information at different granularities. To address this problem, this paper proposes a Transformer-based cascaded U-shaped network for action segmentation. The method adopts a cascaded Transformer structure in which the feature sequences of the encoder and decoder are connected in a U shape, combining global context with the local context between neighboring frames. Extended temporal convolution and a local window attention mechanism are used to enhance the model's ability to perceive long-range action interactions. The proposed method outperforms current mainstream action segmentation methods on two challenging datasets, 50Salads and GTEA.
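The two temporal building blocks named in the abstract can be illustrated with a minimal sketch. This record does not give the paper's implementation, so everything here is an assumption: "extended temporal convolution" is read as a dilated 1-D convolution over frames, the attention uses the raw features as queries, keys and values with no learned projections, and the function names `dilated_temporal_conv` and `local_window_attention` are hypothetical.

```python
import math


def dilated_temporal_conv(x, kernel, dilation):
    """1-D convolution over time with `dilation` frames between kernel taps.

    x: list of T floats (a single feature channel, for simplicity).
    kernel: odd-length list of weights (assumed, not taken from the paper).
    Taps that fall outside the sequence are treated as zero, so the
    output has the same length T as the input.
    """
    T, K = len(x), len(kernel)
    half = K // 2
    out = []
    for t in range(T):
        acc = 0.0
        for k in range(K):
            s = t + (k - half) * dilation  # source frame for this tap
            if 0 <= s < T:
                acc += kernel[k] * x[s]
        out.append(acc)
    return out


def local_window_attention(x, window):
    """Self-attention in which frame t attends only to frames t-window..t+window.

    x: list of T feature vectors (each a list of C floats).
    Returns (outputs, weights); weights[t][s] is 0.0 outside the window.
    """
    T, C = len(x), len(x[0])
    outputs, weights = [], []
    for t in range(T):
        lo, hi = max(0, t - window), min(T, t + window + 1)
        # Scaled dot-product scores against the frames inside the window.
        scores = [
            sum(a * b for a, b in zip(x[t], x[s])) / math.sqrt(C)
            for s in range(lo, hi)
        ]
        m = max(scores)  # subtract the max for a numerically stable softmax
        exps = [math.exp(v - m) for v in scores]
        z = sum(exps)
        row = [0.0] * T
        for s, e in zip(range(lo, hi), exps):
            row[s] = e / z
        weights.append(row)
        outputs.append(
            [sum(row[s] * x[s][c] for s in range(lo, hi)) for c in range(C)]
        )
    return outputs, weights
```

In this sketch, a larger `dilation` widens the temporal span one convolution layer covers without adding weights, while `window` limits each frame's attention to its neighbors. How the paper actually combines and stacks these operations is not stated in this record.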
Pages: 157-161
Number of pages: 5
Related papers
50 in total
  • [41] MRU-NET: A U-Shaped Network for Retinal Vessel Segmentation
    Ding, Hongwei
    Cui, Xiaohui
    Chen, Leiyang
    Zhao, Kun
    APPLIED SCIENCES-BASEL, 2020, 10 (19)
  • [42] Double U-Net: semi-supervised ultrasound image segmentation combining CNN and transformer's U-shaped network
    Zhou, Huabiao
    Luo, Yanmin
    Guo, Jingjing
    Chen, Zhikui
    Gong, Wanyuan
    Lin, Zhongwei
    Zhuo, Minling
    Lin, Youjia
    Lin, Weiwei
    Shen, Qingling
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (05)
  • [43] A combination of multi-scale and attention based on the U-shaped network for retinal vessel segmentation
    Zhang, Yan
    Lan, Qingyan
    Sun, Yemei
    Ma, Chunming
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (02)
  • [44] MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation
    Xi, Heran
    Dong, Haoji
    Sheng, Yue
    Cui, Hui
    Huang, Chengying
    Li, Jinbao
    Zhu, Jinghua
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01)
  • [45] UCDnet: Double U-Shaped Segmentation Network Cascade Centroid Map Prediction for Infrared Weak Small Target Detection
    Xu, Xiangdong
    Wang, Jiarong
    Zhu, Ming
    Sun, Haijiang
    Wu, Zhenyuan
    Wang, Yao
    Cao, Shenyi
    Liu, Sanzai
    REMOTE SENSING, 2023, 15 (15)
  • [46] Skin lesion image segmentation based on lightweight multi-scale U-shaped network
    Zhou, Pengfei
    Liu, Xuefeng
    Xiong, Jichuan
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2023, 9 (05)
  • [47] Joint optic disc and cup segmentation based on multi-module U-shaped network
    Zhu, Qianlong
    Luo, Gaohui
    Chen, Xinjian
    Shi, Fei
    Pan, Lingjiao
    Zhu, Weifang
    MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
  • [48] UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
    Fang, Kun
    He, Baochun
    Liu, Libo
    Hu, Haoyu
    Fang, Chihua
    Huang, Xuguang
    Jia, Fucang
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2023, 13 (03) : 1619 - 1630
  • [49] A transformer-based generative adversarial network for brain tumor segmentation
    Huang, Liqun
    Zhu, Enjun
    Chen, Long
    Wang, Zhaoyang
    Chai, Senchun
    Zhang, Baihai
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [50] USMLP: U-shaped Sparse-MLP network for mass segmentation in mammograms
    Luo, Jiaming
    Tang, Yongzhe
    Wang, Jie
    Lu, Hongtao
    IMAGE AND VISION COMPUTING, 2023, 137