Sparse Coding Inspired LSTM and Self-Attention Integration for Medical Image Segmentation

Cited by: 0
Authors
Ji, Zexuan [1]
Ye, Shunlong [1]
Ma, Xiao [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Sparse coding; contextual module; LSTM; self-attention; medical image segmentation; NETWORK; 2D; CLASSIFICATION;
DOI
10.1109/TIP.2024.3482189
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Accurate and automatic segmentation of medical images plays an essential role in clinical diagnosis and analysis. It has been established that integrating contextual relationships substantially enhances the representational ability of neural networks. Conventionally, Long Short-Term Memory (LSTM) and Self-Attention (SA) mechanisms have been recognized for their proficiency in capturing global dependencies within data. However, these mechanisms have typically been viewed as distinct modules without a direct linkage. This paper presents the integration of LSTM design with SA sparse coding as a key innovation. It uses linear combinations of LSTM states for SA's query, key, and value (QKV) matrices to leverage LSTM's capability for state compression and historical data retention. This approach aims to rectify the shortcomings of conventional sparse coding methods that overlook temporal information, thereby enhancing SA's ability to do sparse coding and capture global dependencies. Building upon this premise, we introduce two innovative modules that weave the SA matrix into the LSTM state design in distinct manners, enabling LSTM to more adeptly model global dependencies and meld seamlessly with SA without accruing extra computational demands. Both modules are separately embedded into the U-shaped convolutional neural network architecture for handling both 2D and 3D medical images. Experimental evaluations on downstream medical image segmentation tasks reveal that our proposed modules not only excel on four extensively utilized datasets across various baselines but also enhance prediction accuracy, even on baselines that have already incorporated contextual modules. Code is available at https://github.com/yeshunlong/SALSTM.
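The abstract's core idea, forming SA's query, key, and value matrices as linear combinations of LSTM states, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see their repository for that). The function names, the choice to concatenate hidden and cell states before projecting, the weight shapes, and the random inputs are all assumptions made for illustration only.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(a, axis=-1):
    # Subtract the row maximum for numerical stability.
    a = a - a.max(axis=axis, keepdims=True)
    e = np.exp(a)
    return e / e.sum(axis=axis, keepdims=True)

def lstm_states(x, W, U, b):
    """Run a single-layer LSTM over x of shape (T, d_in).

    Returns the hidden states H and cell states C, each of shape (T, d).
    """
    T = x.shape[0]
    d = U.shape[0]
    h = np.zeros(d)
    c = np.zeros(d)
    H = np.zeros((T, d))
    C = np.zeros((T, d))
    for t in range(T):
        z = x[t] @ W + h @ U + b          # pre-activations, shape (4d,)
        i, f, o, g = np.split(z, 4)       # input, forget, output gates and candidate
        i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
        g = np.tanh(g)
        c = f * c + i * g                 # cell state keeps compressed history
        h = o * np.tanh(c)
        H[t], C[t] = h, c
    return H, C

def attention_from_lstm_states(H, C, Wq, Wk, Wv):
    """Hypothetical helper: Q, K, V are linear combinations of LSTM states.

    Assumption: the states are concatenated as [H, C] and projected linearly.
    """
    S = np.concatenate([H, C], axis=1)    # (T, 2d)
    Q, K, V = S @ Wq, S @ Wk, S @ Wv      # each (T, d)
    A = softmax(Q @ K.T / np.sqrt(Q.shape[1]))  # (T, T) attention weights
    return A @ V, A

# Toy example with random data (illustrative only, not from the paper).
rng = np.random.default_rng(0)
T, d_in, d = 6, 8, 4
x = rng.normal(size=(T, d_in))
W = rng.normal(scale=0.1, size=(d_in, 4 * d))
U = rng.normal(scale=0.1, size=(d, 4 * d))
b = np.zeros(4 * d)
H, C = lstm_states(x, W, U, b)
Wq, Wk, Wv = (rng.normal(scale=0.1, size=(2 * d, d)) for _ in range(3))
out, A = attention_from_lstm_states(H, C, Wq, Wk, Wv)
print(out.shape, A.shape)
```

In this sketch each token attends over all positions through the attention matrix A, while Q, K, and V inherit the history that the LSTM has compressed into its states. How the paper actually weaves the SA matrix into the LSTM state design is specified in the paper and the linked code, not here.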
Pages: 6098 - 6113
Page count: 16
Related papers
50 in total
  • [1] ISC-TRANSUNET: MEDICAL IMAGE SEGMENTATION NETWORK BASED ON THE INTEGRATION OF SELF-ATTENTION AND CONVOLUTION
    Li, Fang
    Pei, Siyu
    Zhang, Ziqun
    Yang, Fuming
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2023, 23 (09)
  • [2] Sparse Self-Attention LSTM for Sentiment Lexicon Construction
    Deng, Dong
    Jing, Liping
    Yu, Jian
    Sun, Shaolong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1777 - 1790
  • [3] Research of Self-Attention in Image Segmentation
    Cao, Fude
    Zheng, Chunguang
    Huang, Limin
    Wang, Aihua
    Zhang, Jiong
    Zhou, Feng
    Ju, Haoxue
    Guo, Haitao
    Du, Yuxia
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [4] Self-Attention Technology in Image Segmentation
    Cao, Fude
    Lu, Xueyun
    INTERNATIONAL CONFERENCE ON INTELLIGENT TRAFFIC SYSTEMS AND SMART CITY (ITSSC 2021), 2022, 12165
  • [5] Sparse self-attention transformer for image inpainting
    Huang, Wenli
    Deng, Ye
    Hui, Siqi
    Wu, Yang
    Zhou, Sanping
    Wang, Jinjun
    PATTERN RECOGNITION, 2024, 145
  • [6] DI-Unet: Dimensional interaction self-attention for medical image segmentation
    Wu, Yanlin
    Wang, Guanglei
    Wang, Zhongyang
    Wang, Hongrui
    Li, Yan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [7] Transformer with sparse self-attention mechanism for image captioning
    Wang, Duofeng
    Hu, Haifeng
    Chen, Dihu
    ELECTRONICS LETTERS, 2020, 56 (15) : 764 - +
  • [8] Weakly supervised histopathology image segmentation with self-attention
    Li, Kailu
    Qian, Ziniu
    Han, Yingnan
    Chang, Eric I-Chao
    Wei, Bingzheng
    Lai, Maode
    Liao, Jing
    Fan, Yubo
    Xu, Yan
    MEDICAL IMAGE ANALYSIS, 2023, 86
  • [9] SACA-UNet: Medical Image Segmentation Network Based on Self-Attention and ASPP
    Fan, Gaojuan
    Wang, Jie
    Zhang, Chongsheng
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 317 - 322
  • [10] Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
    Tang, Chuanxin
    Zhao, Yucheng
    Wang, Guangting
    Luo, Chong
    Xie, Wenxuan
    Zeng, Wenjun
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2344 - 2351