Dilated transformer: residual axial attention for breast ultrasound image segmentation

被引:15
|
作者
Shen, Xiaoyan [1 ]
Wang, Liangyu [1 ]
Zhao, Yu [1 ]
Liu, Ruibo [1 ]
Qian, Wei [1 ]
Ma, He [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Coll Med & Biol Informat Engn, Shenyang, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang, Peoples R China
[3] Northeastern Univ, Coll Med & Biol Informat Engn, 195 Chuangxin Rd, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Breast ultrasound (US); tumor segmentation; transformer; residual; axial attention; MODEL;
D O I
10.21037/qims-22-33
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: The segmentation of breast ultrasound (US) images has been a challenging task, mainly due to limited data and the inherent image characteristics involved, such as low contrast and speckle noise. Although convolutional neural network-based (CNN-based) methods have made significant progress over the past decade, they lack the ability to model long-range interactions. Recently, the transformer method has been successfully applied to the tasks of computer vision. It has a strong ability to capture distant interactions. However, most transformer-based methods with excellent performance rely on pre-training on large datasets, making it infeasible to directly apply them to medical images analysis, especially that of breast US images with limited high-quality labels. Therefore, it is of great significance to find a robust and efficient transformer-based method for use on small breast US image datasets.Methods: We developed a dilated transformer (DT) method which mainly uses the proposed residual axial attention layers to build encoder blocks and the introduced dilation module (DM) to further increase the receptive field. We evaluated the proposed method on 2 breast US image datasets using the 5-fold cross-validation method. Dataset A was a public dataset with 562 images, while dataset B was a private dataset with 878 images. Ground truth (GT) was delineated by 2 radiologists with more than 5 years of experience. The evaluation was followed by related ablation experiments.Results: The DT was found to be comparable with the state-of-the-art (SOTA) CNN-based method and outperformed the related transformer-based method, medical transformer (MT), on both datasets. Especially on dataset B, the DT outperformed the MT on metrics of Jaccard index (JI) and Dice similarity coefficient (DSC) by 2.67% and 4.68%, respectively. Meanwhile, when compared with Unet, the DT improved JI and DSC by 4.89% and 4.66%, respectively. Moreover, the results of the ablation experiments showed that each add-on part of the DT is important and contributes to the segmentation accuracy.Conclusions: The proposed transformer-based method could achieve advanced segmentation performance on different small breast US image datasets without pretraining.
引用
收藏
页码:4512 / 4528
页数:17
相关论文
共 50 条
  • [31] MSAANet: Multi-scale Axial Attention Network for medical image segmentation
    Zeng, Hao
    Shan, Xinxin
    Feng, Yu
    Wen, Ying
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2291 - 2296
  • [32] Transformer-based heart organ segmentation using a novel axial attention and fusion mechanism
    Addo, Addae Emmanuel
    Gedeon, Kashala Kabe
    Liu, Zhe
    IMAGING SCIENCE JOURNAL, 2024, 72 (01): : 121 - 139
  • [33] Ultrasound image segmentation based on Transformer and U-Net with joint loss
    Cai, Lina
    Li, Qingkai
    Zhang, Junhua
    Zhang, Zhenghua
    Yang, Rui
    Zhang, Lun
    PEERJ COMPUTER SCIENCE, 2023, 9 : 1 - 18
  • [34] Residual-atrous attention network for lumbosacral plexus segmentation with MR image
    Zhao, Junyong
    Sun, Liang
    Zhou, Xin
    Huang, Shuo
    Si, Haipeng
    Zhang, Daoqiang
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2022, 100
  • [35] Breast ultrasound image segmentation: A coarse-to-fine fusion convolutional neural network
    Wang, Ke
    Liang, Shujun
    Zhong, Shengzhou
    Feng, Qianjin
    Ning, Zhenyuan
    Zhang, Yu
    MEDICAL PHYSICS, 2021, 48 (08) : 4262 - 4278
  • [36] Residual SwinV2 transformer coordinate attention network for image super resolution
    Lei, Yushi
    Zhu, Zhengwei
    Qin, Yilin
    Zhu, Chenyang
    Zhu, Yanping
    AI COMMUNICATIONS, 2024, 37 (04) : 693 - 709
  • [37] COMFormer: Classification of Maternal–Fetal and Brain Anatomy Using a Residual Cross-Covariance Attention Guided Transformer in Ultrasound
    Sarker, Md Mostafa Kamal
    Singh, Vivek Kumar
    Alsharid, Mohammad
    Hernandez-Cruz, Netzahualcoyotl
    Papageorghiou, Aris T.
    Noble, J. Alison
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2023, 70 (11) : 1417 - 1427
  • [38] Doppler Image-Based Weakly-Supervised Vascular Ultrasound Segmentation with Transformer
    Ning, Guochen
    Liang, Hanying
    Chen, Fang
    Zhang, Xinran
    Liao, Hongen
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [39] Breast Cancer Segmentation From Ultrasound Images Using Multiscale Cascaded Convolution With Residual Attention-Based Double Decoder Network
    Umer, Muhammad Junaid
    Sharif, Muhammad Irfan
    Kim, Jungeun
    IEEE ACCESS, 2024, 12 : 107888 - 107902
  • [40] Breast Ultrasound Image Classification and Segmentation Using Convolutional Neural Networks
    Xie, Xiaozheng
    Shi, Faqiang
    Niu, Jianwei
    Tang, Xiaolan
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 200 - 211