Dilated transformer: residual axial attention for breast ultrasound image segmentation

被引:15
|
作者
Shen, Xiaoyan [1 ]
Wang, Liangyu [1 ]
Zhao, Yu [1 ]
Liu, Ruibo [1 ]
Qian, Wei [1 ]
Ma, He [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Coll Med & Biol Informat Engn, Shenyang, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang, Peoples R China
[3] Northeastern Univ, Coll Med & Biol Informat Engn, 195 Chuangxin Rd, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Breast ultrasound (US); tumor segmentation; transformer; residual; axial attention; MODEL;
D O I
10.21037/qims-22-33
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: The segmentation of breast ultrasound (US) images has been a challenging task, mainly due to limited data and the inherent image characteristics involved, such as low contrast and speckle noise. Although convolutional neural network-based (CNN-based) methods have made significant progress over the past decade, they lack the ability to model long-range interactions. Recently, the transformer method has been successfully applied to the tasks of computer vision. It has a strong ability to capture distant interactions. However, most transformer-based methods with excellent performance rely on pre-training on large datasets, making it infeasible to directly apply them to medical images analysis, especially that of breast US images with limited high-quality labels. Therefore, it is of great significance to find a robust and efficient transformer-based method for use on small breast US image datasets.Methods: We developed a dilated transformer (DT) method which mainly uses the proposed residual axial attention layers to build encoder blocks and the introduced dilation module (DM) to further increase the receptive field. We evaluated the proposed method on 2 breast US image datasets using the 5-fold cross-validation method. Dataset A was a public dataset with 562 images, while dataset B was a private dataset with 878 images. Ground truth (GT) was delineated by 2 radiologists with more than 5 years of experience. The evaluation was followed by related ablation experiments.Results: The DT was found to be comparable with the state-of-the-art (SOTA) CNN-based method and outperformed the related transformer-based method, medical transformer (MT), on both datasets. Especially on dataset B, the DT outperformed the MT on metrics of Jaccard index (JI) and Dice similarity coefficient (DSC) by 2.67% and 4.68%, respectively. Meanwhile, when compared with Unet, the DT improved JI and DSC by 4.89% and 4.66%, respectively. Moreover, the results of the ablation experiments showed that each add-on part of the DT is important and contributes to the segmentation accuracy.Conclusions: The proposed transformer-based method could achieve advanced segmentation performance on different small breast US image datasets without pretraining.
引用
收藏
页码:4512 / 4528
页数:17
相关论文
共 50 条
  • [41] RT-Unet: An advanced network based on residual network and transformer for medical image segmentation
    Li, Bo
    Liu, Sikai
    Wu, Fei
    Li, GuangHui
    Zhong, Meiling
    Guan, Xiaohui
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8565 - 8582
  • [42] Dense Residual Transformer for Image Denoising
    Yao, Chao
    Jin, Shuo
    Liu, Meiqin
    Ban, Xiaojuan
    ELECTRONICS, 2022, 11 (03)
  • [43] DAE-Former: Dual Attention-Guided Efficient Transformer for Medical Image Segmentation
    Azad, Reza
    Arimond, Rene
    Aghdam, Ehsan Khodapanah
    Kazerouni, Amirhossein
    Merhof, Dorit
    PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2023, 2023, 14277 : 83 - 95
  • [44] D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation
    Li, Haiyan
    Ding, Jiayu
    Shi, Xin
    Zhang, Qi
    Yu, Pengfei
    Li, Hongsong
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01):
  • [45] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
    Chen, Shaolong
    Qiu, Changzhen
    Yang, Weiping
    Zhang, Zhiyong
    SENSORS, 2022, 22 (10)
  • [46] Dense Dilated Multi-Scale Supervised Attention-Guided Network for histopathology image segmentation
    Das, Rangan
    Bose, Shirsha
    Chowdhury, Ritesh Sur
    Maulik, Ujjwal
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [47] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Wu, Yixuan
    Liao, Kuanlun
    Chen, Jintai
    Wang, Jinhong
    Chen, Danny Z.
    Gao, Honghao
    Wu, Jian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (02): : 1931 - 1944
  • [48] D-former: a U-shaped Dilated Transformer for 3D medical image segmentation
    Yixuan Wu
    Kuanlun Liao
    Jintai Chen
    Jinhong Wang
    Danny Z. Chen
    Honghao Gao
    Jian Wu
    Neural Computing and Applications, 2023, 35 : 1931 - 1944
  • [49] MultiIB-TransUNet: Transformer with multiple information bottleneck blocks for CT and ultrasound image segmentation
    Li, Guangju
    Jin, Dehu
    Yu, Qi
    Zheng, Yuanjie
    Qi, Meng
    MEDICAL PHYSICS, 2024, 51 (02) : 1178 - 1189
  • [50] TAGNet: A transformer-based axial guided network for bile duct segmentation
    Zhou, Guang-Quan
    Zhao, Fuxing
    Yang, Qing-Han
    Wang, Kai-Ni
    Li, Shengxiao
    Zhou, Shoujun
    Lu, Jian
    Chen, Yang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86