Multi-task approach based on combined CNN-transformer for efficient segmentation and classification of breast tumors in ultrasound images

被引:8
作者
Tagnamas, Jaouad [1 ]
Ramadan, Hiba [1 ]
Yahyaouy, Ali [1 ]
Tairi, Hamid [1 ]
机构
[1] Univ Sidi Mohamed Ben Abdellah, Fac Sci Dhar El Mahraz, Dept Informat, Fes 30000, Morocco
关键词
Breast ultrasound segmentation; Convolutional neural networks; Swin Transformer; ConvNeXt; Efficient channel attention; Coordinate attention module; LESIONS;
D O I
10.1186/s42492-024-00155-w
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Accurate segmentation of breast ultrasound (BUS) images is crucial for early diagnosis and treatment of breast cancer. Further, the task of segmenting lesions in BUS images continues to pose significant challenges due to the limitations of convolutional neural networks (CNNs) in capturing long-range dependencies and obtaining global context information. Existing methods relying solely on CNNs have struggled to address these issues. Recently, ConvNeXts have emerged as a promising architecture for CNNs, while transformers have demonstrated outstanding performance in diverse computer vision tasks, including the analysis of medical images. In this paper, we propose a novel breast lesion segmentation network CS-Net that combines the strengths of ConvNeXt and Swin Transformer models to enhance the performance of the U-Net architecture. Our network operates on BUS images and adopts an end-to-end approach to perform segmentation. To address the limitations of CNNs, we design a hybrid encoder that incorporates modified ConvNeXt convolutions and Swin Transformer. Furthermore, to enhance capturing the spatial and channel attention in feature maps we incorporate the Coordinate Attention Module. Second, we design an Encoder-Decoder Features Fusion Module that facilitates the fusion of low-level features from the encoder with high-level semantic features from the decoder during the image reconstruction. Experimental results demonstrate the superiority of our network over state-of-the-art image segmentation methods for BUS lesions segmentation.
引用
收藏
页数:15
相关论文
共 57 条
  • [1] Multi-path decoder U-Net: A weakly trained real-time segmentation network for object detection and localization in ultrasound scans
    Al-Battal, Abdullah F.
    Lerman, Imanuel R.
    Nguyen, Truong Q.
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 107
  • [2] Dataset of breast ultrasound images
    Al-Dhabyani, Walid
    Gomaa, Mohammed
    Khaled, Hussien
    Fahmy, Aly
    [J]. DATA IN BRIEF, 2020, 28
  • [3] Vision transformer architecture and applications in digital health: a tutorial and survey
    Al-hammuri, Khalid
    Gebali, Fayez
    Kanan, Awos
    Chelvan, Ilamparithi Thirumarai
    [J]. VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2023, 6 (01)
  • [4] BUViTNet: Breast Ultrasound Detection via Vision Transformers
    Ayana, Gelan
    Choe, Se-Woon
    [J]. DIAGNOSTICS, 2022, 12 (11)
  • [5] Advances in medical image analysis with vision Transformers: A review
    Azad, Reza
    Kazerouni, Amirhossein
    Heidari, Moein
    Aghdam, Ehsan Khodapanah
    Molaei, Amirali
    Jia, Yiwei
    Jose, Abin
    Roy, Rijo
    Merhof, Dorit
    [J]. MEDICAL IMAGE ANALYSIS, 2024, 91
  • [6] Ba L. J., 2016, arXiv
  • [7] Deep learning and time series-to-image encoding for financial forecasting
    Barra, Silvio
    Carta, Salvatore Mario
    Corriga, Andrea
    Podda, Alessandro Sebastian
    Recupero, Diego Reforgiato
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (03) : 683 - 692
  • [8] Bray F, 2018, CA-CANCER J CLIN, V68, P1, DOI [DOI 10.3322/CANJCLIN.49.1.33, 10.3322/caac.21492, DOI 10.3322/caac.20115]
  • [9] Breast mass classification in sonography with transfer learning using a deep convolutional neural network and color conversion
    Byra, Michel
    Galperin, Michael
    Ojeda-Fournier, Haydee
    Olson, Linda
    O'Boyle, Mary
    Comstock, Christopher
    Andre, Michael
    [J]. MEDICAL PHYSICS, 2019, 46 (02) : 746 - 755
  • [10] Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9