Multi-task approach based on combined CNN-transformer for efficient segmentation and classification of breast tumors in ultrasound images

Cited by: 0
Authors
Jaouad Tagnamas
Hiba Ramadan
Ali Yahyaouy
Hamid Tairi
Affiliation
[1] University of Sidi Mohamed Ben Abdellah, Department of Informatics, Faculty of Sciences Dhar El Mahraz
Source
Visual Computing for Industry, Biomedicine, and Art | Vol. 7
Keywords
Breast ultrasound segmentation and classification; Breast tumors; Convolutional neural networks; Self-attention; MLP-Mixer; Channel attention
DOI
Not available
Abstract
Inspired by the success of Transformers in natural language processing, many applications of Vision Transformers (ViTs) have been investigated in medical image analysis, including breast ultrasound (BUS) image segmentation and classification. In this paper, we propose an efficient multi-task framework that segments and classifies tumors in BUS images using a hybrid convolutional neural network (CNN)-ViT architecture and a Multi-Layer Perceptron (MLP)-Mixer. The proposed method uses a two-encoder architecture, with an EfficientNetV2 backbone and an adapted ViT encoder, to extract tumor regions in BUS images. The self-attention (SA) mechanism in the Transformer encoder captures a wide range of high-level and complex features, while the EfficientNetV2 encoder preserves local information in the image. To fuse the extracted features, a Channel Attention Fusion (CAF) module is introduced. The CAF module selectively emphasizes important features from both encoders, improving the integration of high-level and local information. A decoder then reconstructs the resulting feature maps into segmentation maps. Finally, our method classifies the segmented tumor regions as benign or malignant using a simple and efficient classifier based on MLP-Mixer, which, to the best of our knowledge, is applied here for the first time to lesion classification in BUS images. Experimental results show that our framework outperforms recent works, achieving a Dice coefficient of 83.42% for segmentation and an accuracy of 86% for classification.
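The segmentation result above is reported as a Dice coefficient. As a minimal sketch of how that metric is commonly computed for binary masks, the following uses the standard definition Dice = 2|P ∩ T| / (|P| + |T|). The function name `dice_coefficient`, the nested-list mask format, and the empty-mask convention are assumptions made for illustration; the abstract does not describe the authors' evaluation code.

```python
def dice_coefficient(pred, target):
    """Dice = 2|P ∩ T| / (|P| + |T|) for two binary masks given as nested lists.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Flatten both masks and coerce every pixel to 0 or 1.
    flat_pred = [int(bool(v)) for row in pred for v in row]
    flat_target = [int(bool(v)) for row in target for v in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must contain the same number of pixels")

    # Pixels labeled as tumor in both the prediction and the ground truth.
    intersection = sum(p & t for p, t in zip(flat_pred, flat_target))
    total = sum(flat_pred) + sum(flat_target)

    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Toy 2x3 masks: 2 overlapping tumor pixels, 3 predicted, 3 in ground truth.
pred = [[0, 1, 1],
        [0, 1, 0]]
target = [[0, 1, 0],
          [0, 1, 1]]
print(round(dice_coefficient(pred, target), 4))  # 0.6667
```

A score of 1.0 means the predicted and ground-truth tumor regions coincide exactly, and 0.0 means they do not overlap at all.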