D-TrAttUnet: Toward hybrid CNN-transformer architecture for generic and subtle segmentation in medical images

被引:4
作者
Bougourzi F. [1 ]
Dornaika F. [3 ,4 ]
Distante C. [2 ]
Taleb-Ahmed A. [5 ]
机构
[1] Junia, UMR 8520, CNRS, Centrale Lille, University of Polytechnique Hauts-de-France, Lille
[2] Institute of Applied Sciences and Intelligent Systems, National Research Council of Italy, Lecce
[3] University of the Basque Country UPV/EHU, San Sebastian
[4] IKERBASQUE, Basque Foundation for Science, Bilbao
[5] Université Polytechnique Hauts-de-France, Université de Lille, CNRS, Valenciennes, Hauts-de-France
关键词
Bone Metastasis; Convolutional Neural Network; Covid-19; Deep learning; Segmentation; Transformer; Unet;
D O I
10.1016/j.compbiomed.2024.108590
中图分类号
学科分类号
摘要
Over the past two decades, machine analysis of medical imaging has advanced rapidly, opening up significant potential for several important medical applications. As complicated diseases increase and the number of cases rises, the role of machine-based imaging analysis has become indispensable. It serves as both a tool and an assistant to medical experts, providing valuable insights and guidance. A particularly challenging task in this area is lesion segmentation, a task that is challenging even for experienced radiologists. The complexity of this task highlights the urgent need for robust machine learning approaches to support medical staff. In response, we present our novel solution: the D-TrAttUnet architecture. This framework is based on the observation that different diseases often target specific organs. Our architecture includes an encoder–decoder structure with a composite Transformer-CNN encoder and dual decoders. The encoder includes two paths: the Transformer path and the Encoders Fusion Module path. The Dual-Decoder configuration uses two identical decoders, each with attention gates. This allows the model to simultaneously segment lesions and organs and integrate their segmentation losses. To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks. We also investigated the adaptability of the model by testing it without the second decoder in the segmentation of glands and nuclei. The results confirmed the superiority of our approach, especially in Covid-19 infections and the segmentation of bone metastases. In addition, the hybrid encoder showed exceptional performance in the segmentation of glands and nuclei, solidifying its role in modern medical image analysis. © 2024 The Author(s)
引用
收藏
相关论文
共 55 条
[1]  
Hambleton I.R., Caixeta R., Jeyaseelan S.M., Luciani S., Hennis A.J., The rising burden of non-communicable diseases in the americas and the impact of population aging: a secondary analysis of available data, Lancet Reg Health-Am., 21, (2023)
[2]  
Baker R.E., Mahmud A.S., Miller I.F., Rajeev M., Rasambainarivo F., Rice B.L., Takahashi S., Tatem A.J., Wagner C.E., Wang L.F., Et al., Infectious disease in an era of global change, Nat. Rev. Microbiol., 20, pp. 193-205, (2022)
[3]  
Shamshad F., Khan S., Zamir S.W., Khan M.H., Hayat M., Khan F.S., Fu H., Transformers in medical imaging: A survey, Med. Image Anal., (2023)
[4]  
Sirinukunwattana K., Raza S.E.A., Tsang Y.W., Snead D.R., Cree I.A., Rajpoot N.M., Locality sensitive deep learning for detection and classification of nuclei in routine colon cancer histology images, IEEE Trans. Med. Imaging, 35, pp. 1196-1206, (2016)
[5]  
Lei T., Zhang D., Du X., Wang X., Wan Y., Nandi A.K., Semi-supervised medical image segmentation using adversarial consistency learning and dynamic convolution network, IEEE Trans. Med. Imaging, (2022)
[6]  
Chaitanya K., Erdil E., Karani N., Konukoglu E., Contrastive learning of global and local features for medical image segmentation with limited annotations, Adv. Neural Inf. Process. Syst., 33, pp. 12546-12558, (2020)
[7]  
Garcea F., Serra A., Lamberti F., Morra L., Data augmentation for medical imaging: A systematic literature review, Comput. Biol. Med., 152, (2023)
[8]  
Wang H., Xie S., Lin L., Iwamoto Y., Han X.H., Chen Y.W., Tong R., Mixed transformer u-net for medical image segmentation, ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 2390-2394, (2022)
[9]  
Petit O., Thome N., Rambour C., Themyr L., Collins T., Soler L., U-net transformer: Self and cross attention for medical image segmentation, Machine Learning in Medical Imaging, pp. 267-276, (2021)
[10]  
Wang H., Cao P., Wang J., Zaiane O.R., Uctransnet: rethinking the skip connections in u-net from a channel-wise perspective with transformer, pp. 2441-2449, (2022)