MM-UKAN plus plus : A Novel Kolmogorov-Arnold Network-Based U-Shaped Network for Ultrasound Image Segmentation

被引:0
作者
Zhang, Boheng [1 ,2 ,3 ]
Huang, Haorui [3 ,4 ]
Shen, Yi [1 ,3 ]
Sun, Mingjian [1 ,3 ,5 ,6 ]
机构
[1] Harbin Inst Technol, Dept Control Sci & Engn, Harbin 150001, Heilongjiang, Peoples R China
[2] Pohang Univ Sci & Technol, Dept Convergence IT Engn, Pohang 37673, South Korea
[3] Suzhou Res Inst, Harbin Inst Technol, Suzhou 215000, Jiangsu, Peoples R China
[4] Harbin Inst Technol, Dept Informat Sci & Engn, Weihai 264200, Shandong, Peoples R China
[5] Harbin Inst Technol, Weihai 264200, Shandong, Peoples R China
[6] Shandong Lab Adv Biomat & Med Devices, Weihai 264209, Chandigarh, India
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Training; Transformers; Data mining; Splines (mathematics); Frequency control; Artificial intelligence; Acoustics; Accuracy; Attention mechanism; deep learning; Kolmogorov-Arnold network (KAN); multiscale feature integration; ultrasound (US) image segmentation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Ultrasound (US) imaging is an important and commonly used medical imaging modality. Accurate and fast automatic segmentation of regions of interest (ROIs) in US images is essential for enhancing the efficiency of clinical and robot-assisted diagnosis. However, US images suffer from low contrast, fuzzy boundaries, and significant scale variations in ROIs. Existing convolutional neural network (CNN)-based and transformer-based methods struggle with model efficiency and explainability. To address these challenges, we introduce MM-UKAN++, a novel U-shaped network based on Kolmogorov-Arnold networks (KANs). MM-UKAN++ leverages multilevel KAN layers as the encoder and decoder within the U-network architecture and incorporates an innovative multidimensional attention mechanism to refine skip connections by weighting features from frequency-channel and spatial perspectives. In addition, the network effectively integrates multiscale information, fusing outputs from different scale decoders to generate precise segmentation predictions. MM-UKAN++ achieves higher segmentation accuracy with lower computational cost and outperforms other mainstream methods on several open-source datasets for US image segmentation tasks, including achieving 69.42% IoU, 81.30% Dice, and 3.31 mm HD in the BUSI dataset with 3.17 G floating point of operations (FLOPs) and 9.90 M parameters. The excellent performance on our automatic carotid artery US scanning and diagnostic system further proves the speed and accuracy of MM-UKAN++. Besides, the good performance in other medical image segmentation tasks reveals the promising applications of MM-UKAN++. The code is available on GitHub.
引用
收藏
页码:498 / 514
页数:17
相关论文
共 54 条
[1]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[2]   Screening Breast Ultrasound: Past, Present, and Future [J].
Brem, Rachel F. ;
Lenihan, Megan J. ;
Lieberman, Jennifer ;
Torrente, Jessica .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2015, 204 (02) :234-240
[3]  
Cai PZ, 2024, Arxiv, DOI [arXiv:2310.00289, DOI 10.48550/ARXIV.2310.00289]
[4]  
Cambrin DR, 2024, Arxiv, DOI arXiv:2408.07040
[5]  
Cao H., 2021, P 30 INT C ART NEUR, V982, P437
[6]  
Chen J., 2021, PREPRINT
[7]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[8]   Low order adaptive region growing for lung segmentation on plain chest radiographs [J].
Chondro, Peter ;
Yao, Chih-Yuan ;
Ruan, Shanq-Jang ;
Chien, Li-Chien .
NEUROCOMPUTING, 2018, 275 :1002-1011
[9]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
[10]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929