Breast Ultrasound Image BI-RADS Classification Based on Vision Transformer

被引：0

作者：

Wei, Yanbo ^{[1
]}

Ye, Junbo ^{[2
]}

Li, Xiaofeng ^{[3
]}

Zhao, Yuanyuan ^{[4
]}

Wang, Yanwei ^{[2
]}

机构：

[1] Harbin Inst Petr, Sch Intelligent Engn, Harbin 150027, Peoples R China

[2] Heilongjiang Univ Sci & Technol, Harbin 150022, Peoples R China

[3] Heilongjiang Int Univ, Dept Informat Engn, Harbin 150025, Peoples R China

[4] Heilongjiang Prov Hosp, Harbin 150001, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MULTIPHYSICS | 2024年 / 18卷 / 02期

基金：

黑龙江省自然科学基金;

关键词：

Ultrasound Imaging; Breast cancer; Vision Transformer; Convolutional Neural Network (CNN); Image Classification;

D O I：

暂无

中图分类号：

TH [机械、仪表工业];

学科分类号：

0802 ;

摘要：

The most common malignancy among women is breast cancer. Medical ultrasound images are a common tool for detecting breast cancer. In medical ultrasound image classification, Convolutional neural networks(CNNs) have demonstrated great success. However, in most studies on convolutional neural networks categorizes breast tumors into benign and malignant types. Additionally, as convolutional neural networks have a limited receptive field, they are unable to acquire global information. In order to resolve this issue, we explored the feasibility of using Vision Transformer (ViT) in breast ultrasound image BI-RADS classification tasks through transfer learning. We collected publicly available breast ultrasound datasets and enhanced the quality of ultrasound images using the CLAHE algorithm. Through a transfer learning strategy, we trained the ViT model. Using an independent test set, we compared the classification results of ViT with CNNs serving as the baseline model. Breast cancer were categorized based on the BI-RADS criteria, and the results were evaluated using precision, accuracy, and F1 score. According to the experimental analysis results,ViT's transfer learning model produced 94.57% accuracy, 94.11% precision, and 94.29% F1 scores in the classification of breast ultrasound images, respectively, in the breast ultrasound image's BI-RADS classification. The classification performance of the ViT model outperformed the CNN models, including DenseNet201, Xception, MobileNet, and GoogLeNet. The study showed that ViT can be effectively utilized in classifying breast ultrasound images according to the BI-RADS system. The ViT model performed well in breast cancer classification and showed potential as an alternative to CNN.

引用

页码：32 / 39

页数：8

共 18 条

[1]

Al-Dhabyani W, 2019, INT J ADV COMPUT SC, V10, P618

[2] Dataset of breast ultrasound images [J].

Al-Dhabyani, Walid ;

Gomaa, Mohammed ;

Khaled, Hussien ;

Fahmy, Aly .

DATA IN BRIEF, 2020, 28

[3]

[Anonymous], 2017, Mobilenets: Efficient convolutional neural networks for mobile vision applications

[4] Automatic semantic segmentation of breast tumors in ultrasound images based on combining fuzzy logic and deep learning-A feasibility study [J].

Badawy, Samir M. ;

Mohamed, Abd El-Naser A. ;

Hefnawy, Alaa A. ;

Zidan, Hassan E. ;

GadAllah, Mohammed T. ;

El-Banby, Ghada M. .

PLOS ONE, 2021, 16 (05)

[5] Xception: Deep Learning with Depthwise Separable Convolutions [J].

Chollet, Francois .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807

[6] Cancer statistics for the year 2020: An overview [J].

Ferlay, Jacques ;

Colombet, Murielle ;

Soerjomataram, Isabelle ;

Parkin, Donald M. ;

Pineros, Marion ;

Znaor, Ariana ;

Bray, Freddie .

INTERNATIONAL JOURNAL OF CANCER, 2021, 149 (04) :778-789

[7] Vision Transformers for Classification of Breast Ultrasound Images [J].

Gheflati, Behnaz ;

Rivaz, Hassan .

2022 44TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2022, :480-483

[8] UNETR: Transformers for 3D Medical Image Segmentation [J].

Hatamizadeh, Ali ;

Tang, Yucheng ;

Nath, Vishwesh ;

Yang, Dong ;

Myronenko, Andriy ;

Landman, Bennett ;

Roth, Holger R. ;

Xu, Daguang .

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :1748-1758

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10] Densely Connected Convolutional Networks [J].

Huang, Gao ;

Liu, Zhuang ;

van der Maaten, Laurens ;

Weinberger, Kilian Q. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269

← 1 2 →