BUS-Set: A benchmark for quantitative evaluation of breast ultrasound segmentation networks with public datasets

被引:12
作者
Thomas, Cory [1 ]
Byra, Michal [2 ,3 ]
Marti, Robert [4 ]
Yap, Moi Hoon [5 ]
Zwiggelaar, Reyer [1 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth, Wales
[2] Polish Acad Sci, Inst Fundamental Technol Res, Warsaw, Poland
[3] Univ Calif San Diego, Dept Radiol, San Diego, CA USA
[4] Univ Girona, Comp Vis & Robot Inst, Girona, Spain
[5] Manchester Metropolitan Univ, Dept Comp & Math, Manchester, England
关键词
breast segmentation; public datasets; ultrasound; BENIGN; IMAGES;
D O I
10.1002/mp.16287
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeBUS-Set is a reproducible benchmark for breast ultrasound (BUS) lesion segmentation, comprising of publicly available images with the aim of improving future comparisons between machine learning models within the field of BUS. MethodFour publicly available datasets were compiled creating an overall set of 1154 BUS images, from five different scanner types. Full dataset details have been provided, which include clinical labels and detailed annotations. Furthermore, nine state-of-the-art deep learning architectures were selected to form the initial benchmark segmentation result, tested using five-fold cross-validation and MANOVA/ANOVA with Tukey statistical significance test with a threshold of 0.01. Additional evaluation of these architectures was conducted, exploring possible training bias, and lesion size and type effects. ResultsOf the nine state-of-the-art benchmarked architectures, Mask R-CNN obtained the highest overall results, with the following mean metric scores: Dice score of 0.851, intersection over union of 0.786 and pixel accuracy of 0.975. MANOVA/ANOVA and Tukey test results showed Mask R-CNN to be statistically significant better compared to all other benchmarked models with a p-value >0.01. Moreover, Mask R-CNN achieved the highest mean Dice score of 0.839 on an additional 16 image dataset, that contained multiple lesions per image. Further analysis on regions of interest was conducted, assessing Hamming distance, depth-to-width ratio (DWR), circularity, and elongation, which showed that the Mask R-CNN's segmentations maintained the most morphological features with correlation coefficients of 0.888, 0.532, 0.876 for DWR, circularity, and elongation, respectively. Based on the correlation coefficients, statistical test indicated that Mask R-CNN was only significantly different to Sk-U-Net. ConclusionsBUS-Set is a fully reproducible benchmark for BUS lesion segmentation obtained through the use of public datasets and GitHub. Of the state-of-the-art convolution neural network (CNN)-based architectures, Mask R-CNN achieved the highest performance overall, further analysis indicated that a training bias may have occurred due to the lesion size variation in the dataset. All dataset and architecture details are available at GitHub: , which allows for a fully reproducible benchmark.
引用
收藏
页码:3223 / 3243
页数:21
相关论文
共 48 条
[1]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[2]   Going Deep in Medical Image Analysis: Concepts, Methods, Challenges, and Future Directions [J].
Altaf, Fouzia ;
Islam, Syed M. S. ;
Akhtar, Naveed ;
Janjua, Naeem Khalid .
IEEE ACCESS, 2019, 7 :99540-99572
[3]  
Breast Cancer Now, FACTS STAT 2020
[4]  
Breast Cancer UK, 2020, SURV BREAST CANC
[5]   Breast mass segmentation in ultrasound with selective kernel U-Net convolutional neural network [J].
Byra, Michal ;
Jarosik, Piotr ;
Szubert, Aleksandra ;
Galperin, Michael ;
Ojeda-Fournier, Haydee ;
Olson, Linda ;
O'Boyle, Mary ;
Comstock, Christopher ;
Andre, Michael .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 61
[6]  
Cao H., 2023, Lecture notes in computer science:, V3803, P205
[7]  
Chen J., 2021, arXiv, DOI DOI 10.48550/ARXIV.2102.04306
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   Detection and classification the breast tumors using mask R-CNN on sonograms [J].
Chiao, Jui-Ying ;
Chen, Kuan-Yung ;
Liao, Ken Ying-Kai ;
Hsieh, Po-Hsin ;
Zhang, Geoffrey ;
Huang, Tzung-Chi .
MEDICINE, 2019, 98 (19)
[10]   A non-invasive method for focusing ultrasound through the human skull [J].
Clement, GT ;
Hynynen, K .
PHYSICS IN MEDICINE AND BIOLOGY, 2002, 47 (08) :1219-1236