Comparative Analysis of Deep Learning Models for Breast Cancer Classification on Multimodal Data

Cited by: 0
Authors
Hussain, Sadam [1]
Ali, Mansoor [1]
Ali Pirzado, Farman [1]
Ahmed, Masroor [1]
Gerardo Tamez-Pena, Jose [2]
Affiliations
[1] Tecnol Monterrey, Sch Engn & Sci, Monterrey, Nuevo Leon, Mexico
[2] Tecnol Monterrey, Sch Med & Hlth Sci, Monterrey, Nuevo Leon, Mexico
Source
PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON VISION-LANGUAGE MODELS FOR BIOMEDICAL APPLICATIONS, VLM4BIO 2024 | 2024
Keywords
Breast Cancer; Feature Fusion; Multi-modal Classification; Deep Learning; Vision Transformer; COMPUTER-AIDED DETECTION; MAMMOGRAMS; DIAGNOSIS
DOI
10.1145/3689096.3689462
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Rising breast cancer incidence and mortality represent a significant global challenge for women. Deep learning has demonstrated superior diagnostic performance in breast cancer classification compared to human experts. However, most deep learning methods have relied on unimodal features, potentially limiting the performance of diagnostic models. Additionally, most studies conducted so far have used a single view of digital mammograms, which significantly reduces model performance because of the limited overall perspective and generalizability. To address these limitations, we collected a multiview, multimodal dataset comprising four-view digital mammograms (two craniocaudal (CC) and two mediolateral oblique (MLO) views, one of each per breast) and textual data extracted from radiological reports. We propose a multimodal deep learning architecture for breast cancer classification that uses images (digital mammograms) and textual data (radiological reports) from our new in-house dataset. In addition, various augmentation techniques are applied to both imaging and textual data to increase the training data size. We explored the performance of six state-of-the-art (SOTA) deep learning architectures as imaging feature extractors: VGG16, VGG19, ResNet34, MobileNetV3, EfficientNetB7, and a vision transformer (ViT). For textual feature extraction, we employed an artificial neural network (ANN). The features were then fused using early fusion and late fusion strategies, and the fused imaging and textual features were fed into an ANN classifier for breast cancer classification. We evaluated combinations of the various feature extractors with an ANN classifier and found that VGG19 with ANN achieved the highest accuracy, 0.951. In terms of precision, the VGG19 and ANN combination again surpassed the other SOTA CNN and attention-based architectures, with a score of 0.95. The best sensitivity, 0.893, was recorded by VGG16+ANN, followed by VGG19+ANN with 0.884. The highest F1 score, 0.922, was achieved by VGG19+ANN. VGG16+ANN achieved the best area under the curve (AUC) score, 0.929, closely followed by VGG19+ANN with 0.915.
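The abstract names early and late fusion and reports accuracy, precision, sensitivity, and F1, but it does not say how the fusion was implemented. The sketch below is an illustration only, under two stated assumptions: early fusion is taken to mean concatenating the imaging and textual feature vectors before classification, and late fusion is taken to mean a weighted average of per-modality probabilities. All function names, the single-unit `linear_head` stand-in for an ANN classifier, and the toy values are hypothetical and not taken from the paper.

```python
import math

def early_fusion(image_features, text_features):
    """Early (feature-level) fusion, assumed here to be plain concatenation."""
    return list(image_features) + list(text_features)

def linear_head(features, weights, bias):
    """Hypothetical stand-in for a trained ANN classifier: one sigmoid unit."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def late_fusion(image_prob, text_prob, image_weight=0.5):
    """Late (decision-level) fusion, assumed here to be a weighted average
    of the probabilities produced separately for each modality."""
    return image_weight * image_prob + (1.0 - image_weight) * text_prob

def binary_metrics(y_true, y_pred):
    """Accuracy, precision, sensitivity (recall), and F1 for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "sensitivity": sensitivity, "f1": f1}

# Toy example with made-up numbers (not data from the paper).
fused = early_fusion([0.2, 0.7], [1.0])               # 3-dimensional fused vector
p_early = linear_head(fused, [0.0, 0.0, 0.0], 0.0)    # untrained head, all-zero weights
p_late = late_fusion(image_prob=0.8, text_prob=0.4)   # equal-weight average
scores = binary_metrics([1, 1, 0, 0, 1], [1, 0, 0, 0, 1])
```

In a real pipeline the feature vectors would come from a pretrained image backbone and a text encoder, and the classifier weights would be learned; the point of the sketch is only to show where the two fusion strategies combine information and how the reported metrics follow from confusion-matrix counts.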
Pages: 31-39
Page count: 9