Diagnosis of breast cancer molecular subtypes using machine learning models on unimodal and multimodal datasets

被引:2
|
作者
Rani, Samta [1 ]
Ahmad, Tanvir [1 ]
Masood, Sarfaraz [1 ]
Saxena, Chandni [2 ]
机构
[1] Jamia Millia Islamia, Dept Comp Engn, New Delhi, India
[2] Chinese Univ Hong Kong, Sha Tin, Hong Kong, Peoples R China
关键词
Machine learning; Unimodal data; Multimodal data; Breast cancer molecular subtypes; Deep neural network;
D O I
10.1007/s00521-023-09005-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Breast cancer is a significant global health concern, with millions of cases and deaths each year. Accurate diagnosis is critical for timely treatment and medication. Machine learning techniques have shown promising results in detecting breast cancer. Previous studies have primarily used single-modality data for breast cancer diagnosis. Hence, this work aims to mobilize the benefits of multimodal data over unimodality samples. This study proposes a custom deep learning-based model pipeline that works over this multimodal data. This work has been separated into three phases. Phase 1 and Phase 2 under the unimodal category examine gene expression data and histopathological images separately. The Cancer Genome Atlas makes these datasets available. In Phase 3, the proposed pipeline operates on both data types' samples for each patient in the multimodal category. This study investigates how data pre-processing (cleaning, transformation, reduction) and cascaded filtering affect model performance. Precision, recall, f1-score, and accuracy assessed the models, whereas L2 regularization, exponentially weighted moving average, and transfer learning minimized over-fitting. A custom deep neural network and support vector machine obtained 86% accuracy in Phase 1, whereas the VGG16 model reached 80.21% accuracy in Phase 2. In Phase 3, the curated multimodal dataset was applied to a custom deep learning pipeline (VGG16 backbone with hyper-tuned machine learning models as head classifiers) to achieve 94% accuracy, demonstrating the importance of multimodal data over unimodal in breast cancer subtype classification. These findings highlight the importance of multimodal data for breast cancer diagnosis and subtype prediction.
引用
收藏
页码:24109 / 24121
页数:13
相关论文
共 50 条
  • [1] Machine learning to improve breast cancer diagnosis by multimodal ultrasound
    Sultan, Laith R.
    Schultz, Susan M.
    Cary, Theodore W.
    Sehgal, Chandra M.
    2018 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2018,
  • [2] Prediction of breast cancer using machine learning algorithms on different datasets
    Yavuz, Omer Cagri
    Calp, M. Hanefi
    Erkengel, Hazel Ceren
    INGENIERIA SOLIDARIA, 2023, 19 (01):
  • [3] Using Machine Learning Algorithms for Breast Cancer Diagnosis
    El-Lamey, Mazen Mobtasem
    Eid, Mohab Mohammed
    Gamal, Muhammad
    Bishady, Nour-Elhoda Mohamed
    Mohamed, Ali Wagdy
    INTERNATIONAL JOURNAL OF APPLIED METAHEURISTIC COMPUTING, 2021, 12 (04) : 117 - 154
  • [4] Breast Cancer Prediction using Machine Learning Models
    Iparraguirre-Villanueva, Orlando
    Epifania-Huerta, Andres
    Torres-Ceclen, Carmen
    Ruiz-Alvarado, John
    Cabanillas-Carbonell, Michael
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 610 - 620
  • [5] Molecular Classification Models for Triple Negative Breast Cancer Subtype Using Machine Learning
    Bissanum, Rassanee
    Chaichulee, Sitthichok
    Kamolphiwong, Rawikant
    Navakanitworakul, Raphatphorn
    Kanokwiroon, Kanyanatt
    JOURNAL OF PERSONALIZED MEDICINE, 2021, 11 (09):
  • [6] The role of explainable AI in enhancing breast cancer diagnosis using machine learning and deep learning models
    Zulfikar Ali Ansari
    Manish Madhava Tripathi
    Rafeeq Ahmed
    Discover Artificial Intelligence, 5 (1):
  • [7] Diagnosis of Breast Cancer on Imbalanced Dataset Using Various Sampling Techniques and Machine Learning Models
    Gupta, Ruchita
    Bhargava, Rupal
    Jayabalan, Manoj
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 162 - 167
  • [8] Using Machine Learning Methods in Early Diagnosis of Breast Cancer
    Erkal, Begum
    Ayyildiz, Tulin Ercelebi
    TIP TEKNOLOJILERI KONGRESI (TIPTEKNO'21), 2021,
  • [9] Using machine learning to identify Parkinson’s disease severity subtypes with multimodal data
    Hwayoung Park
    Changhong Youm
    Sang-Myung Cheon
    Bohyun Kim
    Hyejin Choi
    Juseon Hwang
    Minsoo Kim
    Journal of NeuroEngineering and Rehabilitation, 22 (1)
  • [10] Identification and exploration of the pyroptosis-related molecular subtypes of breast cancer by bioinformatics and machine learning
    Zhang, Li
    Chu, Xiu-Feng
    Xu, Jing-Wei
    Yao, Xue-Yuan
    Zhang, Hong-Qiao
    Guo, Yan-Wei
    AMERICAN JOURNAL OF TRANSLATIONAL RESEARCH, 2022, 14 (09): : 6521 - 6535