Diagnosis of breast cancer molecular subtypes using machine learning models on unimodal and multimodal datasets

Cited by: 2
Authors
Rani, Samta [1]
Ahmad, Tanvir [1]
Masood, Sarfaraz [1]
Saxena, Chandni [2]
Affiliations
[1] Jamia Millia Islamia, Dept Comp Engn, New Delhi, India
[2] Chinese Univ Hong Kong, Sha Tin, Hong Kong, Peoples R China
Keywords
Machine learning; Unimodal data; Multimodal data; Breast cancer molecular subtypes; Deep neural network;
DOI
10.1007/s00521-023-09005-x
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Breast cancer is a significant global health concern, with millions of cases and deaths each year. Accurate diagnosis is critical for timely treatment and medication. Machine learning techniques have shown promising results in detecting breast cancer, but previous studies have primarily used single-modality data for diagnosis. This work therefore aims to exploit the benefits of multimodal data over unimodal samples and proposes a custom deep learning-based model pipeline that operates on multimodal data. The work is divided into three phases. Phase 1 and Phase 2, under the unimodal category, examine gene expression data and histopathological images separately. Both datasets are available from The Cancer Genome Atlas. In Phase 3, under the multimodal category, the proposed pipeline operates on samples of both data types for each patient. The study investigates how data pre-processing (cleaning, transformation, reduction) and cascaded filtering affect model performance. The models were assessed using precision, recall, F1-score, and accuracy, while L2 regularization, exponentially weighted moving average, and transfer learning were used to reduce overfitting. A custom deep neural network and support vector machine obtained 86% accuracy in Phase 1, whereas the VGG16 model reached 80.21% accuracy in Phase 2. In Phase 3, the curated multimodal dataset was applied to a custom deep learning pipeline (VGG16 backbone with hyper-tuned machine learning models as head classifiers) to achieve 94% accuracy. These findings highlight the importance of multimodal over unimodal data for breast cancer diagnosis and subtype prediction.
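The abstract names precision, recall, F1-score, and accuracy as the evaluation metrics but does not say how they were averaged across subtypes. As a minimal sketch, assuming macro-averaging over classes, the computation could look like the following. The function name `macro_metrics` and the subtype labels in the example are illustrative assumptions, not taken from the paper.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Assumption: macro-averaging (an unweighted mean over classes);
    the abstract does not state which averaging scheme was used.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1          # correct prediction for class t
        else:
            fp[p] += 1          # class p was predicted wrongly
            fn[t] += 1          # class t was missed

    def safe_div(a, b):
        # Avoid division by zero for classes with no predictions/samples.
        return a / b if b else 0.0

    precisions = [safe_div(tp[c], tp[c] + fp[c]) for c in labels]
    recalls = [safe_div(tp[c], tp[c] + fn[c]) for c in labels]
    f1s = [safe_div(2 * p * r, p + r) for p, r in zip(precisions, recalls)]
    n = len(labels)
    return {
        "accuracy": safe_div(sum(tp.values()), len(y_true)),
        "precision": safe_div(sum(precisions), n),
        "recall": safe_div(sum(recalls), n),
        "f1": safe_div(sum(f1s), n),
    }


# Illustrative labels only (hypothetical subtype names and predictions).
y_true = ["LumA", "LumA", "LumB", "Basal", "Her2", "Basal"]
y_pred = ["LumA", "LumB", "LumB", "Basal", "Her2", "LumA"]
scores = macro_metrics(y_true, y_pred)
print(scores)
```

With class-imbalanced subtype data, macro-averaged scores can differ noticeably from accuracy, which is one reason to report all four figures together.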
Pages: 24109-24121
Number of pages: 13