Multi-omics-based Machine Learning for the Subtype Classification of Breast Cancer

被引:0
|
作者
Hassan, Asmaa M. [1 ]
Naeem, Safaa M. [1 ]
Eldosoky, Mohamed A. A. [1 ]
Mabrouk, Mai S. [2 ]
机构
[1] Helwan Univ, Dept Biomed Engn, Cairo, Egypt
[2] Nile Univ, Sch Informat Technol & Comp Sci ITCS, Sheikh Zayed City, Egypt
关键词
Multi-omics; Machine learning; Predictive modeling; Systems biology;
D O I
10.1007/s13369-024-09341-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cancer is a complicated disease that produces deregulatory changes in cellular activities (such as proteins). Data from these levels must be integrated into multi-omics analyses to better understand cancer and its progression. Deep learning approaches have recently helped with multi-omics analysis of cancer data. Breast cancer is a prevalent form of cancer among women, resulting from a multitude of clinical, lifestyle, social, and economic factors. The goal of this study was to predict breast cancer using several machine learning methods. We applied the architecture for mono-omics data analysis of the Cancer Genome Atlas Breast Cancer datasets in our analytical investigation. The following classifiers were used: random forest, partial least squares, Naive Bayes, decision trees, neural networks, and Lasso regularization. They were used and evaluated using the area under the curve metric. The random forest classifier and the Lasso regularization classifier achieved the highest area under the curve values of 0.99 each. These areas under the curve values were obtained using the mono-omics data employed in this investigation. The random forest and Lasso regularization classifiers achieved the maximum prediction accuracy, showing that they are appropriate for this problem. For all mono-omics classification models used in this paper, random forest and Lasso regression offer the best results for all metrics (precision, recall, and F1 score). The integration of various risk factors in breast cancer prediction modeling can aid in early diagnosis and treatment, utilizing data collection, storage, and intelligent systems for disease management. The integration of diverse risk factors in breast cancer prediction modeling holds promise for early diagnosis and treatment. Leveraging data collection, storage, and intelligent systems can further enhance disease management strategies, ultimately contributing to improved patient outcomes.
引用
收藏
页码:1339 / 1352
页数:14
相关论文
共 50 条
  • [1] A survey on multi-omics-based cancer diagnosis using machine learning with the potential application in gastrointestinal cancer
    Wang, Suixue
    Wang, Shuling
    Wang, Zhengxia
    FRONTIERS IN MEDICINE, 2023, 9
  • [2] Multi-omics-based approach to colorectal cancer metabolism
    Satoh, Kiyotoshi
    Soga, Tomoyoshi
    CANCER RESEARCH, 2017, 77
  • [3] Pan-cancer classification of multi-omics data based on machine learning models
    Cava, Claudia
    Sabetian, Soudabeh
    Salvatore, Christian
    Castiglioni, Isabella
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
  • [4] Machine Learning-Based Analysis of MR Multiparametric Radiomics for the Subtype Classification of Breast Cancer
    Xie, Tianwen
    Wang, Zhe
    Zhao, Qiufeng
    Bai, Qianming
    Zhou, Xiaoyan
    Gu, Yajia
    Peng, Weijun
    Wang, He
    FRONTIERS IN ONCOLOGY, 2019, 9
  • [5] Multimodal and multi-omics-based deep learning model for screening of optic neuropathy
    Lin, Ye-ting
    Zhou, Qiong
    Tan, Jian
    Tao, Yulin
    HELIYON, 2023, 9 (12)
  • [6] Multi-Omics-Based Discovery of Plant Signaling Molecules
    Luo, Fei
    Yu, Zongjun
    Zhou, Qian
    Huang, Ancheng
    METABOLITES, 2022, 12 (01)
  • [7] moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks
    Choi, Joung Min
    Chae, Heejoon
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [8] moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks
    Joung Min Choi
    Heejoon Chae
    BMC Bioinformatics, 24
  • [9] Multi-omics-based prediction of hybrid performance in canola
    Knoch, Dominic
    Werner, Christian R.
    Meyer, Rhonda C.
    Riewe, David
    Abbadi, Amine
    Luecke, Sophie
    Snowdon, Rod J.
    Altmann, Thomas
    THEORETICAL AND APPLIED GENETICS, 2021, 134 (04) : 1147 - 1165
  • [10] Deep Learning for Integrated Analysis of Breast Cancer Subtype Specific Multi-omics Data
    Rakshit, Somnath
    Saha, Indrajit
    Chakraborty, Subha Shankar
    Plewczyski, Dariusz
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 1917 - 1922