Epidemiological breast cancer prediction by country: A novel machine learning approach

Cited by: 1
Authors
El Haji, Hasna [1]
Sbihi, Nada [1]
Guermah, Bassma [1]
Souadka, Amine [2]
Ghogho, Mounir [1]
Affiliations
[1] Int Univ Rabat, TICLab, Rabat, Morocco
[2] Mohammed V Univ, Natl Inst Oncol, Surg Oncol Dept, Rabat, Morocco
Source
PLOS ONE | 2024 / Vol. 19 / Issue 08
Keywords
DOSE-RESPONSE METAANALYSIS; TAIWANESE WOMEN; SMARTPHONE USE; RISK-FACTORS; ASSOCIATION; CONSUMPTION; DISEASE; IRAN;
DOI
10.1371/journal.pone.0308905
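Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract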
Breast cancer remains a significant contributor to cancer-related deaths among women globally. In this study, we examine the correlation between breast cancer incidence rates and newly identified risk factors. We also use machine learning models to predict breast cancer incidence at the country level. Following an extensive review of the available literature, we identified a range of recently studied risk factors associated with breast cancer. We then gathered data on these factors and on breast cancer incidence rates from numerous online sources covering 151 countries. To evaluate the relationship between these factors and breast cancer incidence, we assessed the normality of the data and conducted Spearman's correlation test. Furthermore, we refined six regression models to forecast future breast cancer incidence rates. Our findings indicate that breast cancer incidence is most positively correlated with the average age of women in a country, as well as with factors such as meat consumption, CO2 emissions, depression, sugar consumption, tobacco use, milk intake, mobile cells, alcohol consumption, pesticides, and oral contraceptive use. As for prediction, the CatBoost Regressor predicted future breast cancer incidence with an R-squared value of 0.84 ± 0.03. An increased incidence of breast cancer is mainly associated with dietary habits and lifestyle. Our findings and recommendations can serve as a baseline for developing educational programs intended to raise awareness among women in countries with heightened risk.
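The abstract names Spearman's correlation test as the measure of association between each risk factor and incidence. As a rough illustration of what that statistic computes, here is a minimal standard-library sketch: it ranks both variables (averaging tied ranks) and takes the Pearson correlation of the ranks. The function names and the six data points are hypothetical and chosen only for illustration; they are not the study's code or data.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation applied to the ranks of x and y."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical country-level values (illustrative only, not from the study).
mean_age_women = [18.0, 22.5, 27.0, 31.5, 38.0, 42.5]
incidence_per_100k = [25.0, 24.0, 40.0, 55.0, 70.0, 90.0]

rho = spearman_rho(mean_age_women, incidence_per_100k)
print(f"Spearman's rho = {rho:.3f}")
```

Because the statistic depends only on ranks, it does not require the normally distributed data that Pearson-based inference assumes, which is presumably why a normality assessment precedes the choice of test.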
Pages: 20