Machine Learning as a Tool for Early Detection: A Focus on Late-Stage Colorectal Cancer across Socioeconomic Spectrums

被引:1
作者
Galadima, Hadiza [1 ]
Anson-Dwamena, Rexford [1 ]
Johnson, Ashley [1 ]
Bello, Ghalib [2 ]
Adunlin, Georges [3 ]
Blando, James [1 ]
机构
[1] Old Dominion Univ, Sch Community & Environm Hlth, Norfolk, VA 23529 USA
[2] Icahn Sch Med Mt Sinai, Dept Environm Med & Publ Hlth, New York, NY 10029 USA
[3] Samford Univ, Dept Pharmaceut Social & Adm Sci, Birmingham, AL 35229 USA
关键词
socioeconomic disparity; cancer care; predictive modeling; machine learning; AI; social determinants of health in oncology; spatial analysis; precision care; ARTIFICIAL-INTELLIGENCE; US; CLASSIFICATION; OPPORTUNITIES; DETERMINANTS; PREVENTION; PREDICTION; MORTALITY; SURVIVAL; SOCIETY;
D O I
10.3390/cancers16030540
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Simple Summary This research explores the potential of machine learning (ML) to predict late-stage colorectal cancer (CRC) diagnoses. The focus is on understanding how socioeconomic and regional factors affect cancer care, particularly in detecting CRC at an advanced stage. We aim to merge data on social determinants of health with individual demographics to uncover patterns indicating higher CRC risk. We compared various ML models, such as decision trees, random forest, and gradient boosting to find the most effective tool for this task. The goal is to utilize artificial intelligence (AI) for early, more accurate CRC detection, which can lead to better treatment outcomes. This study promises to significantly contribute to cancer research, potentially leading to more personalized and efficient healthcare strategies that could ultimately save lives.Abstract Purpose: To assess the efficacy of various machine learning (ML) algorithms in predicting late-stage colorectal cancer (CRC) diagnoses against the backdrop of socio-economic and regional healthcare disparities. Methods: An innovative theoretical framework was developed to integrate individual- and census tract-level social determinants of health (SDOH) with sociodemographic factors. A comparative analysis of the ML models was conducted using key performance metrics such as AUC-ROC to evaluate their predictive accuracy. Spatio-temporal analysis was used to identify disparities in late-stage CRC diagnosis probabilities. Results: Gradient boosting emerged as the superior model, with the top predictors for late-stage CRC diagnosis being anatomic site, year of diagnosis, age, proximity to superfund sites, and primary payer. Spatio-temporal clusters highlighted geographic areas with a statistically significant high probability of late-stage diagnoses, emphasizing the need for targeted healthcare interventions. Conclusions: This research underlines the potential of ML in enhancing the prognostic predictions in oncology, particularly in CRC. The gradient boosting model, with its robust performance, holds promise for deployment in healthcare systems to aid early detection and formulate localized cancer prevention strategies. The study's methodology demonstrates a significant step toward utilizing AI in public health to mitigate disparities and improve cancer care outcomes.
引用
收藏
页数:21
相关论文
共 97 条
  • [51] Screening and surveillance for the early detection of colorectal cancer and adenomatous polyps, 2008: A joint guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology
    Levin, Bernard
    Lieberman, David A.
    McFarland, Beth
    Andrews, Kimberly S.
    Brooks, Durado
    Bond, John
    Dash, Chiranjeev
    Giardiello, Francis M.
    Glick, Seth
    Johnson, David
    Johnson, C. Daniel
    Levin, Theodore R.
    Pickhardt, Perry J.
    Rex, Douglas K.
    Smith, Robert A.
    Thorson, Alan
    Winawer, Sidney J.
    [J]. GASTROENTEROLOGY, 2008, 134 (05) : 1570 - 1595
  • [52] Effects of Organized Colorectal Cancer Screening on Cancer Incidence and Mortality in a Large Community-Based Population
    Levin, Theodore R.
    Corley, Douglas A.
    Jensen, Christopher D.
    Schottinger, Joanne E.
    Quinn, Virginia P.
    Zauber, Ann G.
    Lee, Jeffrey K.
    Zhao, Wei K.
    Udaltsova, Natalia
    Ghai, Nirupa R.
    Lee, Alexander T.
    Quesenberry, Charles P.
    Fireman, Bruce H.
    Doubeni, Chyke A.
    [J]. GASTROENTEROLOGY, 2018, 155 (05) : 1383 - +
  • [53] Early symptoms and sensations as predictors of lung cancer: a machine learning multivariate model
    Levitsky, Adrian
    Pernemalm, Maria
    Bernhardson, Britt-Marie
    Forshed, Jenny
    Kolbeck, Karl
    Olin, Maria
    Henriksson, Roger
    Lehtio, Janne
    Tishelman, Carol
    Eriksson, Lars E.
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [54] Prediction of lung cancer risk in Chinese population with genetic-environment factor using extreme gradient boosting
    Li, Yutao
    Zou, Zixiu
    Gao, Zhunyi
    Wang, Yi
    Xiao, Man
    Xu, Chang
    Jiang, Gengxi
    Wang, Haijian
    Jin, Li
    Wang, Jiucun
    Wang, Huai Zhou
    Guo, Shicheng
    Wu, Junjie
    [J]. CANCER MEDICINE, 2022, 11 (23): : 4469 - 4478
  • [55] Updated Methodology for Projecting US- and State-Level Cancer Counts for the Current Calendar Year: Part I: Spatio-temporal Modeling for Cancer Incidence
    Liu, Benmei
    Zhu, Li
    Zou, Joe
    Chen, Huann-Sheng
    Miller, Kimberly D.
    Jemal, Ahmedin
    Siegel, Rebecca L.
    Feuer, Eric J.
    [J]. CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2021, 30 (09) : 1620 - 1626
  • [56] Machine learning for the prediction of bone metastasis in patients with newly diagnosed thyroid cancer
    Liu, Wen-Cai
    Li, Zhi-Qiang
    Luo, Zhi-Wen
    Liao, Wei-Jie
    Liu, Zhi-Li
    Liu, Jia-Ming
    [J]. CANCER MEDICINE, 2021, 10 (08): : 2802 - 2811
  • [57] Prediction of lung cancer patient survival via supervised machine learning classification techniques
    Lynch, Chip M.
    Abdollahi, Behnaz
    Fuqua, Joshua D.
    de Carlo, Alexandra R.
    Bartholomai, James A.
    Balgemann, Rayeanne N.
    van Berkel, Victor H.
    Frieboes, Hermann B.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 108 : 1 - 8
  • [58] Validation of a Machine Learning Algorithm to Predict 180-Day Mortality for Outpatients With Cancer
    Manz, Christopher R.
    Chen, Jinbo
    Liu, Manqing
    Chivers, Corey
    Regli, Susan Harkness
    Braun, Jennifer
    Draugelis, Michael
    Hanson, C. William
    Shulman, Lawrence N.
    Schuchter, Lynn M.
    O'Connor, Nina
    Bekelman, Justin E.
    Patel, Mitesh S.
    Parikh, Ravi B.
    [J]. JAMA ONCOLOGY, 2020, 6 (11) : 1723 - 1730
  • [59] Using medicare data to estimate the number of cases missed by a cancer registry - A 3-source capture-recapture model
    McClish, D
    Penberthy, L
    [J]. MEDICAL CARE, 2004, 42 (11) : 1111 - 1116
  • [60] McClure Laura A, 2016, J Registry Manag, V43, P16