A potential predictive model based on machine learning and CPD parameters in elderly patients with aplastic anemia and myelodysplastic neoplasms

Cited by: 0
Authors
Qi, Yuxiang [1, 2]
Liu, Xu [2, 3]
Ding, Zhishan [1 ]
Yu, Ying [2 ]
Zhuang, Zhenchao [4 ]
Affiliations
[1] Zhejiang Chinese Med Univ, Sch Med Technol & Informat Engn, Hangzhou, Peoples R China
[2] Zhejiang Chinese Med Univ, Zhejiang Prov Hosp Chinese Med, Dept Lab Med, Affiliated Hosp 1, Hangzhou, Peoples R China
[3] Zhejiang Chinese Med Univ, Sch Clin Med 1, Hangzhou, Peoples R China
[4] Adicon Clin Labs, Hangzhou 310023, Zhejiang, Peoples R China
Keywords
Cell population data; Parameters; Aplastic anemia; Myelodysplastic neoplasms; Machine learning
DOI
10.1186/s12911-024-02781-z
Chinese Library Classification number
R-058
Abstract
Background: Aplastic anemia (AA) and myelodysplastic neoplasms (MDS) have similar peripheral blood manifestations, and both are clinically characterized by a reduced hematological triad, which makes the two diseases difficult to distinguish. We therefore used machine learning methods to develop and validate an algorithm that uses cell population data (CPD) parameters to diagnose AA and MDS.

Methods: In this retrospective study, CPD parameters were obtained from the Beckman Coulter DxH800 analyzer for 160 individuals diagnosed with AA or MDS. The individuals were unselectively assigned to a training cohort (77%) and a testing cohort (23%). An external validation cohort of 86 elderly patients with AA and MDS from two additional centers was also established. Discriminative parameters were examined through univariate analysis, and the most predictive variables were selected using least absolute shrinkage and selection operator (LASSO) regression. Six machine learning algorithms were compared for their performance in predicting AA and MDS. Areas under the curve (AUCs), calibration curves, decision curve analysis (DCA), and Shapley additive explanations (SHAP) plots were used to interpret the model and to assess its predictive accuracy, clinical utility, and stability.

Results: After comparative evaluation of the models, logistic regression emerged as the most suitable machine learning model for predicting the probability of AA and MDS. It used five principal variables (age, MNVLY, SDVLY, MNLALSEGC, and MNCEGC). The best model achieved an AUC of 0.791 in the testing cohort, with high specificity (0.850) and positive predictive value (0.818). The calibration curve indicated excellent agreement between actual and predicted probabilities. The DCA curve further supported the clinical utility of the model in guiding treatment decisions. The model's performance was consistent in the external validation group, with an AUC of 0.719.

Conclusions: We developed a novel model that effectively distinguished between AA and MDS in elderly patients and that has the potential to assist physicians in early diagnosis and appropriate treatment of the elderly.
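The abstract reports three metrics: AUC, specificity, and positive predictive value. As a reading aid, the following is a minimal sketch of how these quantities are conventionally computed from true labels and predicted probabilities. It is not the authors' code. The function names, the 0.5 decision threshold, and the toy data are all assumptions made for illustration, and the abstract does not state which disease was coded as the positive class, so label 1 here is a hypothetical positive class.

```python
from typing import Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def specificity_and_ppv(
    labels: Sequence[int],
    scores: Sequence[float],
    threshold: float = 0.5,  # assumed cut-off, not taken from the study
) -> Tuple[float, float]:
    """Specificity = TN / (TN + FP); PPV = TP / (TP + FP)."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return specificity, ppv


# Toy data invented for this example; these are NOT values from the study.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

auc = roc_auc(labels, scores)                    # 13 of 16 pairs ranked correctly
spec, ppv = specificity_and_ppv(labels, scores)  # TP=3, FP=1, TN=3
print(auc, spec, ppv)                            # prints 0.8125 0.75 0.75
```

Unlike AUC, specificity and positive predictive value depend on the chosen threshold, and positive predictive value also depends on the proportion of positive cases in the cohort, so the reported 0.850 and 0.818 apply to the study's own testing cohort and cut-off.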
Pages: 15