Risk factors associated with severe progression of Parkinson's disease: random forest and logistic regression models

被引:0
作者
Tan, Jie [1 ]
Huang, E. [2 ]
Hao, Yang [1 ]
Wan, Hongping [1 ]
Zhang, Qian [1 ]
机构
[1] Jianghan Univ, Dept Neurol, Hubei Peoples Hosp 3, Wuhan, Peoples R China
[2] Jianghan Univ, Support Ctr, Hubei Peoples Hosp 3, Wuhan, Peoples R China
来源
FRONTIERS IN NEUROLOGY | 2025年 / 16卷
关键词
Parkinson's disease; random forest; logistic regression; risk factors; disease progression; DEPRESSION;
D O I
10.3389/fneur.2025.1550789
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Background and aim Parkinson's disease (PD) is a neurodegenerative disorder with significant variability in disease progression. Identifying clinical and environmental risk factors associated with severe progression is essential for early diagnosis and personalized treatment. This study evaluates the performance of Random Forest (RF) and Logistic Regression (LR) models in forecasting the major risk factors associated with severe PD progression.Methods We performed a retrospective analysis of 378 PD patients (aged 40-75 years) with at 2 years of follow-up. The dataset included patient demographics, clinical features, medication history, comorbidities, and environmental exposures. The data were randomly split into a training group (70%) and a validation group (30%). Both the RF and LR models were trained on the training set, and performance was assessed through accuracy, sensitivity, specificity, and the Area Under the Curve (AUC) derived from ROC analysis.Results Both models identified similar risk factors for severe PD progression, including older age, tremor-dominant motor subtype, long-term levodopa use, comorbid depression, and occupational pesticide exposure. The RF model outperformed the LR model, achieving an AUC of 0.85, accuracy of 82%, sensitivity of 79%, and specificity of 85%. In comparison, the LR model had an AUC of 0.78, accuracy of 76%, sensitivity of 74%, and specificity of 79%. ROC analysis showed that while both models could distinguish between slow and rapid disease progression, the RF model had stronger discriminatory power, particularly for identifying high-risk patients.Conclusion The RF model provides better predictive accuracy and discriminatory power compared to Logistic Regression in identifying risk factors for severe PD progression. This study highlights the potential of machine learning techniques like Random Forest for early risk stratification and personalized management of PD.
引用
收藏
页数:10
相关论文
共 27 条
  • [21] A clinico-pathological study of subtypes in Parkinson's disease
    Selikhova, M.
    Williams, D. R.
    Kempster, P. A.
    Holton, J. L.
    Revesz, T.
    Lees, A. J.
    [J]. BRAIN, 2009, 132 : 2947 - 2957
  • [22] Classification of Parkinson's disease and its stages using machine learning
    Templeton, John Michael
    Poellabauer, Christian
    Schneider, Sandra
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [23] Challenges in the diagnosis of Parkinson's disease
    Tolosa, Eduardo
    Garrido, Alicia
    Scholz, Sonja W.
    Poewe, Werner
    [J]. LANCET NEUROLOGY, 2021, 20 (05) : 385 - 397
  • [24] A Machine Learning Approach for Early Identification of Prodromal Parkinson's Disease
    Vaish, Anisha
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (06)
  • [25] The neuropsychiatry of Parkinson's disease: advances and challenges
    Weintraub, Daniel
    Aarsland, Dag
    Chaudhuri, Kallol Ray
    Dobkin, Roseanne D.
    Leentjens, Albert F. G.
    Rodriguez-Violante, Mayela
    Schrag, Anette
    [J]. LANCET NEUROLOGY, 2022, 21 (01) : 89 - 102
  • [26] Predictors of rapid eye movement sleep behavior disorder in patients with Parkinson's disease based on random forest and decision tree
    Wu Chong-Wen
    Li Sha-Sha
    Xu, E.
    [J]. PLOS ONE, 2022, 17 (06):
  • [27] A novel ensemble of random forest for assisting diagnosis of Parkinson 's disease on small handwritten dynamics dataset
    Xu, Shoujiang
    Pan, Zhigeng
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 144 (144)