Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer's Disease Data

被引:6
|
作者
Zhang, Fan [1 ,2 ]
Petersen, Melissa [1 ,2 ]
Johnson, Leigh [1 ,3 ]
Hall, James [1 ,2 ]
O'Bryant, Sid E. [1 ,2 ]
机构
[1] Univ North Texas, Hlth Sci Ctr, Inst Translat Res, Ft Worth, TX 76107 USA
[2] Univ North Texas, Hlth Sci Ctr, Dept Family Med, Ft Worth, TX 76107 USA
[3] Univ North Texas, Hlth Sci Ctr, Dept Pharmacol & Neurosci, Ft Worth, TX 76107 USA
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 13期
基金
美国国家卫生研究院;
关键词
hyperparameter tuning; high-performance computing; machine learning; imbalanced data; mild cognitive impairment; Alzheimer's disease;
D O I
10.3390/app12136670
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Accurate detection is still a challenge in machine learning (ML) for Alzheimer's disease (AD). Class imbalance in imbalanced AD data is another big challenge for machine-learning algorithms working under the assumption that the data are evenly distributed within classes. Here, we present a hyperparameter tuning workflow with high-performance computing (HPC) for imbalanced data related to prevalent mild cognitive impairment (MCI) and AD in the Health and Aging Brain Study-Health Disparities (HABS-HD) project. We applied a single-node multicore parallel mode to hyperparameter tuning of gamma, cost, and class weight using a support vector machine (SVM) model with 10 times repeated fivefold cross-validation. We executed the hyperparameter tuning workflow with R's bigmemory, foreach, and doParallel packages on Texas Advanced Computing Center (TACC)'s Lonestar6 system. The computational time was dramatically reduced by up to 98.2% for the high-performance SVM hyperparameter tuning model, and the performance of cross-validation was also improved (the positive predictive value and the negative predictive value at base rate 12% were, respectively, 16.42% and 92.72%). Our results show that a single-node multicore parallel structure and high-performance SVM hyperparameter tuning model can deliver efficient and fast computation and achieve outstanding agility, simplicity, and productivity for imbalanced data in AD applications.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A Method for Analyzing the Performance Impact of Imbalanced Binary Data on Machine Learning Models
    Zheng, Ming
    Wang, Fei
    Hu, Xiaowen
    Miao, Yuhao
    Cao, Huo
    Tang, Mingjing
    AXIOMS, 2022, 11 (11)
  • [32] Data oversampling and imbalanced datasets: an investigation of performance for machine learning and feature engineering
    Mujahid, Muhammad
    Kina, Erol
    Rustam, Furqan
    Villar, Monica Gracia
    Alvarado, Eduardo Silva
    Diez, Isabel De La Torre
    Ashraf, Imran
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [33] Machine Learning on Imbalanced Data in Credit Risk
    Birla, Shiivong
    Kohli, Kashish
    Dutta, Akash
    7TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE IEEE IEMCON-2016, 2016,
  • [34] Applied machine learning in Alzheimer's disease research: omics, imaging, and clinical data
    Li, Ziyi
    Jiang, Xiaoqian
    Wang, Yizhuo
    Kim, Yejin
    EMERGING TOPICS IN LIFE SCIENCES, 2021, 5 (06) : 765 - 777
  • [35] Stacked Machine Learning Model for Predicting Alzheimer's Disease Based on Genetic Data
    Alatrany, Abbas Saad
    Hussain, Abir
    Jamila, Mustafina
    Al-Jumeiy, Dhiya
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 594 - 598
  • [36] A survey on machine and statistical learning for longitudinal analysis of neuroimaging data in Alzheimer's disease
    Marti-Juan, Gerard
    Sanroma-Guell, Gerard
    Piella, Gemma
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 189
  • [37] A machine learning model for Alzheimer's disease prediction
    Rani, Pooja
    Lamba, Rohit
    Sachdeva, Ravi Kumar
    Kumar, Karan
    Iwendi, Celestine
    IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2024, 9 (02) : 125 - 134
  • [38] Diagnosis of Alzheimer's Disease using Machine Learning
    Lodha, Priyanka
    Talele, Ajay
    Degaonkar, Kishori
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [39] Predicting Alzheimer's Disease with Interpretable Machine Learning
    Jia, Maoni
    Wu, Yafei
    Xiang, Chaoyi
    Fang, Ya
    DEMENTIA AND GERIATRIC COGNITIVE DISORDERS, 2023, 52 (04) : 249 - 257
  • [40] An Efficient Machine Learning Method to Solve Imbalanced Data in Metabolic Disease Prediction
    Cecchini, Vania
    Nguyen, Thanh-Phuong
    Pfau, Thomas
    De landtsheer, Sebastien
    Sauter, Thomas
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 357 - 361