Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer's Disease Data

被引:6
|
作者
Zhang, Fan [1 ,2 ]
Petersen, Melissa [1 ,2 ]
Johnson, Leigh [1 ,3 ]
Hall, James [1 ,2 ]
O'Bryant, Sid E. [1 ,2 ]
机构
[1] Univ North Texas, Hlth Sci Ctr, Inst Translat Res, Ft Worth, TX 76107 USA
[2] Univ North Texas, Hlth Sci Ctr, Dept Family Med, Ft Worth, TX 76107 USA
[3] Univ North Texas, Hlth Sci Ctr, Dept Pharmacol & Neurosci, Ft Worth, TX 76107 USA
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 13期
基金
美国国家卫生研究院;
关键词
hyperparameter tuning; high-performance computing; machine learning; imbalanced data; mild cognitive impairment; Alzheimer's disease;
D O I
10.3390/app12136670
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Accurate detection is still a challenge in machine learning (ML) for Alzheimer's disease (AD). Class imbalance in imbalanced AD data is another big challenge for machine-learning algorithms working under the assumption that the data are evenly distributed within classes. Here, we present a hyperparameter tuning workflow with high-performance computing (HPC) for imbalanced data related to prevalent mild cognitive impairment (MCI) and AD in the Health and Aging Brain Study-Health Disparities (HABS-HD) project. We applied a single-node multicore parallel mode to hyperparameter tuning of gamma, cost, and class weight using a support vector machine (SVM) model with 10 times repeated fivefold cross-validation. We executed the hyperparameter tuning workflow with R's bigmemory, foreach, and doParallel packages on Texas Advanced Computing Center (TACC)'s Lonestar6 system. The computational time was dramatically reduced by up to 98.2% for the high-performance SVM hyperparameter tuning model, and the performance of cross-validation was also improved (the positive predictive value and the negative predictive value at base rate 12% were, respectively, 16.42% and 92.72%). Our results show that a single-node multicore parallel structure and high-performance SVM hyperparameter tuning model can deliver efficient and fast computation and achieve outstanding agility, simplicity, and productivity for imbalanced data in AD applications.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison
    Pfob, Andre
    Lu, Sheng-Chieh
    Sidey-Gibbons, Chris
    BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [42] Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison
    André Pfob
    Sheng-Chieh Lu
    Chris Sidey-Gibbons
    BMC Medical Research Methodology, 22
  • [43] Automated Hyperparameter Tuning and Ensemble Machine Learning Approach for Network Traffic Classification
    Chen, Liwei
    Sun, Xiu
    Li, Yuchan
    Jaseemuddin, Muhammad
    Kazi, Baha Uddin
    19TH IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, BMSB 2024, 2024, : 690 - 695
  • [44] Imbalanced cholesterol metabolism in Alzheimer's disease
    Zhao Xue-shan
    Peng Juan
    Wu Qi
    Ren Zhong
    Pan Li-hong
    Tang Zhi-han
    Jiang Zhi-sheng
    Wang Gui-xue
    Liu Lu-shan
    CLINICA CHIMICA ACTA, 2016, 456 : 107 - 114
  • [45] Model Performance Prediction for Hyperparameter Optimization of Deep Learning Models Using High Performance Computing and Quantum Annealing
    Garcia Amboage, Juan Pablo
    Wulff, Eric
    Girone, Maria
    Pena, Tomas F.
    26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS, CHEP 2023, 2024, 295
  • [46] Integrating Data Selection and Extreme Learning Machine for Imbalanced Data
    Mahdiyah, Umi
    Irawan, M. Isa
    Imah, Elly Matul
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2015), 2015, 59 : 221 - 229
  • [47] Optimizing health data analytics in fog computing using hyperparameter tuning and grid search
    Singh, Kiran Deep
    Singh, Prabh Deep
    Verma, Rohan
    Taneja, Harsh
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2024, 45 (02): : 429 - 438
  • [48] MagmaDNN: Towards High-Performance Data Analytics and Machine Learning for Data-Driven Scientific Computing
    Nichols, Daniel
    Tomov, Nathalie-Sofia
    Betancourt, Frank
    Tomov, Stanimire
    Wong, Kwai
    Dongarra, Jack
    HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 490 - 503
  • [49] Autocalibration experiments using machine learning and high performance computing
    Sloboda, M.
    Swayne, D. A.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2013, 40 : 302 - 315
  • [50] A multimodal learning machine framework for Alzheimer's disease diagnosis based on neuropsychological and neuroimaging data
    Zhang, Meiwei
    Cui, Qiushi
    Lu, Yang
    Yu, Weihua
    Li, Wenyuan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 197