Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer's Disease Data

Cited by: 6
Authors
Zhang, Fan [1, 2]
Petersen, Melissa [1, 2]
Johnson, Leigh [1, 3]
Hall, James [1, 2]
O'Bryant, Sid E. [1, 2]
Affiliations
[1] Univ North Texas, Hlth Sci Ctr, Inst Translat Res, Ft Worth, TX 76107 USA
[2] Univ North Texas, Hlth Sci Ctr, Dept Family Med, Ft Worth, TX 76107 USA
[3] Univ North Texas, Hlth Sci Ctr, Dept Pharmacol & Neurosci, Ft Worth, TX 76107 USA
Source
APPLIED SCIENCES-BASEL | 2022, Vol. 12, No. 13
Funding
National Institutes of Health (USA);
Keywords
hyperparameter tuning; high-performance computing; machine learning; imbalanced data; mild cognitive impairment; Alzheimer's disease;
DOI
10.3390/app12136670
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Accurate detection remains a challenge in machine learning (ML) for Alzheimer's disease (AD). Class imbalance in AD data is a further challenge, because many machine-learning algorithms assume that the data are evenly distributed across classes. Here, we present a hyperparameter tuning workflow with high-performance computing (HPC) for imbalanced data related to prevalent mild cognitive impairment (MCI) and AD in the Health and Aging Brain Study-Health Disparities (HABS-HD) project. We applied a single-node multicore parallel mode to tune the gamma, cost, and class-weight hyperparameters of a support vector machine (SVM) model, using 10-times-repeated fivefold cross-validation. We executed the workflow with R's bigmemory, foreach, and doParallel packages on the Texas Advanced Computing Center (TACC) Lonestar6 system. Computational time for the high-performance SVM hyperparameter tuning model was reduced by up to 98.2%, and cross-validation performance also improved (at a base rate of 12%, the positive predictive value and the negative predictive value were 16.42% and 92.72%, respectively). Our results show that a single-node multicore parallel structure and a high-performance SVM hyperparameter tuning model can deliver efficient, fast computation and achieve outstanding agility, simplicity, and productivity for imbalanced data in AD applications.
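The abstract reports predictive values "at base rate 12%". As a minimal sketch of what that adjustment means, the Python snippet below applies Bayes' rule to turn a classifier's sensitivity and specificity into positive and negative predictive values at a chosen prevalence. The function name `predictive_values` and the sensitivity and specificity inputs are hypothetical illustrations; the abstract does not report them, and this is not the authors' R code.

```python
def predictive_values(sensitivity, specificity, base_rate):
    """Return (PPV, NPV) for a test applied at the given prevalence (base rate)."""
    # Expected fractions of the population in each confusion-matrix cell.
    true_pos = sensitivity * base_rate
    false_neg = (1.0 - sensitivity) * base_rate
    true_neg = specificity * (1.0 - base_rate)
    false_pos = (1.0 - specificity) * (1.0 - base_rate)

    ppv = true_pos / (true_pos + false_pos)  # P(disease | positive prediction)
    npv = true_neg / (true_neg + false_neg)  # P(no disease | negative prediction)
    return ppv, npv


# Hypothetical sensitivity/specificity, evaluated at the 12% base rate
# named in the abstract. These inputs are NOT taken from the paper.
ppv, npv = predictive_values(sensitivity=0.80, specificity=0.70, base_rate=0.12)
print(f"PPV = {ppv:.2%}, NPV = {npv:.2%}")
```

With a low base rate, false positives from the large negative class dominate the positive predictions, which is why PPV can stay modest even when NPV is high.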
Pages: 11