Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer's Disease Data

被引:6
|
作者
Zhang, Fan [1 ,2 ]
Petersen, Melissa [1 ,2 ]
Johnson, Leigh [1 ,3 ]
Hall, James [1 ,2 ]
O'Bryant, Sid E. [1 ,2 ]
机构
[1] Univ North Texas, Hlth Sci Ctr, Inst Translat Res, Ft Worth, TX 76107 USA
[2] Univ North Texas, Hlth Sci Ctr, Dept Family Med, Ft Worth, TX 76107 USA
[3] Univ North Texas, Hlth Sci Ctr, Dept Pharmacol & Neurosci, Ft Worth, TX 76107 USA
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 13期
基金
美国国家卫生研究院;
关键词
hyperparameter tuning; high-performance computing; machine learning; imbalanced data; mild cognitive impairment; Alzheimer's disease;
D O I
10.3390/app12136670
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Accurate detection is still a challenge in machine learning (ML) for Alzheimer's disease (AD). Class imbalance in imbalanced AD data is another big challenge for machine-learning algorithms working under the assumption that the data are evenly distributed within classes. Here, we present a hyperparameter tuning workflow with high-performance computing (HPC) for imbalanced data related to prevalent mild cognitive impairment (MCI) and AD in the Health and Aging Brain Study-Health Disparities (HABS-HD) project. We applied a single-node multicore parallel mode to hyperparameter tuning of gamma, cost, and class weight using a support vector machine (SVM) model with 10 times repeated fivefold cross-validation. We executed the hyperparameter tuning workflow with R's bigmemory, foreach, and doParallel packages on Texas Advanced Computing Center (TACC)'s Lonestar6 system. The computational time was dramatically reduced by up to 98.2% for the high-performance SVM hyperparameter tuning model, and the performance of cross-validation was also improved (the positive predictive value and the negative predictive value at base rate 12% were, respectively, 16.42% and 92.72%). Our results show that a single-node multicore parallel structure and high-performance SVM hyperparameter tuning model can deliver efficient and fast computation and achieve outstanding agility, simplicity, and productivity for imbalanced data in AD applications.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study
    Bach, Philipp
    Schacht, Oliver
    Chernozhukov, Victor
    Klaassen, Sven
    Spindler, Martin
    CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 1065 - 1117
  • [22] Diabetes Prediction Using Machine Learning with Feature Engineering and Hyperparameter Tuning
    El Massari, Hakim
    Gherabi, Noreddine
    Qanouni, Fatima
    Mhammedi, Sajida
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 171 - 179
  • [23] Hyperparameter Tuning for Machine Learning Algorithms Used for Arabic Sentiment Analysis
    Elgeldawi, Enas
    Sayed, Awny
    Galal, Ahmed R.
    Zaki, Alaa M.
    INFORMATICS-BASEL, 2021, 8 (04):
  • [24] Automatic Hyperparameter Tuning of Machine Learning Models under Time Constraints
    Wang, Zhen
    Agung, Mulya
    Egawa, Ryusuke
    Suda, Reiji
    Takizawa, Hiroyuki
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4967 - 4973
  • [25] A Statistical Approach to Hyperparameter Tuning of Deep Learning for Construction Machine Classification
    Ottoni, Andre Luiz C.
    Novo, Marcela S.
    Oliveira, Marcos S.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (04) : 5117 - 5128
  • [26] A Statistical Approach to Hyperparameter Tuning of Deep Learning for Construction Machine Classification
    André Luiz C. Ottoni
    Marcela S. Novo
    Marcos S. Oliveira
    Arabian Journal for Science and Engineering, 2024, 49 : 5117 - 5128
  • [27] Multiresponse surface methodology for hyperparameter tuning to optimize multiple performance measures of statistical and machine learning algorithms
    Lin, Chang-Yun
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2023, 39 (07) : 2995 - 3013
  • [28] A high-generalizability machine learning framework for predicting the progression of Alzheimer’s disease using limited data
    Caihua Wang
    Yuanzhong Li
    Yukihiro Tsuboshita
    Takuya Sakurai
    Tsubasa Goto
    Hiroyuki Yamaguchi
    Yuichi Yamashita
    Atsushi Sekiguchi
    Hisateru Tachimori
    npj Digital Medicine, 5
  • [29] A high-generalizability machine learning framework for predicting the progression of Alzheimer's disease using limited data
    Wang, Caihua
    Li, Yuanzhong
    Tsuboshita, Yukihiro
    Sakurai, Takuya
    Goto, Tsubasa
    Yamaguchi, Hiroyuki
    Yamashita, Yuichi
    Sekiguchi, Atsushi
    Tachimori, Hisateru
    NPJ DIGITAL MEDICINE, 2022, 5 (01)
  • [30] Imbalanced Data Problem in Machine Learning: A Review
    Altalhan, Manahel
    Algarni, Abdulmohsen
    Alouane, Monia Turki-Hadj
    IEEE ACCESS, 2025, 13 : 13686 - 13699