Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks

被引:5
作者
Tousi, Ashkan [1 ]
Lujan, Mikel [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England
基金
英国工程与自然科学研究理事会;
关键词
Benchmark testing; Predictive models; Data models; Feature extraction; Software; Hardware; Analytical models; Machine learning; performance analysis; predictive models; SPEC CPU2017; supervised learning; REGRESSION; SELECTION;
D O I
10.1109/ACCESS.2022.3142240
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Simulation-based performance prediction is cumbersome and time-consuming. An alternative approach is to consider supervised learning as a means of predicting the performance scores of Standard Performance Evaluation Corporation (SPEC) benchmarks. SPEC CPU2017 contains a public dataset of results obtained by executing 43 standardised performance benchmarks organised into 4 suites on various system configurations. This paper analyses the dataset and aims to answer the following questions: I) can we accurately predict the SPEC results based on the configurations provided in the dataset, without having to actually run the benchmarks? II) what are the most important hardware and software features? III) what are the best predictive models and hyperparameters, in terms of prediction error and time? and IV) can we predict the performance of future systems using the past data? We present how to prepare data, select features, tune hyperparameters and evaluate regression models based on Multi-Task Elastic-Net, Decision Tree, Random Forest, and Multi-Layer Perceptron neural networks estimators. Feature selection is performed in three steps: removing zero variance features, removing highly correlated features, and Recursive Feature Elimination based on different feature importance metrics: elastic-net coefficients, tree-based importance measures and Permutation Importance. We select the best models using grid search on the hyperparameter space, and finally, compare and evaluate the performance of the models. We show that tree-based models with the original 29 features provide accurate predictions with an average error of less than 4%. The average error of faster Decision Tree and Random Forest models with 10 features is still below 6% and 5% respectively.
引用
收藏
页码:11994 / 12011
页数:18
相关论文
共 50 条
  • [41] Comparative Analysis of Machine Learning Models for Predictive Analysis of Machine Failures
    Baldovino, Renann G.
    Camacho, Ken Sammuel I.
    Chua-Unsu, Megan Victoria Hillary Y.
    Go, Jed Leonard C.
    Munsayac, Francisco Emmanuel T. Jr, III
    Bugtai, Nilo T.
    9TH INTERNATIONAL CONFERENCE ON MECHATRONICS ENGINEERING, ICOM 2024, 2024, : 288 - 293
  • [42] Comparative Analysis of Various Machine Learning Approaches for Bitcoin Price Prediction
    Muvvala, Abhishek
    Chivukula, Rohit
    Lakshmi, T. Jaya
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SIGNAL PROCESSING AND ARTIFICIAL INTELLIGENCE, ASPAI' 2020, 2020, : 161 - 164
  • [43] Comparative Analysis of Deep Learning and Statistical Models for Air Pollutants Prediction in Urban Areas
    Naz, Fareena
    Mccann, Conor
    Fahim, Muhammad
    Cao, Tuan-Vu
    Hunter, Ruth
    Viet, Nguyen Trung
    Nguyen, Long D.
    Duong, Trung Q.
    IEEE ACCESS, 2023, 11 : 64016 - 64025
  • [44] The Interplay of Sampling and Machine Learning for Software Performance Prediction
    Kaltenecker, Christian
    Grebhahn, Alexander
    Siegmund, Norbert
    Apel, Sven
    IEEE SOFTWARE, 2020, 37 (04) : 58 - 66
  • [45] Interpretable Machine Learning Models for PISA Results in Mathematics
    Gomez-Talal, Ismael
    Bote-Curiel, Luis
    Luis Rojo-Alvarez, Jose
    IEEE ACCESS, 2025, 13 : 27371 - 27397
  • [46] Crop Classification and Yield Prediction Using Robust Machine Learning Models for Agricultural Sustainability
    Badshah, Abid
    Alkazemi, Basem Yousef
    Din, Fakhrud
    Zamli, Kamal Z.
    Haris, Muhammad
    IEEE ACCESS, 2024, 12 : 162799 - 162813
  • [47] Intelligent computational techniques of machine learning models for demand analysis and prediction
    Naveen Sundar G.
    Anushka Xavier K.
    Narmadha D.
    Martin Sagayam K.
    Amir Anton Jone A.
    Pomplun M.
    Dang H.
    Int. J. Intell. Inf. Database Syst., 2023, 1 (39-61): : 39 - 61
  • [48] Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems
    Sekeroglu, Boran
    Ever, Yoney Kirsal
    Dimililer, Kamil
    Al-Turjman, Fadi
    DATA INTELLIGENCE, 2022, 4 (03) : 620 - 652
  • [49] Prediction of Therapeutic Peptides Using Machine Learning: Computational Models, Datasets, and Feature Encodings
    Attique, Muhammad
    Farooq, Muhammad Shoaib
    Khelifi, Adel
    Abid, Adnan
    IEEE ACCESS, 2020, 8 (08): : 148570 - 148594
  • [50] Comparative Analysis of Machine Learning Algorithms for Rainfall Prediction
    Patil, Rudragoud
    Bedekar, Gayatri
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 833 - 842