Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks

Cited by: 5
Authors
Tousi, Ashkan [1]
Lujan, Mikel [1]
Affiliations
[1] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Benchmark testing; Predictive models; Data models; Feature extraction; Software; Hardware; Analytical models; Machine learning; Performance analysis; SPEC CPU2017; Supervised learning; Regression; Selection
DOI
10.1109/ACCESS.2022.3142240
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Simulation-based performance prediction is cumbersome and time-consuming. An alternative is to use supervised learning to predict the performance scores of Standard Performance Evaluation Corporation (SPEC) benchmarks. SPEC CPU2017 provides a public dataset of results obtained by executing 43 standardised performance benchmarks, organised into 4 suites, on various system configurations. This paper analyses the dataset and aims to answer the following questions: (I) can we accurately predict the SPEC results from the configurations provided in the dataset, without actually running the benchmarks? (II) what are the most important hardware and software features? (III) what are the best predictive models and hyperparameters, in terms of prediction error and time? and (IV) can we predict the performance of future systems using past data? We present how to prepare data, select features, tune hyperparameters and evaluate regression models based on Multi-Task Elastic-Net, Decision Tree, Random Forest, and Multi-Layer Perceptron neural network estimators. Feature selection is performed in three steps: removing zero-variance features, removing highly correlated features, and Recursive Feature Elimination based on different feature importance metrics (elastic-net coefficients, tree-based importance measures and Permutation Importance). We select the best models using grid search over the hyperparameter space, and finally compare and evaluate the performance of the models. We show that tree-based models with the original 29 features provide accurate predictions with an average error of less than 4%. The average error of the faster Decision Tree and Random Forest models with 10 features is still below 6% and 5%, respectively.
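The first two feature-selection steps named in the abstract (removing zero-variance features, then removing highly correlated features) can be illustrated with a minimal sketch using only the Python standard library. This is not the authors' implementation: the column names, the toy values, and the 0.95 correlation threshold are all assumptions made for illustration, and the abstract does not state which threshold or tie-breaking rule the paper uses. Recursive Feature Elimination and grid search are not covered here.

```python
from math import sqrt
from statistics import mean, pvariance


def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def drop_zero_variance(columns):
    """Step 1: keep only features whose values are not all identical."""
    return {name: vals for name, vals in columns.items() if pvariance(vals) > 0}


def drop_highly_correlated(columns, threshold=0.95):
    """Step 2: greedily keep a feature only if its absolute correlation
    with every feature kept so far is below the threshold.
    The threshold value is an assumption, not taken from the paper."""
    kept = {}
    for name, vals in columns.items():
        if all(abs(pearson(vals, other)) < threshold for other in kept.values()):
            kept[name] = vals
    return kept


# Hypothetical toy data: one list of values per feature, one entry per system.
columns = {
    "cores": [4, 8, 16, 32],
    "threads": [8, 16, 32, 64],          # exactly 2 x cores, so redundant
    "base_mhz": [3600, 3000, 2400, 2900],
    "os_is_linux": [1, 1, 1, 1],         # constant, so zero variance
}

selected = drop_highly_correlated(drop_zero_variance(columns))
print(list(selected))
```

In this toy example the constant column is removed in the first step and `threads` is removed in the second because it is perfectly correlated with `cores`. Running the zero-variance filter first also guards the correlation step against division by zero.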
Pages: 11994-12011
Page count: 18