SPARC: Statistical Performance Analysis With Relevance Conclusions

被引:1
|
作者
Tullos, Justin C. [1 ]
Graham, Scott R. [1 ]
Jordan, Jeremy D. [2 ]
Patel, Pranav R. [3 ]
机构
[1] Air Force Inst Technol, Dept Elect & Comp Engn, Wright Patterson AFB, OH 45434 USA
[2] Air Force Inst Technol, Dept Math, Wright Patterson AFB, OH 45434 USA
[3] Sensors Directorate Air Force Res Lab, Wright Patterson AFB, OH 45434 USA
来源
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY | 2021年 / 2卷
关键词
Benchmark testing; Testing; Computer performance; Performance evaluation; Statistical analysis; Program processors; Sociology; Performance benchmarking; RISC-V; relevance testing; statistical analysis; SAMPLE-SIZE DETERMINATION; EQUIVALENCE; PROGRESS; TESTS; POWER;
D O I
10.1109/OJCS.2021.3060658
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of one computer relative to another is traditionally characterized through benchmarking, a practice occasionally deficient in statistical rigor. The performance is often trivialized through simplified measures, such as the approach of central tendency, but doing so risks a loss of perspective of the variability and non-determinism of modern computer systems. Authentic performance evaluations are derived from statistical methods that accurately interpret and assess data. Methods that currently exist within performance comparison frameworks are limited in efficacy, statistical inference is either overtly simplified or altogether avoided. A prevalent criticism from computer performance literature suggests that the results from difference hypothesis testing lack substance. To address this problem, we propose a new framework, SPARC, that pioneers a synthesis of difference and equivalence hypothesis testing to provide relevant conclusions. It is a union of three key components: (i) identifying either superiority or similarity through difference and equivalence hypotheses (ii) scalable methodology (based on the number of benchmarks), and (iii) a conditional feedback loop from test outcomes that produces informative conclusions of relevance, equivalence, trivial, or indeterminant. We present an experimental analysis characterizing the performance of a trio of RISC-V open-source processors to evaluate SPARC and its efficacy compared to similar frameworks.
引用
收藏
页码:117 / 129
页数:13
相关论文
共 50 条
  • [1] Statistical analysis of top performers in sport with emphasis on the relevance of outliers
    Tony Aitken
    Sports Engineering, 2004, 7 (2) : 75 - 88
  • [2] Benchmarking and performance measurement: a statistical analysis
    Moffett, Sandra
    Anderson-Gillespie, Karen
    McAdam, Rodney
    BENCHMARKING-AN INTERNATIONAL JOURNAL, 2008, 15 (04) : 368 - 381
  • [3] Statistical analysis in MSW collection performance assessment
    Teixeira, Carlos Afonso
    Avelino, Catarina
    Ferreira, Fatima
    Bentes, Isabel
    WASTE MANAGEMENT, 2014, 34 (09) : 1584 - 1594
  • [4] Statistical Analysis of Anthropometric and Physiologic Performance of the Hand
    Serban, I.
    Baritz, M.
    Rosca, I. C.
    Cotoros, L. D.
    INTERNATIONAL CONFERENCE ON ADVANCEMENTS OF MEDICINE AND HEALTH CARE THROUGH TECHNOLOGY, 2011, 36 : 380 - 383
  • [5] Statistical significance versus clinical relevance
    van Rijn, Marieke H. C.
    Bech, Anneke
    Bouyer, Jean
    van den Brand, Jan A. J. G.
    NEPHROLOGY DIALYSIS TRANSPLANTATION, 2017, 32 : 6 - 12
  • [6] Statistical Performance Analysis and Estimation for Parallel Multimedia Processing
    Min Li
    Tanja Van Achteren
    Erik Brockmeyer
    Francky Catthoor
    Journal of Signal Processing Systems, 2010, 58 : 105 - 116
  • [7] Statistical Performance Analysis and Estimation for Parallel Multimedia Processing
    Li, Min
    Van Achteren, Tanja
    Brockmeyer, Erik
    Catthoor, Francky
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 58 (02): : 105 - 116
  • [8] Statistical analysis of doubletalk detection for calibration and performance evaluation
    Gordy, James D.
    Goubran, Rafik A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1035 - 1043
  • [9] Statistical Performance Analysis in a GPU
    Salonikidis, Dionisis
    Manolakis, Dimitris E.
    2022 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2022, : 129 - 132
  • [10] Impact of pulse time uncertainty on synchronous average: Statistical analysis and relevance to rotating machinery diagnosis
    Camerini, V
    Coppotelli, G.
    Bendisch, S.
    Kiehn, D.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 129 : 308 - 336