SPARC: Statistical Performance Analysis With Relevance Conclusions

被引:1
|
作者
Tullos, Justin C. [1 ]
Graham, Scott R. [1 ]
Jordan, Jeremy D. [2 ]
Patel, Pranav R. [3 ]
机构
[1] Air Force Inst Technol, Dept Elect & Comp Engn, Wright Patterson AFB, OH 45434 USA
[2] Air Force Inst Technol, Dept Math, Wright Patterson AFB, OH 45434 USA
[3] Sensors Directorate Air Force Res Lab, Wright Patterson AFB, OH 45434 USA
来源
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY | 2021年 / 2卷
关键词
Benchmark testing; Testing; Computer performance; Performance evaluation; Statistical analysis; Program processors; Sociology; Performance benchmarking; RISC-V; relevance testing; statistical analysis; SAMPLE-SIZE DETERMINATION; EQUIVALENCE; PROGRESS; TESTS; POWER;
D O I
10.1109/OJCS.2021.3060658
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of one computer relative to another is traditionally characterized through benchmarking, a practice occasionally deficient in statistical rigor. The performance is often trivialized through simplified measures, such as the approach of central tendency, but doing so risks a loss of perspective of the variability and non-determinism of modern computer systems. Authentic performance evaluations are derived from statistical methods that accurately interpret and assess data. Methods that currently exist within performance comparison frameworks are limited in efficacy, statistical inference is either overtly simplified or altogether avoided. A prevalent criticism from computer performance literature suggests that the results from difference hypothesis testing lack substance. To address this problem, we propose a new framework, SPARC, that pioneers a synthesis of difference and equivalence hypothesis testing to provide relevant conclusions. It is a union of three key components: (i) identifying either superiority or similarity through difference and equivalence hypotheses (ii) scalable methodology (based on the number of benchmarks), and (iii) a conditional feedback loop from test outcomes that produces informative conclusions of relevance, equivalence, trivial, or indeterminant. We present an experimental analysis characterizing the performance of a trio of RISC-V open-source processors to evaluate SPARC and its efficacy compared to similar frameworks.
引用
收藏
页码:117 / 129
页数:13
相关论文
共 50 条
  • [11] Statistical analysis of seismic performance of traditional wooden houses in kyoto
    Hayashi Y.
    Sugino M.
    Hatada R.
    Kimoto Y.
    AIJ Journal of Technology and Design, 2021, 27 (66) : 702 - 707
  • [12] Statistical Analysis and Forecasts of Performance Indicators in the Romanian Healthcare System
    Dragan, Cristian Ovidiu
    Mihai, Laurentiu Stelian
    Popescu, Ana-Maria Camelia
    Buligiu, Ion
    Mirescu, Lucian
    Militaru, Daniel
    HEALTHCARE, 2025, 13 (02)
  • [13] Arc Furnace Performance Validation Thru Modeling, Monitoring and Statistical Analysis
    Morello, Sam
    Gnesda, John
    Dionise, Thomas J.
    2017 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2017,
  • [14] WEIGHTING OF ITEMS IN A TUTORIAL PERFORMANCE EVALUATION INSTRUMENT: STATISTICAL ANALYSIS AND RESULTS
    Lack, Melanie L.
    Bruce, Judith C.
    Becker, Piet J.
    HEALTH SA GESONDHEID, 2009, 14 (01):
  • [15] Impact resistance of ultra-high-performance concrete retrofitted with polyurethane grout material: Experimental investigation and statistical analysis
    Al-shawafi, Ali
    Zhu, Han
    Haruna, S. I.
    Bo, Zhao
    Laqsum, Saleh Ahmed
    Borito, Said Mirgan
    STRUCTURES, 2023, 55 : 185 - 200
  • [16] Assessment of image sensor performance with statistical perception performance analysis
    Franz, Stefan
    Willersinn, Dieter
    Kroschel, Kristian
    INTELLIGENT ROBOTS AND COMPUTER VISION XXVII: ALGORITHMS AND TECHNIQUES, 2010, 7539
  • [17] A Statistical Analysis of the Performance Variability of Read/Write Operations on Parallel File Systems
    Inacio, Eduardo C.
    Barbetta, Pedro A.
    Dantas, Mario A. R.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 2393 - 2397
  • [18] Statistical analysis of the performance and simulation of a two-axis tracking PV system
    Perpinan, O.
    SOLAR ENERGY, 2009, 83 (11) : 2074 - 2085
  • [19] Statistical Analysis Framework to Evaluate Asphalt Concrete Overlay Reflective Cracking Performance
    Haslett, Katie
    Dave, Eshan
    Sias, Jo
    Linder, Ernst
    TRANSPORTATION RESEARCH RECORD, 2022, 2676 (05) : 132 - 146
  • [20] Evaluating the mechanical performance of Flemish bituminous mixtures containing RA by statistical analysis
    Margaritis, Alexandros
    Blom, Johan
    Van den bergh, Wim
    ROAD MATERIALS AND PAVEMENT DESIGN, 2019, 20 : S725 - S739