SPARC: Statistical Performance Analysis With Relevance Conclusions

被引:1
|
作者
Tullos, Justin C. [1 ]
Graham, Scott R. [1 ]
Jordan, Jeremy D. [2 ]
Patel, Pranav R. [3 ]
机构
[1] Air Force Inst Technol, Dept Elect & Comp Engn, Wright Patterson AFB, OH 45434 USA
[2] Air Force Inst Technol, Dept Math, Wright Patterson AFB, OH 45434 USA
[3] Sensors Directorate Air Force Res Lab, Wright Patterson AFB, OH 45434 USA
来源
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY | 2021年 / 2卷
关键词
Benchmark testing; Testing; Computer performance; Performance evaluation; Statistical analysis; Program processors; Sociology; Performance benchmarking; RISC-V; relevance testing; statistical analysis; SAMPLE-SIZE DETERMINATION; EQUIVALENCE; PROGRESS; TESTS; POWER;
D O I
10.1109/OJCS.2021.3060658
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of one computer relative to another is traditionally characterized through benchmarking, a practice occasionally deficient in statistical rigor. The performance is often trivialized through simplified measures, such as the approach of central tendency, but doing so risks a loss of perspective of the variability and non-determinism of modern computer systems. Authentic performance evaluations are derived from statistical methods that accurately interpret and assess data. Methods that currently exist within performance comparison frameworks are limited in efficacy, statistical inference is either overtly simplified or altogether avoided. A prevalent criticism from computer performance literature suggests that the results from difference hypothesis testing lack substance. To address this problem, we propose a new framework, SPARC, that pioneers a synthesis of difference and equivalence hypothesis testing to provide relevant conclusions. It is a union of three key components: (i) identifying either superiority or similarity through difference and equivalence hypotheses (ii) scalable methodology (based on the number of benchmarks), and (iii) a conditional feedback loop from test outcomes that produces informative conclusions of relevance, equivalence, trivial, or indeterminant. We present an experimental analysis characterizing the performance of a trio of RISC-V open-source processors to evaluate SPARC and its efficacy compared to similar frameworks.
引用
收藏
页码:117 / 129
页数:13
相关论文
共 50 条
  • [31] Effects of Filler Characteristics on the Performance of Asphalt Mastic: A Statistical Analysis of the Laboratory Testing Results
    Zhou, Sheng Bo
    Liu, Shengjie
    Xiang, Yiming
    INTERNATIONAL JOURNAL OF CIVIL ENGINEERING, 2018, 16 (9A) : 1175 - 1183
  • [32] Effects of Filler Characteristics on the Performance of Asphalt Mastic: A Statistical Analysis of the Laboratory Testing Results
    Sheng Bo Zhou
    Shengjie Liu
    Yiming Xiang
    International Journal of Civil Engineering, 2018, 16 : 1175 - 1183
  • [33] Design and statistical analysis of method transfer studies for biotechnology products
    Shen, Meiyu
    Xu, Lixin
    BIOANALYSIS, 2017, 9 (08) : 595 - 600
  • [34] On the Use of Cox Regression for Statistical Analysis of Fatigue Life Results
    Ulu, K. Narynbek
    Huneau, B.
    Verron, E.
    Beranger, A. S.
    Heuillet, P.
    JOURNAL OF TESTING AND EVALUATION, 2020, 48 (02) : 1439 - 1451
  • [35] On the Application of Statistical Analysis for Interpretation of Experimental Results in Environmental Microbiology
    Kallistova, A. Yu
    Sabrekov, A. F.
    Goncharov, V. M.
    Pimenov, N., V
    Glagolev, M., V
    MICROBIOLOGY, 2019, 88 (02) : 232 - 239
  • [36] Assessment of factors responsible for polymer electrolyte membrane fuel cell electrode performance by statistical analysis
    Velayutham, G.
    Dhathathreyan, K. S.
    Rajalakshmi, N.
    Raman, D. Sampangi
    JOURNAL OF POWER SOURCES, 2009, 191 (01) : 10 - 15
  • [37] Hierarchical statistical analysis of performance variation for continuous-time delta-sigma modulators
    Tang, Hua
    VLSI-SOC 2007: PROCEEDINGS OF THE 2007 IFIP WG 10.5 INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION, 2007, : 37 - 41
  • [38] Stormwater Filtration Performance for the Ecosol Storm Pit (Class 2): Statistical Analysis of Field Data
    Pooya Nejad, Fereydoon
    Zecchin, Aaron C.
    WATER, 2020, 12 (10)
  • [39] Statistical Analysis of Coal Beneficiation Performance in a Continuous Air Dense Medium Fluidized Bed Separator
    Azimi, Ebrahim
    Karimipour, Shayan
    Xu, Zhenghe
    Szymanski, Jozef
    Gupta, Rajender
    INTERNATIONAL JOURNAL OF COAL PREPARATION AND UTILIZATION, 2017, 37 (01) : 12 - 32
  • [40] Performance Monitoring for Grate-kiln-cooler Process Based on Quality Prediction and Statistical Analysis
    Yang, Gui-ming
    Fan, Xiao-hui
    Huang, Xiao-xian
    Chen, Xu-ling
    7TH INTERNATIONAL SYMPOSIUM ON HIGH-TEMPERATURE METALLURGICAL PROCESSING, 2016, : 377 - 384