Statistical Robustness of Markov Chain Monte Carlo Accelerators

被引:5
作者
Zhang, Xiangyu [1 ]
Bashizade, Ramin [1 ]
Wang, Yicheng [1 ]
Mukherjee, Sayan [1 ]
Lebeck, Alvin R. [1 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
来源
ASPLOS XXVI: TWENTY-SIXTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS | 2021年
基金
美国国家科学基金会;
关键词
accelerator; statistical machine learning; probabilistic computing; statistical robustness; markov chain monte carlo; INFERENCE; HARDWARE; CONVERGENCE; QUALITY;
D O I
10.1145/3445814.3446697
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Statistical machine learning often uses probabilistic models and algorithms, such as Markov Chain Monte Carlo (MCMC), to solve a wide range of problems. Probabilistic computations, often considered too slow on conventional processors, can be accelerated with specialized hardware by exploiting parallelism and optimizing the design using various approximation techniques. Current methodologies for evaluating correctness of probabilistic accelerators are often incomplete, mostly focusing only on end-point result quality ("accuracy"). It is important for hardware designers and domain experts to look beyond end-point "accuracy" and be aware of how hardware optimizations impact statistical properties. This work takes a first step toward defining metrics and a methodology for quantitatively evaluating correctness of probabilistic accelerators. We propose three pillars of statistical robustness: 1) sampling quality, 2) convergence diagnostic, and 3) goodness of fit. We apply our framework to a representative MCMC accelerator and surface design issues that cannot be exposed using only application end-point result quality. We demonstrate the benefits of this framework to guide design space exploration in a case study showing that statistical robustness comparable to floating-point software can be achieved with limited precision, avoiding floating-point hardware overheads.
引用
收藏
页码:959 / 974
页数:16
相关论文
共 88 条
  • [1] [Anonymous], 2015, P ISPD, DOI DOI 10.1145/2717764.2717783
  • [2] [Anonymous], 2017, P 34 INT C MACHINE L
  • [3] Thompson MB, 2010, Arxiv, DOI arXiv:1011.0175
  • [4] A Database and Evaluation Methodology for Optical Flow
    Baker, Simon
    Scharstein, Daniel
    Lewis, J. P.
    Roth, Stefan
    Black, Michael J.
    Szeliski, Richard
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 92 (01) : 1 - 31
  • [5] CACTI 7: New Tools for Interconnect Exploration in Innovative Off-Chip Memories
    Balasubramonian, Rajeev
    Kahng, Andrew B.
    Muralimanohar, Naveen
    Shafiee, Ali
    Srinivas, Vaishnav
    [J]. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2017, 14 (02)
  • [6] AcMC2: Accelerated Markov Chain Monte Carlo for Probabilistic Models
    Banerjee, Subho S.
    Kalbarczyk, Zbigniew T.
    Iyer, Ravishankar K.
    [J]. TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 515 - 528
  • [7] Barnard Aubrey, 2019, THESIS U WISCONSINS
  • [8] STOCHASTIC STEREO MATCHING OVER SCALE
    BARNARD, ST
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 1989, 3 (01) : 17 - 32
  • [9] Betancourt M, 2018, Arxiv, DOI [arXiv:1701.02434, 10.48550/arXiv.1701.02434, DOI 10.48550/ARXIV.1701.02434]
  • [10] General methods for monitoring convergence of iterative simulations
    Brooks, SP
    Gelman, A
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (04) : 434 - 455