Statistical Robustness of Markov Chain Monte Carlo Accelerators

Cited by: 5

Authors:

Zhang, Xiangyu [1]

Bashizade, Ramin [1]

Wang, Yicheng [1]

Mukherjee, Sayan [1]

Lebeck, Alvin R. [1]

Affiliations:

[1] Duke Univ, Durham, NC 27706 USA

Source:

ASPLOS XXVI: TWENTY-SIXTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS | 2021

Funding:

U.S. National Science Foundation;

Keywords:

accelerator; statistical machine learning; probabilistic computing; statistical robustness; Markov chain Monte Carlo; INFERENCE; HARDWARE; CONVERGENCE; QUALITY;

DOI:

10.1145/3445814.3446697

Chinese Library Classification (CLC) number:

TP3 [Computing Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Statistical machine learning often uses probabilistic models and algorithms, such as Markov Chain Monte Carlo (MCMC), to solve a wide range of problems. Probabilistic computations, often considered too slow on conventional processors, can be accelerated with specialized hardware by exploiting parallelism and optimizing the design using various approximation techniques. Current methodologies for evaluating correctness of probabilistic accelerators are often incomplete, mostly focusing only on end-point result quality ("accuracy"). It is important for hardware designers and domain experts to look beyond end-point "accuracy" and be aware of how hardware optimizations impact statistical properties. This work takes a first step toward defining metrics and a methodology for quantitatively evaluating correctness of probabilistic accelerators. We propose three pillars of statistical robustness: 1) sampling quality, 2) convergence diagnostic, and 3) goodness of fit. We apply our framework to a representative MCMC accelerator and surface design issues that cannot be exposed using only application end-point result quality. We demonstrate the benefits of this framework to guide design space exploration in a case study showing that statistical robustness comparable to floating-point software can be achieved with limited precision, avoiding floating-point hardware overheads.


Pages: 959-974

Page count: 16

References: 88 in total
