AcMC2: Accelerated Markov Chain Monte Carlo for Probabilistic Models

Cited by: 12

Authors:

Banerjee, Subho S. [1]

Kalbarczyk, Zbigniew T. [1]

Iyer, Ravishankar K. [1]

Affiliations:

[1] Univ Illinois, Champaign, IL 61820 USA

Source:

TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV) | 2019

Funding:

National Science Foundation (USA)

Keywords:

Accelerator; Markov Chain Monte Carlo; Probabilistic Graphical Models; Probabilistic Programming; ALGORITHMS; INFERENCE; ARCHITECTURES

DOI:

10.1145/3297858.3304019

Chinese Library Classification (CLC):

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Probabilistic models (PMs) are ubiquitously used across a variety of machine learning applications. They have been shown to successfully integrate structural prior information about data and effectively quantify uncertainty to enable the development of more powerful, interpretable, and efficient learning algorithms. This paper presents AcMC2, a compiler that transforms PMs into optimized hardware accelerators (for use in FPGAs or ASICs) that utilize Markov chain Monte Carlo methods to infer and query a distribution of posterior samples from the model. The compiler analyzes statistical dependencies in the PM to drive several optimizations to maximally exploit the parallelism and data locality available in the problem. We demonstrate the use of AcMC2 to implement several learning and inference tasks on a Xilinx Virtex-7 FPGA. AcMC2-generated accelerators provide a 47-100x improvement in runtime performance over a 6-core IBM Power8 CPU and an 8-18x improvement over an NVIDIA K80 GPU. This corresponds to a 753-1600x improvement over the CPU and 248-463x over the GPU in performance-per-watt terms.

Pages: 515-528

Page count: 14

