CoopMC: Algorithm-Architecture Co-Optimization for Markov Chain Monte Carlo Accelerators

2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2022

Abstract
Bayesian machine learning is well suited to applications that must make high-risk decisions from limited, noisy, or unlabeled data, because it offers strong data efficiency and principled uncertainty estimation. Building on previous efforts, this work presents CoopMC, an algorithm-architecture co-optimization for developing more efficient MCMC-based Bayesian inference accelerators. CoopMC uses dynamic normalization (DyNorm), LUT-based exponential kernels (TableExp), and log-domain kernel fusion (LogFusion) to reduce computational precision and shrink ALU area by 7.5× without a noticeable loss in model performance. In addition, a tree-based Gibbs sampler (TreeSampler) improves hardware runtime from $\mathcal{O}(N)$ to $\mathcal{O}(\log N)$, an 8.7× speedup, and achieves 1.9× better area efficiency than the existing state-of-the-art Gibbs sampling architecture. These methods are evaluated on 10 diverse workloads spanning 3 types of Bayesian models, demonstrating applicability to many Bayesian algorithms. In an end-to-end case study, these optimizations achieve a 33% logic area reduction, 62% power reduction, and a 1.53× speedup over previous state-of-the-art end-to-end MCMC accelerators.
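As a rough illustration of how a tree-structured sampler can reach $\mathcal{O}(\log N)$ per draw instead of an $\mathcal{O}(N)$ linear scan, the sketch below samples an index from unnormalized categorical weights (as in a Gibbs conditional) by descending a binary tree of partial sums. This is a minimal software analogue of the general technique under assumed details, not the paper's hardware TreeSampler design; the SumTree class and its layout are illustrative.

import random

class SumTree:
    # Binary tree of partial sums over N unnormalized weights.
    # Drawing an index proportional to its weight takes O(log N)
    # comparisons instead of an O(N) scan of the cumulative sums.
    def __init__(self, weights):
        self.n = len(weights)
        self.tree = [0.0] * (2 * self.n)
        for i, w in enumerate(weights):        # leaves hold the weights
            self.tree[self.n + i] = w
        for i in range(self.n - 1, 0, -1):     # internal nodes hold subtree sums
            self.tree[i] = self.tree[2 * i] + self.tree[2 * i + 1]

    def sample(self):
        # Descend from the root, steering left or right by comparing a
        # uniform draw against the left subtree's sum.
        u = random.random() * self.tree[1]
        i = 1
        while i < self.n:
            left = self.tree[2 * i]
            if u < left:
                i = 2 * i
            else:
                u -= left
                i = 2 * i + 1
        return i - self.n

    def update(self, idx, weight):
        # Changing one weight only touches its O(log N) ancestors.
        i = self.n + idx
        self.tree[i] = weight
        i //= 2
        while i >= 1:
            self.tree[i] = self.tree[2 * i] + self.tree[2 * i + 1]
            i //= 2

# Example: index i is returned with probability weights[i] / sum(weights).
sampler = SumTree([0.1, 2.0, 0.5, 1.4])
print(sampler.sample())

Here sampler.sample() uses a number of comparisons proportional to the tree depth, and update() refreshes a single weight in $\mathcal{O}(\log N)$, which is the complexity behavior the TreeSampler claim refers to.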
Keywords
Algorithm-Architecture Co-Design, Hardware Accelerator, Bayesian Machine Learning, Markov Chain Monte Carlo