Energy-Efficient Recognition And Mining Processor Using Scalable Effort Design

2013 IEEE CUSTOM INTEGRATED CIRCUITS CONFERENCE (CICC)(2013)

引用 11|浏览46
暂无评分
摘要
A domain-specific processor for energy-efficient execution of Recognition and Data Mining (RM) workloads is presented. The processor consists of a 2-D array of processing elements and a streaming memory hierarchy and interconnect network that are customized to efficiently execute dominant computational kernels (matrix-vector multiplication, vector dot product, L1 norm, and L2 norm) from a wide range of RM algorithms. To achieve further energy efficiency, the RM processor utilizes scalable effort design, a technique that exploits the inherent resilience of algorithms to inexactness in their constituent computations. The scalable effort RM processor adopts a cross-layer approach by combining scaling mechanisms at the algorithm, architecture, and circuit levels, to create a desirable trade off between energy consumption and output quality. Measurements from the implemented chip in 65nm CMOS indicate processing efficiencies of 569 GOPS/W - 4.68 TOPS/W. The use of scalable effort design achieves energy savings of 1.2-2.3X with no loss in output quality, and 2X-20X with modest reduction in quality.
更多
查看译文
关键词
data mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要