Unleashing the high-performance and low-power of multi-core DSPs for general-purpose HPC

High Performance Computing, Networking, Storage and Analysis(2012)

引用 40|浏览0
暂无评分
摘要
Take a multicore Digital Signal Processor (DSP) chip designed for cellular base stations and radio network controllers, add floating-point capabilities to support 4G networks, and out of thin air a HPC engine is born. The potential for HPC is clear: It promises 128 GFLOPS (single precision) for 10 Watts; It is used in millions of network related devices and hence benefits from economies of scale; It should be simpler to program than a GPU. Simply put, it is fast, green, and cheap. But is it easy to use? In this paper, we show how this potential can be applied to general-purpose high performance computing, more specifically to dense matrix computations, without major changes in existing codes and methodologies, and with excellent performance and power consumption numbers.
更多
查看译文
关键词
digital signal processing chips,mathematics computing,matrix algebra,multi-threading,multiprocessing systems,program compilers,4G network,GPU,cellular base station,compiler support,dense matrix computation,digital signal processor,economies of scale,floating-point capability,general-purpose HPC engine,graphics processing unit,high performance computing,multicore DSP chip,power consumption number,radio network controller,thread library,DSPs,Low-power architectures,linear algebra
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要