Improving Utilization of Dataflow Architectures Through Software and Hardware Co-Design.

Euro-Par(2023)

引用 0|浏览13
暂无评分
摘要
Dataflow architectures can achieve much better performance and higher efficiency than general-purpose core, approaching the performance of a specialized design while retaining programmability. However, dataflow architectures often face challenges of low utilization of computational resources if the application algorithms are irregular. In this paper, we propose a software and hardware co-design technique that makes both regular and irregular applications efficient on dataflow architectures. First, we dispatch instructions between dataflow graph (DFG) nodes to ensure load balance. Second, we decouple threads within the DFG nodes into consecutive pipeline stages and provide architectural support. By time-multiplexing these stages on each PE, dataflow hardware can achieve much higher utilization and performance. We show that our method improves performance by gmean 2.55 × (and up to 3.71 × ) over a conventional dataflow architecture (and by gmean 1.80 × over Plasticine) on a variety of challenging applications.
更多
查看译文
关键词
dataflow architectures,utilization,software,co-design
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要