An introduction to processor-time-optimal systolic arrays

Highly Parallel Computations (2001)

Abstract
We consider computations suitable for systolic arrays, often called regular array computations or systems of uniform recurrence relations. In such computations, the tasks to be computed are viewed as the nodes of a directed acyclic graph (dag), where the data dependencies are represented as arcs. A processor-time-minimal schedule measures the minimum number of processors needed to extract the maximum parallelism from the dag. We present a technique for finding a lower bound on the number of processors needed to achieve a given schedule of an algorithm represented as a dag. The application of this technique is illustrated with a tensor product computation. We then consider the free schedule of algorithms for matrix product, Gaussian elimination, and transitive closure. For each problem, we provide a time-minimal processor schedule that meets the computed processor lower bounds, including the one for tensor product.
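As a rough illustration of the ideas in the abstract, the following sketch builds a dag for matrix product, computes its free schedule (each node runs as soon as all of its predecessors have finished), and counts the largest number of nodes that share a time step. That count is a lower bound on the processors needed to realize this schedule, since all of those nodes must execute simultaneously. This is a brute-force enumeration, not the lower-bound technique the paper presents. The dependence vectors (1,0,0), (0,1,0), (0,0,1) on an n x n x n index cube are an assumed, commonly used formulation, and the function names `matrix_product_dag`, `free_schedule`, and `processor_lower_bound` are hypothetical.

```python
from collections import Counter
from graphlib import TopologicalSorter  # Python 3.9+
from itertools import product


def matrix_product_dag(n):
    """Map each node (i, j, k) to its predecessors.

    Assumption: uniform dependence vectors (1,0,0), (0,1,0), (0,0,1)
    on an n x n x n index cube; the abstract does not spell these out.
    """
    deps = {}
    for i, j, k in product(range(n), repeat=3):
        preds = []
        if i > 0:
            preds.append((i - 1, j, k))
        if j > 0:
            preds.append((i, j - 1, k))
        if k > 0:
            preds.append((i, j, k - 1))
        deps[(i, j, k)] = preds
    return deps


def free_schedule(deps):
    """Give each node the earliest step at which all predecessors are done."""
    step = {}
    # static_order() yields every node after all of its predecessors.
    for node in TopologicalSorter(deps).static_order():
        step[node] = 1 + max((step[p] for p in deps[node]), default=0)
    return step


def processor_lower_bound(step):
    """Largest number of nodes scheduled in the same time step."""
    return max(Counter(step.values()).values())


if __name__ == "__main__":
    n = 3
    schedule = free_schedule(matrix_product_dag(n))
    print("time steps:", max(schedule.values()))
    print("processors needed at the busiest step:", processor_lower_bound(schedule))
```

Under these assumptions, node (i, j, k) lands on step i + j + k + 1, so the free schedule takes 3n - 2 steps, and the busiest step sits in the middle of that range.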
Keywords
systolic array, time-minimal processor schedule, computed processor, Gaussian elimination, minimum number, tensor product computation, tensor product, lower bound, free schedule, matrix product, processor-time-minimal schedule