Dynamic Task Scheduling Model and Fault-Tolerant via Queuing Theory

Journal of Computer Research and Development(2016)

引用 1|浏览0
暂无评分
摘要
The design of efficient dynamic task scheduling and fault‐tolerant mechanism is an issue of crucial importance in high‐performance computing field .Most existing methods ,however ,can hardly achieve good scalability on large‐scale system .In this paper , we propose a scalable dynamic task scheduling model via N‐level queuing theory ,which dramatically reduces the programming burden by providing programmer with concise parallel programming framework .On one hand ,we utilize the Poisson process theory to analyze the average wait time of tasks ,and then decide the task layers according to threshold .On the other hand ,we reduce the fault tolerance overhead using region‐aware light‐weight degradation model .Experimental results with Micro Benchmark on Bluelight system with 32 768 cores show that our method achieves good scalability when the tasks take 3.4 s on average and the overhead is just 7 .2% of traditional model .Running on 16 384 cores ,pharmacological application DOCK achieves performance improvement by 34.3% with our scheduling .Moreover ,the results of DOCK show our fault‐tolerant model achieves 3.75% ~ 5.13% performance improvements over traditional mechanism .
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要