Enabling Fast Preemption via Dual-Kernel Support on GPUs
2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC), 2017
Abstract
To support QoS on resource-limited mobile systems, we present a fast preemption mechanism for GPUs. First, we introduce a dual-kernel execution model that supports fine-grained preemption, together with a resource allocation policy that avoids resource fragmentation. Second, we propose a preemption victim selection scheme that reduces throughput overhead while satisfying a required preemption latency. Evaluations show that our approach comes within 2% of the ideal preemption scheme in terms of deadline violations. Furthermore, it improves GPU resource utilization during preemption by 2.93x on average over the prior technique.
Keywords
dual-kernel support, QoS, resource-limited mobile systems, fast preemption mechanism, dual-kernel execution model, resource allocation policy, resource fragmentation, GPU resource utilization