Dynamic Colocation Policies with Reinforcement Learning
ACM Transactions on Architecture and Code Optimization(2020)
摘要
We draw on reinforcement learning frameworks to design and implement an adaptive controller for managing resource contention. During runtime, the controller observes the dynamic system conditions and optimizes control policies that satisfy latency targets yet improve server utilization. We evaluate a physical prototype that guarantees 95th percentile latencies for a search engine and improves server utilization by up to 70%, compared to exclusively reserving servers for interactive services, for varied batch workloads in machine learning.
更多查看译文
关键词
Resource contention,adaptive control,machine learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络