Performance Interference-Aware Vertical Elasticity for Cloud-Hosted Latency-Sensitive Applications

Shashank Shekhar,Hamzah Abdel-Aziz,Anirban Bhattacharjee,Aniruddha S. Gokhale,Xenofon D. Koutsoukos

2018 IEEE 11th International Conference on Cloud Computing (CLOUD)（2018）

引用 36|浏览74

暂无评分

摘要

Elastic auto-scaling in cloud platforms has primarily used horizontal scaling by assigning application instances to distributed resources. Owing to rapid advances in hardware, cloud providers are now seeking vertical elasticity before attempting horizontal scaling to provide elastic auto-scaling for applications. Vertical elasticity solutions must, however, be cognizant of performance interference that stems from multi-tenant collocated applications since interference significantly impacts application quality-of-service (QoS) properties, such as latency. The problem becomes more pronounced for latency-sensitive applications that demand strict QoS properties. Further exacerbating the problem are variations in workloads, which make it hard to determine the right kinds of timely resource adaptations for latency-sensitive applications. To address these challenges and overcome limitations in existing offline approaches, we present an online, data-driven approach which utilizes Gaussian Processes-based machine learning techniques to build runtime predictive models of the performance of the system under different levels of interference. The predictive online models are then used in dynamically adapting to the workload variability by vertically auto-scaling co-located applications such that performance interference is minimized and QoS properties of latency-sensitive applications are met.

查看译文

关键词

Cloud computing,Data center,Multi-tenancy,Workload variability,Latency sensitive,Latency critical,Performance interference,Vertical elasticity,Virtualization,Linux containers,Docker,Online predictive models,Gaussian processes

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要