An Experimental Comparison Between Genetic Algorithm and Particle Swarm Optimization in Spark Performance Tuning

ETCD@ASPLOS(2017)

Abstract
Spark, the most popular in-memory computing framework, has a number of performance-critical configuration parameters. Manually tuning these parameters for optimal performance is impractical because the tuning space is huge. Search algorithms such as the genetic algorithm can be used to find good configurations automatically. However, several such algorithms exist, and it is unclear which one is better suited to tuning Spark configuration parameters. To address this issue, we experimentally compare two search algorithms, the Genetic Algorithm (GA) and Particle Swarm Optimization (PSO), in searching for the optimal configurations of Spark applications. We make several interesting observations. First, PSO converges 2x faster than GA, but the performance achieved with the configurations found by PSO is slightly worse than that achieved with the configurations found by GA. Second, PSO scales better than GA as the number of configuration parameters grows. Finally, PSO is more robust than GA across different search runs. Based on these observations, we recommend PSO for Spark performance tuning.
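To make the two search strategies concrete, the following is a minimal Python sketch of a PSO loop and a GA loop over a normalized parameter space. It is not the paper's implementation: the toy cost function, the `TARGET` vector, the four-dimensional space, and all hyperparameters (`w`, `c1`, `c2`, `mut_rate`, population sizes) are assumptions chosen for illustration. In a real tuner, each coordinate would be mapped to a Spark configuration parameter and the cost would be a measured run time.

```python
import random

# Hypothetical search space: each parameter is scaled to [0, 1].
DIM = 4
TARGET = [0.3, 0.7, 0.5, 0.2]  # made-up optimum of the toy cost


def toy_cost(x):
    """Stand-in for a measured run time: squared distance to TARGET."""
    return sum((xi - ti) ** 2 for xi, ti in zip(x, TARGET))


def clip(v):
    """Keep a coordinate inside the normalized range [0, 1]."""
    return min(1.0, max(0.0, v))


def pso(cost, dim, particles=20, iters=50, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Global-best PSO with inertia weight w (assumed settings)."""
    rng = random.Random(seed)
    pos = [[rng.random() for _ in range(dim)] for _ in range(particles)]
    vel = [[0.0] * dim for _ in range(particles)]
    pbest = [p[:] for p in pos]
    pbest_cost = [cost(p) for p in pos]
    g = min(range(particles), key=lambda i: pbest_cost[i])
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]
    for _ in range(iters):
        for i in range(particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Pull toward the particle's own best and the swarm's best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] = clip(pos[i][d] + vel[i][d])
            c = cost(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i][:], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i][:], c
    return gbest, gbest_cost


def ga(cost, dim, pop_size=20, gens=50, mut_rate=0.1, seed=0):
    """Elitist GA: keep the better half, refill by crossover and mutation."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(gens):
        pop.sort(key=cost)
        elite = pop[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, dim)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Gaussian mutation applied per gene with probability mut_rate.
            child = [clip(x + rng.gauss(0, 0.1)) if rng.random() < mut_rate
                     else x for x in child]
            children.append(child)
        pop = elite + children
    best = min(pop, key=cost)
    return best, cost(best)


if __name__ == "__main__":
    print("PSO:", pso(toy_cost, DIM))
    print("GA: ", ga(toy_cost, DIM))
```

On a smooth toy cost like this one, both loops should approach `TARGET`; the sketch says nothing about the convergence, scalability, or robustness differences reported in the abstract, which depend on real Spark workloads and measured run times.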