Stochastic Gradient Coding for Flexible Straggler Mitigation in Distributed Learning
2019 IEEE Information Theory Workshop (ITW), 2019
Abstract
We consider distributed gradient descent in the presence of stragglers. Recent work on gradient coding and approximate gradient coding has shown how to add redundancy in distributed gradient descent to guarantee convergence even if some workers are slow or non-responsive. In this work we propose a new type of approximate gradient coding, which we call Stochastic Gradient Coding (SGC). The idea of SGC is very simple: we distribute data points redundantly to workers according to a good combinatorial design. We prove that the convergence rate of SGC mirrors that of batched Stochastic Gradient Descent (SGD) for the $\ell_2$ loss function, and show how the convergence rate can improve with the redundancy. We show empirically that SGC requires only a small amount of redundancy to handle a large number of stragglers, and that it can outperform existing approximate gradient codes when the number of stragglers is large.
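To make the redundant-assignment idea concrete, here is a minimal Python sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: each data point is replicated to `d` workers chosen uniformly at random (standing in for the combinatorial design), each worker straggles independently with probability `p`, the master rescales the received gradients by `1 / (d * (1 - p))` so that the estimate is unbiased under that straggling model, and the function names are hypothetical.

```python
import numpy as np

def assign_points(n_points, n_workers, d, rng):
    """Assumed placement: each point goes to d distinct workers chosen at random."""
    return [rng.choice(n_workers, size=d, replace=False) for _ in range(n_points)]

def sgc_gradient(theta, X, y, assignment, alive, d, p):
    """Gradient estimate for the mean squared loss using only non-straggling workers.

    alive is a boolean array over workers; a point contributes once per
    surviving replica, and the sum is rescaled by 1 / (d * (1 - p)).
    """
    residual = X @ theta - y                      # shape (n_points,)
    per_point = 2.0 * residual[:, None] * X       # gradient of each squared residual
    copies = np.array([alive[w].sum() for w in assignment])  # surviving replicas
    total = (copies[:, None] * per_point).sum(axis=0)
    return total / (d * (1.0 - p) * len(X))

def run_sgc(X, y, n_workers=20, d=3, p=0.5, lr=0.1, steps=500, seed=0):
    """Gradient descent where each step only hears from the workers that responded."""
    rng = np.random.default_rng(seed)
    assignment = assign_points(len(X), n_workers, d, rng)
    theta = np.zeros(X.shape[1])
    for _ in range(steps):
        alive = rng.random(n_workers) >= p        # fresh straggler pattern each step
        theta -= lr * sgc_gradient(theta, X, y, assignment, alive, d, p)
    return theta
```

With `p = 0` and every worker alive, `sgc_gradient` reduces to the exact full-batch gradient; raising `d` lowers the chance that a point has no surviving replica in a given step, which is the intuition behind convergence improving with redundancy.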
Keywords
approximate gradient coding, convergence rate, stochastic gradient descent, flexible straggler mitigation, stochastic gradient coding, distributed gradient descent-based machine learning algorithm, parallelizable gradient descent