Katyusha: The First Direct Acceleration of Stochastic Gradient Methods

Symposium on the Theory of Computing (2017)

Abstract
Nesterov’s momentum trick is famously known for accelerating gradient descent, and has been proven useful in building fast iterative algorithms. However, in the stochastic setting, counterexamples exist and prevent Nesterov’s momentum from providing similar acceleration, even if the underlying problem is convex. We introduce Katyusha, a direct, primal-only stochastic gradient method that fixes this issue. It has a provably accelerated convergence rate in convex (off-line) stochastic optimization. The main ingredient is Katyusha momentum, a novel “negative momentum” on top of Nesterov’s momentum that can be incorporated into a variance-reduction-based algorithm and speed it up. Since variance reduction has been successfully applied to a growing list of practical problems, our paper suggests that in each such case, one could potentially give Katyusha a hug.
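To make the idea concrete, below is a minimal, illustrative Python sketch of a Katyusha-style update for smooth, strongly convex finite-sum minimization (1/n) * sum_i f_i(x), combining an SVRG-style variance-reduced gradient estimator with Nesterov-type momentum (weight tau1) and the "negative momentum" pull toward the snapshot (weight tau2). The helper names grad_i, full_grad, and the specific constants are assumptions chosen to roughly match the paper's suggested parameters, not the authors' reference implementation.

import numpy as np

def katyusha(grad_i, full_grad, x0, n, L, sigma, epochs=20, m=None, rng=None):
    """Sketch of Katyusha for smooth, sigma-strongly convex finite sums.
    grad_i(i, x): gradient of the i-th component f_i at x (hypothetical helper).
    full_grad(x): exact gradient of f at x (hypothetical helper)."""
    rng = np.random.default_rng() if rng is None else rng
    m = 2 * n if m is None else m                     # inner iterations per epoch
    tau2 = 0.5                                        # Katyusha "negative momentum" weight
    tau1 = min(np.sqrt(m * sigma / (3.0 * L)), 0.5)   # Nesterov-style momentum weight
    alpha = 1.0 / (3.0 * tau1 * L)                    # step size for the z-sequence

    x_tilde = x0.copy()                               # snapshot point
    y, z = x0.copy(), x0.copy()
    for _ in range(epochs):
        mu = full_grad(x_tilde)                       # full gradient at the snapshot
        y_sum, w_sum = np.zeros_like(x0), 0.0
        for j in range(m):
            # Coupling step: negative momentum pulls the iterate back to x_tilde.
            x = tau1 * z + tau2 * x_tilde + (1.0 - tau1 - tau2) * y
            i = rng.integers(n)
            # SVRG-style variance-reduced gradient estimator.
            g = mu + grad_i(i, x) - grad_i(i, x_tilde)
            z = z - alpha * g                         # mirror-descent-like step
            y = x - g / (3.0 * L)                     # gradient-descent-like step
            w = (1.0 + alpha * sigma) ** j            # weights for the epoch average
            y_sum += w * y
            w_sum += w
        x_tilde = y_sum / w_sum                       # next snapshot: weighted average of y's
    return x_tilde

In this sketch the tau2 term is what the abstract calls negative momentum: a fixed "magnet" toward the snapshot x_tilde that cancels the error accumulated by the stochastic gradient estimator, which is what allows the Nesterov-type extrapolation through z to be accelerated safely.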
Keywords
acceleration, momentum, first-order method, stochastic gradient descent, accelerated gradient descent