Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

arXiv: Computer Vision and Pattern Recognition, Volume abs/1706.02677, 2017.

Cited by: 1190|Bibtex|Views272
EI
Other Links: dblp.uni-trier.de|academic.microsoft.com|arxiv.org

Abstract:

Deep learning thrives with large neural networks and large datasets. However, larger networks and larger datasets result in longer training times that impede research and development progress. Distributed synchronous SGD offers a potential solution to this problem by dividing SGD minibatches over a pool of parallel workers. Yet to make th...More

Code:

Data:

Your rating :
0

 

Tags
Comments