Piecewise linear activations substantially shape the loss surfaces of neural networks
ICLR, 2020.
EI
Weibo:
Abstract:
Understanding the loss surface of a neural network is fundamentally important to the understanding of deep learning. This paper presents how piecewise linear activation functions substantially shape the loss surfaces of neural networks. We first prove that the loss surfaces of many neural networks have infinite spurious local minima, whic...More
Tags
Comments