Deep Boltzmann Machines and the Centering Trick.
Neural Networks: Tricks of the Trade (2nd ed.), pp.621-637, (2012)
Deep Boltzmann machines are in principle powerful models for extracting the hierarchical structure of data. Unfortunately, attempts to train layers jointly (without greedy layer-wise pretraining) have been largely unsuccessful. We propose a modification of the learning algorithm that initially recenters the output of the activation func...More
PPT (Upload PPT)