Energy-Based Models in Document Recognition and Computer Vision

ICDAR-1(2007)

Abstract
The Machine Learning and Pattern Recognition communities are facing two challenges: solving the normalization problem, and solving the deep learning problem. The normalization problem is related to the difficulty of training probabilistic models over large spaces while keeping them properly normalized. In recent years, the ML and Natural Language communities have devoted considerable efforts to circumventing this problem by developing "unnormalized" learning models for tasks in which the output is highly structured (e.g. English sentences). This class of models was in fact originally developed during the 1990s in the handwriting recognition community, and includes Graph Transformer Networks, Conditional Random Fields, Hidden Markov SVMs, and Maximum Margin Markov Networks. We describe these models within the unifying framework of "Energy-Based Models" (EBM).

The Deep Learning Problem is related to the issue of training all the levels of a recognition system (e.g. segmentation, feature extraction, recognition, etc.) in an integrated fashion. We first consider "traditional" methods for deep learning, such as convolutional networks and backpropagation, and show that, although they produce very low error rates for handwriting and object recognition, they require many training samples. We show that using unsupervised learning to initialize the layers of a deep network dramatically reduces the required number of training samples, particularly for such tasks as the recognition of everyday objects at the category level.
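
The normalization problem mentioned in the abstract can be illustrated with a small sketch. It is not taken from the paper: the quadratic `energy` function, the four-element label set, and the names `predict` and `gibbs_probabilities` are all hypothetical choices made for illustration. The point it tries to show is that choosing the lowest-energy output needs no normalization constant, whereas turning energies into probabilities requires a sum over every candidate output, which becomes impractical when the output space is large and structured.

```python
import math


def energy(x, y):
    """Toy energy (hypothetical): lower values mean x and y are more compatible."""
    return (x - y) ** 2


def predict(x, labels):
    """Energy-based inference: return the label with the lowest energy.

    No normalization constant is computed here.
    """
    return min(labels, key=lambda y: energy(x, y))


def gibbs_probabilities(x, labels, beta=1.0):
    """Normalized alternative: P(y | x) = exp(-beta * E(x, y)) / Z.

    Z is a sum over every candidate output, which is the costly step
    when the set of outputs is very large.
    """
    weights = [math.exp(-beta * energy(x, y)) for y in labels]
    z = sum(weights)  # partition function over all candidate labels
    return [w / z for w in weights]


labels = [0, 1, 2, 3]
best = predict(1.2, labels)
probs = gibbs_probabilities(1.2, labels)
```

With only four labels the sum for `z` is trivial; the sketch is meant only to show where that sum appears, not how any of the models named in the abstract avoid it.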
Keywords
energy-based models,document recognition,computer vision,graph transformation,natural language processing,probabilistic model,handwriting recognition,natural language,feature extraction,back propagation,pattern recognition,conditional random field,conditional random fields,unsupervised learning,deep learning,machine learning,error rate,object recognition,backpropagation