Unnormalized Exponential And Neural Network Language Models

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Abstract
Model M, an exponential class-based language model, and neural network language models (NNLMs) have outperformed word n-gram language models over a wide range of tasks. However, these gains come at the cost of vastly increased computation when calculating word probabilities. For both models, the bulk of this computation involves evaluating the softmax function over a large word or class vocabulary to ensure that probabilities sum to 1. In this paper, we study unnormalized variants of Model M and NNLMs, in which the softmax function is simply omitted; accordingly, model training must be modified to encourage scores to sum approximately to 1. We demonstrate n-gram lookups up to 35 times faster with unnormalized models than with their normalized counterparts, while still yielding state-of-the-art performance in WER (10.2 on the English broadcast news rt04 set).
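The abstract's central idea (dropping the softmax at lookup time and instead regularizing training so that unnormalized scores sum close to 1) can be illustrated with a short sketch. The snippet below is a minimal illustration, not the paper's implementation: the model shapes, the logsumexp-based self-normalization penalty, and the weight alpha are all assumptions, intended only to show why an unnormalized lookup avoids the O(|V|) softmax.

```python
import numpy as np

# Illustrative sketch (not the paper's code): compare a normalized softmax
# lookup with an unnormalized lookup, and show one common way to encourage
# self-normalization during training via a (log Z)^2 penalty on the
# partition function Z. All names, shapes, and alpha are hypothetical.

rng = np.random.default_rng(0)
vocab_size = 50_000
hidden = rng.standard_normal(256)               # context representation h
W = rng.standard_normal((vocab_size, 256)) * 0.01
b = np.zeros(vocab_size)

def normalized_prob(word_id):
    """Standard output layer: softmax over the full vocabulary, O(|V|)."""
    logits = W @ hidden + b
    logits -= logits.max()                      # numerical stability
    Z = np.exp(logits).sum()
    return np.exp(logits[word_id]) / Z

def unnormalized_prob(word_id):
    """Unnormalized lookup: one dot product, no softmax, O(1) in |V|."""
    return np.exp(W[word_id] @ hidden + b[word_id])

def self_normalization_penalty(alpha=0.1):
    """Training-time regularizer pushing log Z toward 0, so that the
    unnormalized scores approximately sum to 1 at test time."""
    logits = W @ hidden + b
    m = logits.max()
    log_Z = np.log(np.exp(logits - m).sum()) + m
    return alpha * log_Z ** 2
```

At test time only unnormalized_prob is evaluated, which is where the large lookup speedup over the softmax-normalized model comes from; the penalty term is added to the training objective so that the omitted normalizer stays near 1.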
Keywords
Model M, unnormalized models, neural network language models, fast lookup