Direct Error Rate Minimization for Statistical Machine Translation.

WMT '12: Proceedings of the Seventh Workshop on Statistical Machine Translation (2012)

Abstract
Minimum error rate training (MERT) is often the preferred method for optimizing the parameters of statistical machine translation systems. MERT minimizes error rate by using a surrogate representation of the search space, such as N-best lists or hypergraphs, which offer only an incomplete view of that space. In our work, we instead minimize error rate directly by integrating the decoder into the minimizer. This approach yields two benefits. First, the function being optimized is the true error rate. Second, it lets us optimize parameters of translation systems other than standard linear model features, such as the distortion limit. Since integrating the decoder into the minimizer is often too slow to be practical, we also exploit statistical significance tests to accelerate the search by quickly discarding unpromising models. Experiments with a phrase-based system show that our approach is scalable, and that optimizing the parameters that MERT cannot handle improves translation results.
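The idea of discarding unpromising models early can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `toy_sentence_error` is a hypothetical stand-in for running a real decoder and scoring its output, the error is averaged per sentence rather than computed at corpus level, and a paired bootstrap test is swapped in because the abstract does not say which significance test is used.

```python
import random
from statistics import mean


def toy_sentence_error(distortion_limit, sentence_id):
    """Hypothetical stand-in for decoding one sentence and scoring it.

    Pretends that a distortion limit of 6 gives the lowest error.
    """
    rng = random.Random(sentence_id * 1000 + distortion_limit)
    base = 0.30 + 0.02 * abs(distortion_limit - 6)
    return base + rng.uniform(-0.05, 0.05)


def significantly_worse(errs_a, errs_b, rng, samples=200, alpha=0.05):
    """Paired bootstrap: is A worse than B in at least 1 - alpha of resamples?"""
    n = len(errs_a)
    worse = 0
    for _ in range(samples):
        idx = [rng.randrange(n) for _ in range(n)]
        if sum(errs_a[i] for i in idx) > sum(errs_b[i] for i in idx):
            worse += 1
    return worse / samples >= 1 - alpha


def race(candidates, num_sentences=200, batch=20, seed=0):
    """Evaluate candidates batch by batch, dropping significantly worse ones."""
    rng = random.Random(seed)
    errors = {c: [] for c in candidates}
    alive = sorted(candidates)
    for start in range(0, num_sentences, batch):
        # "Decode" only the next batch, and only for surviving candidates.
        for c in alive:
            errors[c].extend(
                toy_sentence_error(c, s) for s in range(start, start + batch)
            )
        best = min(alive, key=lambda c: mean(errors[c]))
        alive = [
            c for c in alive
            if c == best or not significantly_worse(errors[c], errors[best], rng)
        ]
        if len(alive) == 1:
            break
    best = min(alive, key=lambda c: mean(errors[c]))
    return best, alive


best, survivors = race([2, 4, 6, 8, 10])
print("selected distortion limit:", best, "survivors:", survivors)
```

The saving comes from the fact that a discarded candidate is never decoded on the remaining sentences. In a real setting the toy scoring function would be replaced by an actual decoder call, and the test and thresholds would need to be chosen to match the evaluation metric.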
Keywords
search space, error rate, minimum error rate training, true error rate, parameter optimization, statistical machine translation system, statistical significance test, translation results, direct error rate minimization