Noisy SMS Machine Translation in Low-Density Languages.
WMT '11: Proceedings of the Sixth Workshop on Statistical Machine Translation(2011)
摘要
This paper presents the system we developed for the 2011 WMT Haitian Creole--English SMS featured translation task. Applying standard statistical machine translation methods to noisy real-world SMS data in a low-density language setting such as Haitian Creole poses a unique set of challenges, which we attempt to address in this work. Along with techniques to better exploit the limited available training data, we explore the benefits of several methods for alleviating the additional noise inherent in the SMS and transforming it to better suite the assumptions of our hierarchical phrase-based model system. We show that these methods lead to significant improvements in BLEU score over the baseline.
更多查看译文
关键词
translation,low-density low-density
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络