Combining Advanced Methods in Japanese-Vietnamese Neural Machine Translation

2018 10th International Conference on Knowledge and Systems Engineering (KSE)(2018)

引用 7|浏览53
暂无评分
摘要
Neural machine translation (NMT) systems have recently obtained state-of-the-art in many machine translation systems between popular language pairs because of the availability of data. For low-resourced language pairs, there are few researches in this field due to the lack of bilingual data. In this paper, we attempt to build the first NMT systems for a low-resourced language pair: Japanese-Vietnamese. We have also shown significant improvements when combining advanced methods to reduce the adverse impacts of data sparsity and improve the quality of NMT systems. In addition, we proposed a variant of Byte-Pair Encoding algorithm to perform effective word segmentation for Vietnamese texts and alleviate the rare-word problem that persists in NMT systems.
更多
查看译文
关键词
Neural Machine Translation,Japanese - Vietnamese Machine Translation,Low-resourced,Byte-Pair Encoding,Back Translation,Mix-Source
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要