On the Importance of Word Boundaries in Character-level Neural Machine Translation

Duygu Ataman
Duygu Ataman
Mattia Antonino Di Gangi
Mattia Antonino Di Gangi

Proceedings of the 3rd Workshop on Neural Generation and Translation, 2019.

Cited by: 2|Bibtex|Views77|DOI:https://doi.org/10.18653/v1/d19-5619
Other Links: academic.microsoft.com|arxiv.org

Abstract:

Neural Machine Translation (NMT) models generally perform translation using a fixed-size lexical vocabulary, which is an important bottleneck on their generalization capability and overall translation quality. The standard approach to overcome this limitation is to segment words into subword units, typically using some external tools wi...More

Code:

Data:

Full Text
Your rating :
0

 

Tags
Comments