
Getting More Data for Low-resource Morphological Inflection - Language Models and Data Augmentation.

LREC (2020)

Abstract
We investigate the effect of data augmentation on low-resource morphological inflection. We compare two settings: the pure low-resource setting, in which only 100 annotated word forms are available, and the augmented setting, in which we use the original training set and 1000 unlabeled word forms to generate 1000 artificial inflected forms. Evaluating on the SIGMORPHON 2018 dataset, we observe that using the better of these two models reduces the error rate of the state-of-the-art model by 6%, while for our baseline model the error reduction is 17%.
Keywords
inflection, encoder-decoder, abstract paradigms, language models, data augmentation