Fine-grained Morphosyntactic Analysis and Generation Tools for More Than One Thousand Languages.
LREC (2020)
Abstract
Exploiting the broad translation of the Bible into the world's languages, we train and distribute morphosyntactic tools for approximately one thousand languages, vastly outstripping previous distributions of tools devoted to the processing of inflectional morphology. Evaluation of the tools on a subset of available inflectional dictionaries demonstrates strong initial models, supplemented and improved through ensembling and dictionary-based reranking. Likewise, a novel type-to-token based evaluation metric allows us to confirm that models generalize well across rare and common forms alike.
Keywords
morphology, low-resource, tools