Collaborative Entity Extraction and Translation

Current Issues in Linguistic TheoryRecent Advances in Natural Language Processing V(2007)

引用 15|浏览9
暂无评分
摘要
Entity extraction is the task of identifying names and nominal phrases ('mentions') in a text and linking coreferring mentions. We propose the use of a new source of data for improving entity extraction: the information gleaned from large bitexts and captured by a statistical, phrase-based machine translation system. We translate the individual mentions and test properties of the translated mentions, as well as comparing the translations of coreferring mentions. The results provide feedback to improve source language entity extraction. Experiments on Chinese and English show that this approach can significantly improve Chinese entity extraction (2.2%-relative improvement in name tagging F- measure, representing a 15.0% error reduction), as well as Chinese to English entity translation (9.1% relative improvement in F-measure), over state-of-the-art entity extraction and machine translation systems.
更多
查看译文
关键词
joint inference,named entities,machine translation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要