Design and Development of a Named Entity Recognizer for an Agglutinative Language

mag(2004)

引用 27|浏览44
暂无评分
摘要
This paper presents the conclusions reached from the development of a system for Named Entity recognition in written Basque. The system was designed in four steps: first, the development of a recognizer based on linguistic information represented on finitestate-transducers; second, the generation of semi-automatically annotated corpora from the result of these transducers; third, the achievement of the best possible recognizer by training different ML techniques on these corpora; and finally, the combination of the different recognizers obtained. Being Basque an agglutinative language, a linguistic preprocess previous to these steps was required.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要