Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features.

Empirical Methods in Natural Language Processing(2007)

引用 21|浏览26
暂无评分
摘要
This paper discusses automatic determination of case in Arabic. This task is a major source of errors in full diacritization of Arabic. We use a gold-standard syntactic tree, and obtain an error rate of about 4.2%, with a machine learning based system outperforming a system using hand-written rules. A careful error analysis suggests that when we account for annotation errors in the gold standard, the error rate drops to 0.8%, with the hand-written rules outperforming the machine learning-based system.
更多
查看译文
关键词
complex linguistic behavior,complex linguistic features,arabic,determining case
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要