Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features.

Nizar Habash,Ryan Gabbard,Owen Rambow,Seth Kulick,Mitchell P. Marcus

Empirical Methods in Natural Language Processing（2007）

引用 21|浏览26

暂无评分

摘要

This paper discusses automatic determination of case in Arabic. This task is a major source of errors in full diacritization of Arabic. We use a gold-standard syntactic tree, and obtain an error rate of about 4.2%, with a machine learning based system outperforming a system using hand-written rules. A careful error analysis suggests that when we account for annotation errors in the gold standard, the error rate drops to 0.8%, with the hand-written rules outperforming the machine learning-based system.

查看译文

关键词

complex linguistic behavior,complex linguistic features,arabic,determining case

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要