Non-Vocalised Arabic Word Classifications Based On Mining Affixes Features

Sari Awwad,Mustafa Hammad, Safaa Al-Haj Saleh

Journal of Computer Applications in Technology(2019)

引用 0|浏览1
暂无评分
摘要
Arabic word classification is a challenging problem owing to the cursive nature of the language and modulation marks. The existing approaches are based on databases and dictionaries to classify Arabic words, which makes classification process operation slow. Therefore, this paper investigates Arabic word classification in the non-vocalised Arabic text by solely using affixes features and explores the extent to which we can rely on these features to determine Arabic word class without the need for dictionaries or word lists. The proposed approach is mainly based on affixes features and Support Vector Machine (SVM). A Fisher encoding is also applied to remove any redundancy and to preserve important information. Our approach is tested on a data set of two main classes (noun and verb) and different six noun sub-classes. The results indicate that this approach is helpful in achieving a success rate approaching 64% of the total words in the articles used in this study. The unsuccessful classification rate appears because there are no affixes in the target Arabic word or some original characters are considered as affixes.
更多
查看译文
关键词
affixes features, word classification, SVM, support vector machine, Fisher encoding, Arabic language
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要